Preview

Vestnik NSU. Series: Information Technologies

Advanced search

Web Platform for Folklore Research Based on Domain Ontology

https://doi.org/10.25205/1818-7900-2021-19-2-53-64

Abstract

In this paper, we take a close look at a web platform that provides the tools necessary for working with folklore materials and conducting scientific research based on them. Folklore studies consist of working with audio and video materials, which contain the reproduction of elements of folk art in national languages, creating specific text recordings with translation and comments, written in a public language, and building a picture of the worlds based on available resources. To structure and present this content, we use an ontology-based approach, which allows linguists to describe not only the resources, but also subject knowledge in the Semantic Web style, i.e. using hierarchies of classes, objects and relationships between them. The main feature of folklore research is the need for synchronization of translations, which is achieved by creating a parallel corpora of texts, and the ability to label texts with entities of the subject area, which is called semantic markup. Moreover, each corpus is connected with a certain nationality and has both its own national language and unique system of concepts of the world around it. Such representation imposes many non-standard requirements for the platform, such as working with arbitrary languages, supporting many ontologies, ensuring the creation and editing of national subject ontologies, semantic text markup, presentation, navigation, and search across heterogeneous resources. The developed platform provides all the necessary tools for research, including tools for the development of ontologies in specific national subject areas and manual annotation of texts in real time by several specialists. Resources of the web-platform are located in the resource ontology, which includes such concepts as corpus, video resource, audio resource, graphic image, person, geographical location, genre of text, etc. Ontologies of subject areas are presented in the form of a hierarchy, where the ontology of universals, common to all folklore studies, is located at the top level. At the same time, inherited ontologies are specialized for each represented national corpus. The web application is built with Python Django framework and the TypeScript React library. Data storage is implemented using the Postgres database.

About the Authors

V. A. Lisin
Novosibirsk State University
Russian Federation

Vladislav A. Lisin - Master's Student, Novosibirsk State University.

Novosibirsk.



E. A. Sidorova
A.P. Ershov Institute of Informatics Systems SB RAS
Russian Federation

Elena A. Sidorova - PhD, Senior Researcher, A. P. Ershov Institute of Systems informatics of the Siberian Branch of the Russian Academy of Sciences, AI laboratory.

Novosibirsk.



References

1. Pontus Stenetorp, Sampo Pyysalo, Goran Topic, Tomoko Ohta, Sophia Ananiadou, Jun'ichi Tsujii. Brat: a Web-based Tool for NLP-Assisted Text Annotation. In: Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, 2012, p. 102-107.

2. Tobias Daudert. A Web-based Collaborative Annotation and Consolidation Tool. In: Proceedings of the 12th Conference on Language Resources and Evaluation, 2020, p. 7053-7059.

3. Juan Miguel Cejuela, Peter McQuilton, Laura Ponting, Steven J Marygold, Raymund Stefancsik, Gillian H Millburn. Tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles. Database the Journal of Biological Databases and Curation, 2014, no. 2014, p. bau033.

4. Mathilde Janier, John Lawrence, Chris Reed. OVA+: An Argument Analysis Interface. Frontiers in Artificial Intelligence and Applications, 2014, no. 266, p. 463-464.

5. Kustova G. I., Lyashevskaya O. N., Paducheva E. V., Rakhilin E. V. Semantic markup of vocabulary in the National Corpus of the Russian language: principles, problems, prospects. In: National corpus of the Russian language: 2003-2005. Results and prospects. Moscow, 2005, p. 155-174. (in Russ.)

6. Grinevich A. A. Ontology “Images of Khanty bear songs” for the portal “Folklore of the peoples of Siberia”. Bulletin of Musical Science, 2016, no. 3 (13), p. 95-99.

7. Meghini C., Doerr M. A first-order logic expression of the CIDOC Conceptual Reference Model. International Journal of Metadata, Semantics and Ontologies, 2018, no. 13 (2), p. 131149.


Review

For citations:


Lisin V.A., Sidorova E.A. Web Platform for Folklore Research Based on Domain Ontology. Vestnik NSU. Series: Information Technologies. 2021;19(2):53-64. (In Russ.) https://doi.org/10.25205/1818-7900-2021-19-2-53-64

Views: 139


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-7900 (Print)
ISSN 2410-0420 (Online)