Preview

Vestnik NSU. Series: Information Technologies

Advanced search

On an Approach to Developing Information and Reference Systems Based on Large Language Models

https://doi.org/10.25205/1818-7900-2025-23-1-46-66

Abstract

The aim of this study is to develop a corporate context-aware question-answering system in the form of a chatbot to support territorial management by providing fast and accurate access to relevant information. The system is built upon large language models leveraging the Retrieval-Augmented Generation (RAG) approach, combined with modern data processing and retrieval techniques. The knowledge base of the system incorporates textual and tabular data extracted from official reports on environmental conditions. A PostgreSQL database with the pgvector extension was employed to store and retrieve large volumes of vectorized data efficiently. Advanced Bi- and Cross-encoder architectures were integrated to enhance the accuracy of context interpretation and the relevance of retrieved answers. The system’s backend was implemented using the Text Generation Interface platform, while the frontend relies on ReactJS and a Telegram bot, ensuring an intuitive user interface for diverse audiences. Additional features, such as query classification filters and temporal interval normalization, were introduced to refine the system’s response precision and user experience. The results demonstrate the effectiveness of combining state-of-the-art artificial intelligence technologies with robust data management solutions. The developed system not only improves response accuracy and retrieval efficiency but also offers scalability and adaptability for application in various domains. This research highlights the potential of AI-driven solutions in addressing complex challenges in territorial management and beyond.

About the Authors

S. E. Popov
Federal Research Center for Information and Computational Technologies
Russian Federation

Semyon E. Popov, Candidate of Technical Sciences, Senior Researcher

Novosibirsk



V. P. Potapov
Federal Research Center for Information and Computational Technologies
Russian Federation

Vadim P. Potapov, Doctor of Technical Sciences, Professor, Chief Researcher

Novosibirsk



R. Y. Zamaraev
Federal Research Center for Information and Computational Technologies
Russian Federation

Roman Yu. Zamaraev, Candidate of Technical Sciences, Senior Researcher

Novosibirsk



References

1. Jansen B. J., Rieh S. The Seventeen Theoretical Constructs of Information Searching and Information Retrieval. Journal of the American Society for Information Sciences and Technology, 2010, vol. 61(8), pp. 1517–1534. DOI: 10.1002/asi

2. Wang H., Qin Z., Wan T. Text generation based on generative adversarial nets with latent variables. Advances in Knowledge Discovery and Data Mining: 22nd Pacific-Asia Conference, PAKDD 2018, Melbourne, VIC, Australia, June 3-6, 2018, Proceedings, Part II 22. – Springer International Publishing, 2018, pp. 92–103. https://doi.org/10.48550/arXiv.1712.00170

3. Roumeliotis K. I., Tselikas N. D. ChatGPT and Open-AI Models: A Preliminary Review. Future Internet, 2023, vol. 15(6), p. 192. https://doi.org/10.3390/fi15060192

4. Vidivelli S., Ramachandran M., Dharunbalaji A. Efficiency-Driven Custom Chatbot Development: Unleashing LangChain, RAG, and Performance-Optimized LLM Fusion. Computers, Materials and Continua, 2024, vol. 80, iss. 2, pp. 2423−2442.


Review

For citations:


Popov S.E., Potapov V.P., Zamaraev R.Y. On an Approach to Developing Information and Reference Systems Based on Large Language Models. Vestnik NSU. Series: Information Technologies. 2025;23(1):46-66. (In Russ.) https://doi.org/10.25205/1818-7900-2025-23-1-46-66

Views: 31


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-7900 (Print)
ISSN 2410-0420 (Online)