Preview

Vestnik NSU. Series: Information Technologies

Advanced search
Vol 20, No 3 (2022)
View or download the full issue PDF (Russian)
5-13 169
Abstract

The paper proposes a model of the text of a scientifc and technical article for the automation of markup in the corpus of scientifc and technical texts. It is proved that when creating a corpus of scientifc and technical texts, it is necessary to take into account the structural features of texts of scientifc and technical articles. The necessity of adding structural markup to the corpus of scientifc and technical texts has been shown. It is noted that the texts of scientifc and technical articles have the same narration structure for all texts in this class, and also contain a limited set of structural elements. The features of compositional organization of the texts of scientifc and technical articles are analyzed. The approximate content of each of the elements of article structure is described. Compositional structure of the texts of scientifc and technical articles in Bekus-Naur notation is presented. A model of the text of a scientifc and technical article in the form of a graph, the vertices and edges of which are the full-fledged structural elements of a scientifc and technical article, is proposed. It is proved that the representation of a text of scientifc and technical article in the form of a graph makes it possible to determine the type of structural element and the degree of nesting in the process of computer analysis of the text by presenting the scientifc and technical article as a fnite set of its constituent parts. It is proved that the presence of structural markup in the corpus of scientifc and technical texts signifcantly expands its research potential and serves as the basis for the tasks of automatic processing of scientifc and technical texts.

14-28 124
Abstract

The paper presents a full-featured simulation debugging system designed to test programmable controllers for automated process control systems in the laboratory. This system imitates the operating conditions of the automated process control system, as close as possible to the real operating conditions at the automation object. The presented system can form various levels of interference of input signals, influencing the automatic process control system with high-intensity network and impulse noise. The system also allows one to vary the parameters of the communication line. The structure of the conducted interference generator and the variator of the communication line parameters have been developed. A description of all elements necessary for modeling the main types of interference is given. In this work, an analysis of the existing electromagnetic interference in the signal circuits of process control systems was carried out, and the most typical interference was identifed. A block diagram of a conducted interference generator and a variator of communication line parameters has been developed. Both functional blocks are part of the modeling and debugging complex. They allow one to simulate the interference environment for the controller under test by introducing the generated interference into the signal communication lines in a conductive way. The structure of the adapter-former of interference for analog signals is worked out in detail with a description of its main components. Recommendations for choosing the element base are given. The practical signifcance of the performed work lies in the fact that it may improve the efciency of the complex of control and laboratory tests of the systems being created. This allows one to achieve a reduction in complexity and in setup time during implementation at the automation facility

29-37 113
Abstract

The purpose of the work is to develop the information environment of the Siberian Branch of the Russian Academy of Sciences (SB RAS). The information environment of SB RAS has begun to take shape since the end of the 1990s with the creation of a corporate website, websites of institutes and the NSU website. Now the information environment of SB RAS unites more than 100 scientifc organizations and universities of Siberia, serves as a platform where the  scientifc and technical potential of the East of Russia is presented. Because of the recently introduced sanctions and the information warfare against Russia, some urgent tasks have arisen – in particular, to replace the information systems with open data such as Figshare and Zenodo. The article proposes to present a catalog of domestic open data databases on the SB RAS Portal. As an example, the section “Import-independence” that was created in May 2022 and contains open data is presented.

38-50 170
Abstract

This paper describes a research of the restoring distorted audio signal possibility. Based on the previously obtained results of using deep machine learning methods, the concept of a neural network to correct a distorted audio signal has been developed. On the basis of the originally obtained results, several new neural network architectures were developed, focused on the audio signal restoring. The paper contains descriptions of the developed architectures with a theoretical substantiation of the possibility of their application. The presented architectures were tested to solve the problem of restoring the part of a specifc instrument in a musical composition where it was removed. The results of testing the developed architectures of neural networks are presented in several forms.

51-64 177
Abstract

The effective study of Computer Science is impossible without the widespread use of information technology. Boolean algebra is one of the most fundamental sections of computer science. There is a list of disadvantages of this testing method for the evaluation of the master’s degree students’ knowledge of this topic. This results in development of other interactive tools. This article is devoted to the development and operation features of the automated system “Boolean algebra”, which was designed to control the formation of the competencies’ achievement indicators obtained during studying Boolean algebra. The developed system automatically creates task option and contains three types of tasks. The frst task includes construction of a truth table for three Boolean functions, the second requires Boolean circuit construction, and the third task is focused on Boolean expression simplifying. The system architecture consists of eightindependent modules which realize the new task option synthesis and its design, Boolean functions decomposition, truth tables construction, analysis, and evaluation of the results. The article describes the concept of each module. It describes in detail the synthesis of Boolean functions, as well as the analysis modules and their functioning algorithms. The automated system under operation proved to be effective since it allows one to objectively evaluate user’s theoretical knowledge and ability to apply these methods to the truth tables construction. There are also several advantages of the considered automated system: it requires little space on a hard disk, workes in a network and on local computer, has intuitive interface, generates a large variety of tasks and performs their autocheking.

65-76 238
Abstract

Nowadays, the number of scientifc publications existing in the form of electronic text is constantly growing. As a result, the tasks related to the text processing of scientifc articles become especially actual. This paper is dedicated to the task of extracting semantic relations between entities from the texts of scientifc articles in Russian, where we consider scientifc terms as entities. Relation extraction can be useful in some specialized areas, such as searching and question-answering systems, as well as in the compilation of ontologies. In our work, we have created a corpus of scientifc texts consisting of 136 abstracts of scientifc articles in Russian, in which 353 relations of the following types were highlighted: USAGE, ISA, TOOL, SYNONYMS, PART_OF, CAUSE. This corpus was used to train the machine learning models. In addition, we have implemented the automatic semantic relation extraction algorithm and tested it on the already existing corpus RuSERRC. The neural network model BERT was used to implement the algorithm. We’ve done a number of experiments using vectors derived from different language models, as well as two neural network architectures. The developed tool and the annotated corpus are publicly available and can be useful for other researchers.



Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-7900 (Print)
ISSN 2410-0420 (Online)