+7 (495) 987 43 74
Join us -              
Рус   |   Eng

Authors

Sokolov Andrey M.

Degree
Leading Engineer, Scientific Department, Branch of the National Research University "MPEI" in Smolensk
E-mail
ansokol98@mail.ru
Location
Smolensk, Russia
Articles

Rubrication of text information based on the voting of intellectual classifiers

The practical implementation of the concept of electronic government is one of the priorities of Russian state policy. The organization of effective interaction between authorities and citizens is an important element of this concept. In addition to providing public services, it should include the processing of electronic appeals (applications, complaints, suggestions, etc.). Research has shown that the speed and efficiency of appeal processing largely depend on the quality of determining the thematic rubric, i.e. solving the rubrication task. The analysis of citizens' appeals received by the e-mail and official websites of public authorities has revealed several specific features (small size, errors in the text, free presentation style, description of several problems) that do not allow the successful application of traditional approaches to their rubrication. To solve this problem, it has been proposed to use various methods of intellectual analysis of unstructured text data (in particular, fuzzy logical algorithms, fuzzy decision trees, fuzzy pyramidal networks, neuro-fuzzy classifi convolutional and recurrent neural networks). The article describes the conditions of the applicability of six intellectual classifiers proposed for rubricating the electronic citizens’ appeals. They are based on such factors as the size of the document, the degree of intersection of thematic rubrics, the dynamics of their thesauruses, and the amount of accumulated statistical information. For a situation where a specific model cannot make an unambiguous choice of a thematic rubric, it is proposed to use the classifier voting method, which can significantly reduce the probability of rubrication errors based on the weighted aggregation of solutions obtained by several models selected using fuzzy inference. Read more...

Creation of a chemical-technological system digital twin using the Python language

Currently, when modeling complex technological processes in cyber-physical systems, procedures for creating so-called "digital twins" (DT) have become widespread. DT are virtual copies of real objects which reflect their main properties at various stages of the life cycle. The use of digital twins allows real-time monitoring of the current state of the simulated system, and also provides additional opportunities for engineering and deeper customization of its components to improve the quality of products. The development of the "digital twin" technology is facilitated by the ongoing Fourth Industrial Revolution, which is characterized by the massive introduction of cyber-physical systems into production process. These systems are based on the use of the latest technologies for data processing and presentation and have a complex structure of information chain between its components. When creating digital twins of such systems elements, it is advisable to use programming languages, that allow visualization of simulated processes and provide a convenient and developed apparatus for working with complex mathematical dependencies. The Python programming language has similar characteristics. In the article, as an example of a cyber- physical system, a chemical-technological system based on a horizontal-grate machine is considered. This system is designed to implement the process of producing pellets from the apatite-nepheline ore mining wastes. The article describes various aspects of creating a digital twin of its elements that carry out the chemical-technological drying process in relation to a single pellet. The digital twin is implemented using the Python 3.7.5 programming language and provides the visualization of the process in the form of a three-dimensional interactive model. Visualization is done using the VPython library. The description of the digital twin software operation algorithm is given, as well as the type of the information system interface, the input and output information type, the results of modeling the investigated chemical-technological process. It is shown that the developed digital twin can be used in three versions: independently (Digital Twin Prototype), as an instance of a digital twin (Digital Twin Instance), and also as part of a digital twins set (Digital Twin Aggregate). Read more...

Tools for automated collection and analysis of sociological information on the territorial identity of city residents

The paper proposes an algorithm for automated search and initial analysis of sociological information aimed at studying the territorial identity of city area residents using Internet sources. Communities of social networks, e.g. VKontakte, are considered as the main data source, and websites of topographic objects found in the territories under study are used as auxiliary information sources. It is demonstrated that, in terms of information support, public pages and groups with open or restricted access walls have the greatest potential. The developed algorithm implies selecting relevant groups, finding content concerning area issues, and determining the indices of community activity in discussing territorial problems. The required information is retrieved through the interaction with a social network server with the use of the official Application Programming Interface (API). To identify communities and posts, it is proposed to apply methods of morphological analysis of textual information. The software implementation of the algorithm is described in Python 3.8.5, including original functions for the acquisition of data on communities by their identification numbers, for the formation of a set of urbanonyms for a specified area, and some other ones. The developed program has been used to analyze territorial groups in three areas of Moscow; the results of the analysis enable us to estimate the degree of the territorial identity of their residents. The analysis of the error in the results of automated data collection and processing shows good agreement of these results with manually obtained ones, i.e. the error is 2.6% in the identification of relevant groups and about 3% in the identification of posts on area issues. Therewith, a much higher speed of response and lower labor effort required to perform routine operations allow the algorithm and the implementing computer program to be viewed as an effective tool for sociological research based on data from social networks. Read more...

A computer program for electromechanical system operational diagnostics based on the topological approach

The paper presents a method, a mathematical model, and a computer program for the operational diagnostics of an electromechanical system (EMS). During EMS operation, service aging changes the properties of the parametric matrices of the windings and, as a consequence, the characteristics of the EMS vector space. Periodic testing of the vector space offers relevant and reliable data on the current health of the EMS, its changes during operation, and the risk of loss of function. The object of the study is an asynchronous electric motor (AEM). It is urgent to automate the process of assessing the current health of an AEM and to organize the storage of information on its states at different stages of its life cycle. To solve the problem, software (SW) for accumulation of information on AEM operation and for evaluation of its basic performance metrics has been developed in the Python programming language. The SW is based on the topological approach to diagnostics, which implies the analysis of the current responses of motor rotor windings to phase voltage pulses. The SW enables one to determine the rate of the service aging of an item, the probability of its survival and residual life, to obtain access to the history of previous diagnostics, and to visualize the in-service history of the above-mentioned performance metrics. The developed SW can be used to increase the AEM operation efficiency and to plan engineering or repair work; it can also be used as an information source for re- engineering and modification of existing AEMs. The described SW can be extended to perform operational diagnostics based on the topological approach of devices of various types. Also, this SW can be considered as a separate information component of the digital twin of a complex EMS, which will allow us to study the main indicators of its reliability, fault tolerance and operational efficiency at all stages of the life cycle. Read more...