

Authors: Dli M. I., Bulygina O. V., Kozlov P.
Published in No. 4(76), 31 August 2018
Rubric: Models and Methods

Formation of the structure of the intellectual system of analyzing and rubricating unstructured text information in different situations

The analysis of electronic text documents written in natural language is one of the most important tasks performed by systems for the automated analysis of linguistic information. Today, the most difficult problem is analyzing unstructured text documents that reach various organizations and authorities through electronic communication channels. The growing volume of such documents makes it necessary to rubricate incoming messages, i.e. to solve a classification task. An analysis of the scientific literature in this field shows that a single unified model cannot be constructed for rubricating unstructured electronic text documents in all situations. The main reasons are the lack of statistical data, the dynamic nature of the thesaurus, and the small size of incoming documents. To solve this problem, we propose a multimodel approach to rubrication that combines intellectual and probabilistic-statistical methods of text document analysis. A specific model is chosen by fuzzy logic algorithms on the basis of the proposed characteristics (the size of the document, the degree of intersection between rubric thesauri, the frequency of meaningful keywords, etc.). Implementing the proposed multimodel approach will improve the accuracy with which unstructured electronic text documents are assigned to specific rubrics, taking into account their specific features and the various practical objectives of the organization.
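The model-selection step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the membership breakpoints, the rule base, the feature names and the two model labels are all hypothetical and chosen only to show how fuzzy rules over document size, thesaurus intersection and keyword frequency could pick one model from several.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


def ramp_up(x: float, lo: float, hi: float) -> float:
    """Fuzzy membership rising linearly from 0 (at lo) to 1 (at hi)."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)


def ramp_down(x: float, lo: float, hi: float) -> float:
    """Complementary membership falling from 1 (at lo) to 0 (at hi)."""
    return 1.0 - ramp_up(x, lo, hi)


@dataclass
class DocumentFeatures:
    size_words: int            # document size in words
    thesaurus_overlap: float   # degree of rubric thesaurus intersection, 0..1
    keyword_frequency: float   # share of meaningful keywords in the text, 0..1


def select_model(doc: DocumentFeatures) -> Tuple[str, Dict[str, float]]:
    """Return the best-scoring model label and all rule activations."""
    # Fuzzify the inputs. Every breakpoint below is an assumption.
    small = ramp_down(doc.size_words, 50, 300)
    large = ramp_up(doc.size_words, 50, 300)
    low_overlap = ramp_down(doc.thesaurus_overlap, 0.2, 0.6)
    high_overlap = ramp_up(doc.thesaurus_overlap, 0.2, 0.6)
    rare = ramp_down(doc.keyword_frequency, 0.02, 0.10)
    frequent = ramp_up(doc.keyword_frequency, 0.02, 0.10)

    # Hypothetical rule base (AND = min, OR = max):
    #   large AND low overlap AND frequent keywords -> statistical model
    #   small OR high overlap OR rare keywords      -> pyramidal network
    scores = {
        "probabilistic_statistical": min(large, low_overlap, frequent),
        "growing_pyramidal_network": max(small, high_overlap, rare),
    }
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    # A short message with heavily overlapping rubric thesauri.
    model, scores = select_model(DocumentFeatures(40, 0.7, 0.01))
    print(model, scores)
```

The min/max operators are the simplest common choice for fuzzy AND/OR; a real system would tune the membership functions and rules on the organization's own documents.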

Keywords

electronic unstructured text documents, rubrication, multimodel approach, growing pyramidal networks, fuzzy logic algorithms.

The author:

Dli M. I.

Degree:

Dr. Sci. (Eng.), Professor, Information Technologies in Economics and Management Department, Branch of the National Research University “MPEI” in Smolensk, Smolensk; Leading Researcher, Synergy University

Location:

Smolensk, Russia

The author:

Bulygina O. V.

Degree:

Cand. Sci. (Econ.), Associate Professor, Department of Information Technology in Economics and Management, Branch of the National Research University “MPEI” in Smolensk

Location:

Smolensk

The author:

Kozlov P.

Degree:

PhD in Engineering, Assistant, Branch of the National Research University “MPEI” in Smolensk

Location:

Smolensk