SOFTWARE ARCHITECTURE OF HIGH-PRODUCTION COMPLEXES FOR TEXT OF THE NEWS PROCESSING FOR SOLVING THE PROBLEMS OF DATA MINING

  • Константин Константинович Отраднов Moscow Technological University (MIREA)
  • Антон Сергеевич Алёшкин Moscow Technological University (MIREA)
  • Дмитрий Олегович Жуков Moscow Technological University (MIREA)

Abstract

The article proposes the architecture of the software for processing collections of news text messages, as well as the corresponding composition and structure of the information system. The system is a few steps of obtaining and processing information, functioning on the basis of a hybrid computing cluster. Each stage of receiving, processing and storing information in a generalized information system is represented by a microservice as a separate program unit. At the same time, the possibility of using different technology stacks for each service is emphasized so that properly selected specialized solutions increase the efficiency and quality of the result, and the shortcomings of the classical micro-service architecture are offset by the internal heterogeneity of the micro-services expressed in the form of flexible modularization. The essence of the proposed approach is the application of the principle of pipeline concurrency based on a microservice architecture with dynamic service boundaries.

Author Biographies

Константин Константинович Отраднов, Moscow Technological University (MIREA)

Applicant, Senior Lecturer

Антон Сергеевич Алёшкин, Moscow Technological University (MIREA)

Candidate of Technical Science, Associate Professor of Department of automated management systems

Дмитрий Олегович Жуков, Moscow Technological University (MIREA)

doctor of technical sciences, professor, Deputy Director for Research of The Institute of Comprehensive Security and Special Instrumentation

References

1. Lesko, S.A., Zhukov, D.O. Trends, self-similarity, and forecasting of news events in the information domain, its structure and director. 2015 International Conference on Big Data Intelligence and Computing, Chengdu, China. -- 2015.
2. Zhukov, D.O., Lesko, S.A. Stochastic self-organization of poorly structured data and memory realization in an information domain when designing news events forecasting models. The 2nd IEEE International Conference on Big Data Intelligence and Computing, Auckland, New Zealand. -- 2016.
3. Sigov, A., Zhukov, D., Novikova, O. Modelling of memory realization processes and the implementation of information self-organization in forecasting the new's events using arrays of natural language texts. 1st International Scientific Conference Convergent Cognitive Information Technologies, Moscow, Russian Federation. -- 2016.
4. A. S. Sigov, D.A. Akimov, D.O. Zhukov, E.G. Andrianova, V. E. Sachkov, V.K. Raev. Psycholinguistic analysis of text messages in Russian based on their phono semantic statistical characteristics. Informatics and applications. 2017 volume 11 issue 3, pp. 77 -86.
5. D.O. Zhukov, A.M. Zamyshlyaev, O.A. Novikova. Model of Forecasting the Social News Events on the Basis of Stochastic Dynamics Methods. ITM Web of Conferences 10, 02009 (2017) 2017 Seminar on Systems Analysis, DOI: 10.1051/itmconf/20171002009.
6. Zhukov D.O., Novikova O.A., Otradnov K.K. Methods of analysis of news events in the information space based on the use of almost – periodic functions, wavelet transforms and Hurst’s self-similarity. Proceeding The 7th International Conference on Information Communication and Management ICICM’17, August 28-30, 2017, Moscow, Russian Federation, ACM ISBN 978-1-4503-5279-6/17/08
Published
2017-10-01
How to Cite
ОТРАДНОВ, Константин Константинович; АЛЁШКИН, Антон Сергеевич; ЖУКОВ, Дмитрий Олегович. SOFTWARE ARCHITECTURE OF HIGH-PRODUCTION COMPLEXES FOR TEXT OF THE NEWS PROCESSING FOR SOLVING THE PROBLEMS OF DATA MINING. Modern Information Technologies and IT-Education, [S.l.], v. 13, n. 3, p. 93-99, oct. 2017. ISSN 2411-1473. Available at: <http://sitito.cs.msu.ru/index.php/SITITO/article/view/294>. Date accessed: 03 july 2024. doi: https://doi.org/10.25559/SITITO.2017.3.437.

Most read articles by the same author(s)