Development of a Neural Network Method in the Problem of Classification and Image Recognition

Abstract

Any algorithm for face recognition or face detection comprises two logical blocks: a feature extractor and a classifier. The extractor selects, from a large stream of input data, the information that is useful to the classifier. In person identification, this information may consist of distinctive facial features, such as the relative positions of the eyes, eyebrows, lips and nose used in forensic science. The classifier relies on these features when deciding which class label to assign to the object being recognized. Feature selection is therefore a critical task: the most distinctive properties are chosen because they give the most reliable indication of whether an object belongs to a particular class. Many approaches to obtaining class features exist. This paper considers the application of object detection to image classification and recognition. It describes the Faster R-CNN method, which is based on a two-stage neural network, and presents the results of training a neural network with the YOLOv3 algorithm for different numbers of steps. An improved YOLO-based approach is proposed for accurate and fast object detection. The contributions of this work are an efficient and accurate real-time detection model and a simple means of locating objects, based on improvements to the Fast R-CNN algorithm.
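The split into a feature extractor and a classifier can be illustrated with a minimal Python sketch. It is not the authors' method: the landmark names, the coordinates, and the helper names `extract_features` and `classify` are hypothetical, and a nearest-reference rule stands in for the neural network classifier discussed in the paper. The extractor turns facial landmark positions into relative distances, and the classifier assigns the label of the closest reference vector.

```python
import math
from itertools import combinations


def extract_features(landmarks):
    """Feature extractor (illustrative): pairwise landmark distances,
    divided by the inter-eye distance so the result does not depend on
    image scale or on where the face sits in the frame."""
    names = sorted(landmarks)
    scale = math.dist(landmarks["left_eye"], landmarks["right_eye"])
    return [
        math.dist(landmarks[a], landmarks[b]) / scale
        for a, b in combinations(names, 2)
    ]


def classify(features, references):
    """Classifier (illustrative): return the label whose reference
    feature vector is closest in Euclidean distance."""
    return min(references, key=lambda label: math.dist(features, references[label]))


# Hypothetical landmark coordinates in pixels, invented for this example.
alice = {"left_eye": (30, 40), "right_eye": (70, 40), "nose": (50, 60), "mouth": (50, 80)}
bob = {"left_eye": (30, 40), "right_eye": (70, 40), "nose": (50, 70), "mouth": (50, 95)}
references = {"alice": extract_features(alice), "bob": extract_features(bob)}

# The same face as `alice`, photographed at twice the size and shifted by 10 px.
probe = {"left_eye": (70, 90), "right_eye": (150, 90), "nose": (110, 130), "mouth": (110, 170)}
predicted = classify(extract_features(probe), references)
print(predicted)
```

Because the features are relative distances, the enlarged and shifted probe maps to the same feature vector as its reference, which is why such relative positions are a natural choice of class features.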

Author Biographies

Moutouama N’dah Bienvenu Mouale, Peoples' Friendship University of Russia

Postgraduate Student of the Department of Applied Informatics and Probability Theory, Faculty of Science

Dmitry Vladimirovich Kozyrev, Peoples' Friendship University of Russia; V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences

Associate Professor of the Department of Applied Informatics and Probability Theory, Faculty of Science; Senior Researcher, Ph.D. (Phys.-Math.), Associate Professor

Hector Gibson Kinmanhon Houankpo, Peoples' Friendship University of Russia

Postgraduate Student of the Department of Applied Informatics and Probability Theory, Faculty of Science

Emmanuel Nibasumba, Peoples' Friendship University of Russia

Postgraduate Student of the Department of Applied Informatics and Probability Theory, Faculty of Science

Published
2021-09-30
How to Cite
MOUALE, Moutouama N’dah Bienvenu et al. Development of a Neural Network Method in the Problem of Classification and Image Recognition. Modern Information Technologies and IT-Education, [S.l.], v. 17, n. 3, p. 507-518, sep. 2021. ISSN 2411-1473. Available at: <http://sitito.cs.msu.ru/index.php/SITITO/article/view/750>. Date accessed: 18 sep. 2025. doi: https://doi.org/10.25559/SITITO.17.202103.507-518.
Section
Theoretical Questions of Computer Science, Computer Mathematics