Development of a Neural Network Method in the Problem of Classification and Image Recognition
Abstract
In the operation of any algorithm for face recognition or face detection, two logical blocks should be distinguished: an extractor of characteristic features and a classification mechanism. The action of the extractor is based on the extraction of information useful for the classifier from a huge stream of input data. When identifying a person, this information may be the characteristics of uniquely determined features (for example, the relative position of the eyes, eyebrows, lips and nose used in forensic science). When deciding whether to assign a class label to a recognizable object, a classifier should be guided by these very features. Feature selection is the most important task. Obviously, when choosing them, the most unique properties are taken into account, since they are the most reliable way to judge whether an object belongs to a particular class. There are many different approaches to obtaining class traits. The application of Object Detection to the solution of the problem of image classification and recognition is considered. The description of the FastER-RCNN method based on a two-stage neural network is given. The results of applying the YOLOv3 algorithm for training a neural network with different steps are presented. It is proposed to use an improved approach based on YOLO for accurate and fast object detection. The contributions of this work are: an efficient and accurate real-time detection model, ease and ability to locate objects based on improvements to the Fast-RCNN algorithm.
References
2. Girshick R., Donahue J., Darrell T., Malik J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition. IEEE Press, Columbus, OH, USA; 2014. p. 580-587. (In Eng.) DOI: https://doi.org/10.1109/CVPR.2014.81
3. Li S.Z., Jain A.K. Handbook of Face Recognition. Springer, London; 2011. 2nd Ed. 699 p. (In Eng.) DOI: https://doi.org/10.1007/978-0-85729-932-1
4. Erhan D., Szegedy C., Toshev A., Anguelov D. Scalable Object Detection Using Deep Neural Networks. 2014 IEEE Conference on Computer Vision and Pattern Recognition. IEEE Press, Columbus, OH, USA; 2014. p. 2155-2162. (In Eng.) DOI: https://doi.org/10.1109/CVPR.2014.276
5. He K., Zhang X., Ren S., Sun J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2015; 37(9):1904-1916. (In Eng.) DOI: https://doi.org/10.1109/TPAMI.2015.2389824
6. Krizhevsky A., Sutskever I., Hinton G.E. ImageNet classification with deep convolutional neural networks. Communications of the ACM. 2017; 60(6):84-90. (In Eng.) DOI: https://doi.org/10.1145/3065386
7. Mäenpää T. The Local Binary Pattern Approach to Texture Analysis: Extensions and Applications. Acta Universitatis Ouluensis: Series C, Technica. 2003; 187:1-75. (In Eng.)
8. Rothe R., Guillaumin M., Van Gool L. Non-maximum Suppression for Object Detection by Passing Messages Between Windows. In: Ed. by D. Cremers, I. Reid, H. Saito, M. H. Yang. Computer Vision – ACCV 2014. ACCV 2014. Lecture Notes in Computer Science. 2015; 9003:290-306. Springer, Cham. (In Eng.) DOI: https://doi.org/10.1007/978-3-319-16865-4_19
9. Shan C., Gong S., McOwan P.W. Facial expression recognition based on Local Binary Patterns: A comprehensive study. Image and Vision Computing. 2009; 27(6):803-816. (In Eng.) DOI: https://doi.org/10.1016/j.imavis.2008.08.005
10. Maturana D., Mery D., Soto Á. Face Recognition with Local Binary Patterns, Spatial Pyramid Histograms and Naive Bayes Nearest Neighbor Classification. 2009 International Conference of the Chilean Computer Science Society. IEEE Press, Santiago, Chile; 2009. p. 125-132. (In Eng.) DOI: https://doi.org/10.1109/SCCC.2009.21
11. Visani M., Garcia C., Jolion J.M. Bilinear Discriminant Analysis for Face Recognition. In: Ed. by S. Singh, M. Singh, C. Apte, P. Perner. Pattern Recognition and Image Analysis. ICAPR 2005. Lecture Notes in Computer Science. 2005; 3687:247-256. Springer, Berlin, Heidelberg. (In Eng.) DOI: https://doi.org/10.1007/11552499_28
12. Chen D., Cao L. Face recognition based on multi-module singular value features and probabilistic subspaces analysis. 2011 4th International Congress on Image and Signal Processing. IEEE Press, Shanghai, China; 2011. p. 1508-1512. (In Eng.) DOI: https://doi.org/10.1109/CISP.2011.6100445
13. Alekseeva N.N. et al. Primenenie metodov obrabotki izobrazhenij k zadache raspoznavanija vaskuljarnogo risunka ladoni [Application of image processing methods to the problem of palm vascular pattern recognition]. Proceedings of the International Conference on Electronic Devices and Control Systems. TUSUR, Tomsk; 2013, no. 2. p. 131-137. Available at: https://www.elibrary.ru/item.asp?id=23035297 (accessed 14.08.2021). (In Russ.)
14. Zeno B.H. Face Validation Using Skin, Eyes and Mouth Detection. Prikladnaja informatika = Journal of Applied Informatics. 2018; 13(1):69-81. Available at: https://www.elibrary.ru/item.asp?id=32601825 (accessed 14.08.2021). (In Russ., abstract in Eng.)
15. Alizadeh F., Nalousi S., Savari C. Face Detection in Color Images using Color Features of Skin. International Journal of Computer and Information Engineering. 2011; 5(4):366-372. (In Eng.) DOI: https://doi.org/10.5281/zenodo.1063276
16. Mokeyev A.V., Mokeyev V.V. On efficiency of face recognition using linear discriminant analysis and principal component analysis. Biznes-Informatika = Business Informatics. 2015; (3):44-54. Available at: https://elibrary.ru/item.asp?id=24351096 (accessed 14.08.2021). (In Russ., abstract in Eng.)
17. Xu L.S., Meng M.Q.-H., Wang K.Q. Pulse Image Recognition Using Fuzzy Neural Network. 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Press, Lyon, France; 2007. p. 3148-3151. (In Eng.) DOI: https://doi.org/10.1109/IEMBS.2007.4352997
18. Lou G., Shi H. Face image recognition based on convolutional neural network. China Communications. 2020; 17(2):117-124. (In Eng.) DOI: https://doi.org/10.23919/JCC.2020.02.010
19. Iudin D., Koposov E. Fractals: as Simple as Complex. NNGASU Publ., Nizhny Novgorod; 2013. Available at: https://www.nngasu.ru/unesco/resources/Fractals_as_simple.pdf (accessed 14.08.2021). (In Eng.)
20. Gafarov F.M., Galimyanov F.M. Iskusstvennye nejronnye seti i prilozheniya [Artificial Neural Networks and Applications]. Kazan publ., Kazan; 2018. 121 p. (In Russ.)
21. Houankpo H.G.K., Kozyrev D.V., Nibasumba E., Mouale M.N.B., Sergeeva I.A. A Simulation Approach to Reliability Assessment of a Redundant System with Arbitrary Input Distributions. In: Ed. by V. M. Vishnevskiy, K. E. Samouylov, D. V. Kozyrev. Distributed Computer and Communication Networks. DCCN 2020. Lecture Notes in Computer Science. 2020; 12563:380-392. Springer, Cham. (In Eng.) DOI: https://doi.org/10.1007/978-3-030-66471-8_29
22. Panetto H., Cecil J. Information systems for enterprise integration, interoperability and networking: theory and applications. Enterprise Information Systems. 2013; 7(1):1-6. (In Eng.) DOI: https://doi.org/10.1080/17517575.2012.684802
23. Nguyen D.T., Khachumov V.M. Models and methods for matching images in the problem of face recognition. Artificial Intelligence and Decision Making. 2016; (4):5-14. Available at: https://elibrary.ru/item.asp?id=27723582 (accessed 14.08.2021). (In Russ., abstract in Eng.)
24. Abramov N.S., Fralenko V.P., Khachumov M.V. Review of image recognition methods based on brightness invariants and geometric transformations. Sovremennye naukoemkie tehnologii = Modern High Technologies. 2020; (6-1):110-117. (In Russ., abstract in Eng.) DOI: https://doi.org/10.17513/snt.38078
25. Poniszewska-Maranda A. Management of access control in information system based on role concept. Scalable Computing: Practice and Experience. 2011; 12(1):35-50. Available at: https://www.scpe.org/index.php/scpe/article/view/686 (accessed 14.08.2021). (In Eng.)
26. Arulogun O.T., Omidiora E.O., Olaniyi O.M., Ipadeola A.A. Development of Security System using Facial Recognition. The Pacific Journal of Science and Technology. 2008; 9(2):377-385. (In Eng.)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Publication policy of the journal is based on traditional ethical principles of the Russian scientific periodicals and is built in terms of ethical norms of editors and publishers work stated in Code of Conduct and Best Practice Guidelines for Journal Editors and Code of Conduct for Journal Publishers, developed by the Committee on Publication Ethics (COPE). In the course of publishing editorial board of the journal is led by international rules for copyright protection, statutory regulations of the Russian Federation as well as international standards of publishing.
Authors publishing articles in this journal agree to the following: They retain copyright and grant the journal right of first publication of the work, which is automatically licensed under the Creative Commons Attribution License (CC BY license). Users can use, reuse and build upon the material published in this journal provided that such uses are fully attributed.