LEXICON-BASED APPROACH IN GENERALIZATION EVALUATION IN RUSSIAN LANGUAGE MEDIA
Abstract
We consider generalization as a property of human thinking to make general conclusion based on authors’ own experience and observations and one of the techniques of authors use to manipulate the readership and present an algorithm for evaluation of the generalization in texts. The algorithm is based on the lexicon-based approach. To search the generalization we use ready-made dictionary (KEY-dictionary) and RuSentiLex dictionary. KEY-dictionary contains words and phrases (elements) that express the generalization. In RuSentiLex we take the words and phrases that express opinion and fact. The algorithm searches exact matches the elements from text with the elements from the dictionaries, it is also important that the elements from different dictionaries have their weights. New method is developed for automatic detection of generalization in texts from official media. Numerical calculations of generalization were performed using a special software application. To test the proposed approach the expert estimation were used.
References
[2] Teun A. van Dijk. Prejudice in Discourse. An Analysis of Ethnic Prejudice in Cognition and Conversation. Amsterdam: Benjamins. 1984. 172 p.
[3] Ajiboye E. Ideological Discourse Analysis of the Functions of Feedback Comments on Online Reports of Socio-political Crises in Nigeria. Covenant Journal of Language Studies. 2013. 1(2):128-147. Available at: https://journals.covenantuniversity.edu.ng/index.php/cjls/article/view/43/38 (accessed 01.06.2018).
[4] Frolova I.V. On expression of subjectivity in analytical articles of the British and Russian quality press. Political Linguistics. 2015; 1(51):138-145. Available at: https://elibrary.ru/item.asp?id=23590795 (accessed 01.06.2018). (In Russian)
[5] Dankova N.S. Ggeneralization strategy as a means of representing judicial power (on the material of Russian and English print media). Political Linguistics. 2016; 1(55):73–81. Available at: https://elibrary.ru/item.asp?id=25718823 (accessed 01.06.2018). (In Russian)
[6] Recasens M., Danescu-Niculescu-Mizil C., Jurafsky D. Linguistic Models for Analyzing and Detecting Biased Language. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 2013. Vol. 1. Pp. 1650–1659.
[7] Morstatter F., Wu L., Yavanoglu U., Corman S.R., Liu H. Identifying Framing Bias in Online News. ACM Transactions on Social Computing. 2018; 1(2): Article 5. DOI: 10.1145/3204948
[8] Orlova O.G. Russia stereotypes glossary: manner and matter. Political Linguistics. 2013; 3(45):175–182. Available at: https://elibrary.ru/item.asp?id=20378624 (accessed 01.06.2018). (In Russian)
[9] Smit S. Master the power of suggestion! Achieve all you want! [Ovladejte siloj vnushenija! Dobivajtes' vsego chego hotite!]. M.: Izdatel'stvo AST, 2014. 448 p. (In Russian)
[10] D'Andrea A., Ferri F., Grifoni P., Guzzo T. Approaches, Tools and Applications for Sentiment Analysis Implementation. International Journal of Computer Applications. 2015; 125(3):26-33. DOI: 10.5120/ijca2015905866
[11] Vinodhini G., Chandrasekaran R.M. A sampling based sentiment mining approach for e-commerce applications. Information Processing & Management. 2017; 53(1): 223-236. DOI: 10.1016/j.ipm.2016.08.003
[12] Maynard D., Funk A. Automatic Detection of Political Opinions in Tweets. R. García-Castro, D. Fensel D., G. Antoniou G. (Eds.) The Semantic Web: ESWC 2011 Workshops. ESWC 2011. Lecture Notes in Computer Science, Vol. 7117. Springer, Berlin, Heidelberg, 2012. Pp. 88-99. DOI: 10.1007/978-3-642-25953-1_8
[13] Thakkar H., Patel D. Approaches for Sentiment Analysis on Twitter: A State-of-Art study. CoRR. Vol. abs/1512.01043. 2015. Available at: http://arxiv.org/abs/1512.01043 (accessed 01.06.2018).
[14] Taboada M., Brooke J., Tofiloski M., Voll K., Stede M. Lexicon-Based Methods for Sentiment Analysis. Computational Linguistics. 2011; 37(2):267–307. DOI: 10.1162/COLI_a_00049
[15] Repaka R., Pallelra R.R., Koppula A.R., Movva V.S. UMDuluth-CS8761-12: A Novel Machine Learning Approach for Aspect Based Sentiment Analysis. Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). Association for Computational Linguistics, Denver, Colorado, 2015. Pp. 742-747. Available at: http://aclweb.org/anthology/S/S15/S15-2126.pdf (accessed 01.06.2018).
[16] Rice D., Zorn C. Corpus-based dictionaries for sentiment analysis of specialized vocabularies. Proceedings of NDATAD. 2013. Pp. 98-115. Available at: https://static1.squarespace.com/static/5557a550e4b0443afbea6783/t/5b569390758d4635216e5a73/1532400530505/Rice-Zorn-DictionaryPaper-PSRM-R%26R.pdf (accessed 01.06.2018).
[17] Whitehead M., Yaeger L. Sentiment Mining Using Ensemble Classification Models. T. Sobh (Ed.) Innovations and Advances in Computer Sciences and Engineering. Springer, Dordrecht, 2010. Pp. 509-514. DOI: 10.1007/978-90-481-3658-2_89
[18] Mudinas A., Zhang D., Levene M. Combining lexicon and learning based approaches for concept-level sentiment analysis. Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM'12). ACM, New York, NY, USA, 2012. Article 5, 8 pages. DOI: 10.1145/2346676.2346681
[19] Turney P.D. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL '02). Association for Computational Linguistics, Stroudsburg, PA, USA, 2002. Pp. 417-424. DOI: 10.3115/1073083.1073153
[20] Ualiyeva I., Krassovitsky A., Mussabayev R. Generalization rate evaluation in open publication materials in Russian language. Proceedings of the 16th International Scientific Conference Information Technologies and Management. April 26-27, 2018, ISMA University, Riga, Latvia, 2018. Pp. 81-82. Available at: https://www.isma.lv/FILES/SCIENCE/IT&M2018_THESES/02_CMIT/30_IT&M2018_Ualieva_Krassovitskiy_Mussabayev.pdf (accessed 01.06.2018).
[21] Loukachevitch N.V., Levchik A.V. Creating Russian Sentiment Lexicon. Open Semantic Technologies for Intelligent System. 2016; 6:377-382. Available at: https://elibrary.ru/item.asp?id=30080327 (accessed 01.06.2018). (In Russian)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Publication policy of the journal is based on traditional ethical principles of the Russian scientific periodicals and is built in terms of ethical norms of editors and publishers work stated in Code of Conduct and Best Practice Guidelines for Journal Editors and Code of Conduct for Journal Publishers, developed by the Committee on Publication Ethics (COPE). In the course of publishing editorial board of the journal is led by international rules for copyright protection, statutory regulations of the Russian Federation as well as international standards of publishing.
Authors publishing articles in this journal agree to the following: They retain copyright and grant the journal right of first publication of the work, which is automatically licensed under the Creative Commons Attribution License (CC BY license). Users can use, reuse and build upon the material published in this journal provided that such uses are fully attributed.