Enriched Problem Network as a Core of the Metadata in a Digital Library

Abstract

This issue contributes to advancing of the entity search in problem networks of a digital library. The search is performed manually in a window, that visualizes some kind of knowledge graph, representing the library content involving the relevant metadata. A subset of the graph nodes depicts documents the library contains. The other part of nodes represents technological and scientific problems, being discussed in the documents, phases of problem-solving, sites of publication if any, and authoring. Edges of the graph depict diverse relations of the connected entities. Three levels of the metadata are discernable. The upper level is constituted of clusters built from nodes and edges and corresponding to particular semantic topics recognizable in the library content. The next two levels are of verbal nature. Optional labels of the nodes build the second level of library metadata. The label is a succinct nominative word composition assigned to a node when it was being included into the library. The label assists the entity search that starts from recognizing of clusters upon a cognitive map of their shapes in the user’s memory. The search is continued by moving in the graph space and adjusting the scale. At some scale, the node labels grow visible and amend and rule the search. The third metadata level lies outward of the graph space and is represented by a table. Through selection of a node being of interest, the corresponding table line is displayed, and the user can read full information about the node, such as bibliographic record of the document or problem characterization. This process provides the decisive information for finishing or continuing of the search. In both cases, the user receives full access to the pertinent documents stored in the library by a special browser. Finding of a specific document often appears to be a valuable but not the single outcome of using a library. In multiple cases one turns to a library for observing a collection of sources in some scientific and technological area and get the knowledge in an accumulated and possibly generalized form. An enrichment of the problem network in the core of library metadata is proposed, making the library metadata more informative and hence more suitable for the mode of library application mentioned above. Challenges emerging during the metadata enrichment are considered and corresponding recommendations are formulated. The issue contains a sample of a library collection built pursuant to the given recommendations.

Author Biographies

Andrey Petrovich Gagarin, Moscow Aviation Institute (National Research University)

Professor of the Department of Computers, Systems and Networks, Institute of Control Systems and Computer Science in Engineering, Cand.Sci. (Eng.), Professor

Ilja Andreevich Filimonov, Moscow Aviation Institute (National Research University)

Postgraduate Student of the Department of Computers, Systems and Networks, Institute of Control Systems and Computer Science in Engineering

References

1. Balog K. Entity-Oriented Search. The Information Retrieval Series. Vol. 39. Springer Cham; 2018. 351 p. (In Eng.) doi: http://doi.org/10.1007/978-3-319-93935-3, 2018
2. Siew C.S.Q., Wulff D.U., Beckage N.M., Kenett Y.N., Meštrović A. Cognitive Network Science: A Review of Research on Cognition through the Lens of Network Representations, Processes, and Dynamics. Complex. 2019; 2019:24. (In Eng.) doi: https://doi.org/10.1155/2019/2108423
3. Gagarin A.P., Filimonov I.A. Problem Network as Access Gate into a Document Repository. Sovremennye informacionnye tehnologii i IT-obrazovanie = Modern Information Technologies and IT-Education. 2020; 16(3):582-597. (In Russ., abstract in Eng.) doi: https://doi.org/10.25559/SITITO.16.202003.582-597
4. Heymann S., Grand B.L. Visual Analysis of Complex Networks for Business Intelligence with Gephi. 2013 17th International Conference on Information Visualisation. IEEE Press, London, UK; 2013. p. 307-312. (In Eng.) doi: https://doi.org/10.1109/IV.2013.39
5. Bastian M., Heymann S., Jacomy M. Gephi: An Open Source Software for Exploring and Manipulating Networks. Proceedings of the International AAAI Conference on Web and Social Media. 2009; 3(1):361-362. Available at: https://ojs.aaai.org/index.php/ICWSM/article/view/13937 (accessed 17.09.2021). (In Eng.)
6. Ait El Mouden Z., Jakimi A., Hajar M. An Algorithm of Conversion Between Relational Data and Graph Schema. In: Rocha Á., Serrhini M. (eds.) Information Systems and Technologies to Support Learning. EMENA-ISTL 2018. Smart Innovation, Systems and Technologies. Vol. 111. Springer, Cham; 2019. p. 594-602. (In Eng.) doi: https://doi.org/10.1007/978-3-030-03577-8_65
7. Brandes U., Eiglsperger M., Lerner J., Pich C. Graph Markup Language (GraphML). In: Tamassia R. (ed.) Handbook of Graph Drawing and Visualization. 1st ed. Chapman and Hall/CRC, New York; 2013. p. 517-541. (In Eng.) doi: https://doi.org/10.1201/b15385-19
8. Borgman C.L. What are digital libraries? Competing visions. Information Processing and Management. 1999; 35(3):227-243. (In Eng.) doi: https://doi.org/10.1016/S0306-4573(98)00059-4
9. Fox E.A., Akscyn R.M., Furuta R.K., Leggett J.J. Digital libraries. Communications of the ACM. 1995; 38(4):22-28. (In Eng.) doi: https://doi.org/10.1145/205323.205325
10. Gonçalves M.A., Fox E.A., Watson L.T., Kipp N.A. Streams, Structures, Spaces, Scenarios, Societies (5S): A Formal Model for Digital Libraries. ACM Transactions on Information Systems. 2004; 22(2):270-312. (In Eng.) doi: https://doi.org/10.1145/984321.984325
11. Ferro N., Silvello G. NESTOR: A formal model for digital archives. Information Processing & Management. 2013; 49(6):1206-1240. (In Eng.) doi: https://doi.org/10.1016/j.ipm.2013.05.001
12. Constantin A., Peroni S., Pettifer S., Shotton D., Vitali F. The Document Components Ontology (Do-CO). Semantic Web. 2016; 7(2):167-181. (In Eng.) doi: https://doi.org/10.3233/SW-150177
13. Mann W.C., Thompson S.A. Rhetorical Structure Theory: Toward a functional theory of text organization. Text ‒ Interdisciplinary Journal for the Study of Discourse. 1988; 8(3):243-281. (In Eng.) doi: https://doi.org/10.1515/text.1.1988.8.3.243
14. Taboada M., Mann W.C. Rhetorical Structure Theory: Looking Back and Moving Ahead. Discourse Studies. 2006; 8(3):423-459. (In Eng.) doi: https://doi.org/10.1177/1461445606061881
15. Marcu D. The Rhetorical Parsing of Unrestricted Texts: A Surface-based Approach. Computational Linguistics. 2000; 26(3):395-448. (In Eng.) doi: https://doi.org/10.1162/089120100561755
16. Gagarin A.P., Serdyukov V.V. Dynamic ontology based retrieval and analysis of information on science and engineering. Sovremennye informacionnye tehnologii i IT-obrazovanie = Modern Information Technologies and IT-Education. 2018; 14(3):654-662. (In Russ., abstract in Eng.) doi: https://doi.org/10.25559/SITITO.14.201803.654-662
17. Fatimi S., El Saili C., Alaoui L. Semantic Oriented Text Clustering Based on RDF. 2020 International Conference on Intelligent Systems and Computer Vision (ISCV). IEEE Press, Fez, Morocco; 2020. p. 1-8. (In Eng.) doi: https://doi.org/10.1109/ISCV49265.2020.9204133
18. Khan J.A., Kumar S. Deep analysis for development of RDF, RDFS and OWL ontologies with protégé. Proceedings of 3rd International Conference on Reliability, Infocom Technologies and Optimization. IEEE Press, Noida, India; 2014. p. 1-6. (In Eng.) doi: https://doi.org/10.1109/ICRITO.2014.7014747
19. Brack A., Hoppe A., Stocker M., Auer S., Ewerth R. Requirements Analysis for an Open Research Knowledge Graph. Proceedings of the 24th International Conference on Theory and Practice of Digital Libraries (TPDL 2020). Lyon, France; 2020. p. 1-15. (In Eng.) doi: https://doi.org/10.48550/arXiv.2005.10334
20. Färber M. The Microsoft Academic Knowledge Graph: A Linked Data Source with 8 Billion Triples of Scholarly Data. In: Ghidini C., et al. (eds.) The Semantic Web – ISWC 2019. ISWC 2019. Lecture Notes in Computer Science. Vol. 11779. Springer, Cham; 2019. p. 113-129. (In Eng.) doi: https://doi.org/10.1007/978-3-030-30796-7_8
21. Brodaric B., Reitsma F., Qiang Y. SKIing with DOLCE : toward an e-Science knowledge infrastructure. In: Eschenbach C., Gruninger M. (eds.) Frontiers in Artificial Intelligence and Applications. Vol. 193. Amsterdam, The Netherlands: IOS Press; 2008. p. 208-219. Available at: https://biblio.ugent.be/publication/723935 (accessed 17.09.2021). (In Eng.)
22. Borlund P. Interactive Information Retrieval: An Introduction. Journal of Information Science Theory and Practice. 2013; 1(3):12-32. (In Eng.) doi: https://doi.org/10.1633/JISTaP.2013.1.3.2
23. Shen J., Xiao J., He X., Shang J., Sinha S., Han J. Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach. Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '18). ACM, New York, NY, USA; 2018. p. 565-574. (In Eng.) doi: https://doi.org/10.1145/3209978.3210055
24. Wu J., Williams K.M., Chen H.-H., Khabsa M., Caragea C., Tuarob S., Ororbia A.G., Jordan D., Mitra P., Giles C.L. CiteSeerX: AI in a Digital Library Search Engine. AI Magazine. 2015; 36(3):35-48. (In Eng.) doi: https://doi.org/10.1609/aimag.v36i3.2601
25. Otasek D., Morris J.H., Bouças J., Pico A.R., Demchak B. Cytoscape Automation: empowering workflow-based network analysis. Genome Biology. 2019; 20:185. (In Eng.) doi: https://doi.org/10.1186/s13059-019-1758-4
Published
2021-12-20
How to Cite
GAGARIN, Andrey Petrovich; FILIMONOV, Ilja Andreevich. Enriched Problem Network as a Core of the Metadata in a Digital Library. Modern Information Technologies and IT-Education, [S.l.], v. 17, n. 4, p. 860-870, dec. 2021. ISSN 2411-1473. Available at: <http://sitito.cs.msu.ru/index.php/SITITO/article/view/805>. Date accessed: 16 oct. 2025. doi: https://doi.org/10.25559/SITITO.17.202104.860-870.