Bibliotekar/The Librarian

APPLICATION OF ARTIFICIAL INTELLIGENCE IN AUTOMATIC UDC CLASSIFICATION

A REVIEW OF PUBLISHED RESEARCH

Authors
  • Olivera Stojanović

    Belgrade City Library

Keywords:
natural language processing, machine learning, artificial intelligence, Semantic Web, UDC, automatic classification
Abstract

The paper provides a review of the literature and research
published between 2020 and April 2025, focusing on the application of machine learning and natural language processing techniques in the field of automatic bibliographic classification, with a special emphasis on the Universal Decimal Classification (UDC) system. The aim of the paper is to provide insight into current trends addressing this topic in librarianship, as well as to offer a brief introduction to key concepts such as automatic classification, the Semantic Web, natural language processing, and machine learning. In conclusion, the paper highlights the need for developing local resources and educating professional staff to ensure the possible and sustainable application of similar AI technologies in domestic practice. 

References

1. Allam, Hesham, Lisa Makubvure, Benjamin Gyamfi, Kwadwo Nyarko Graham, and Kehinde Akinwolere. “Text Classification: How Machine Learning Is Revolutionizing Text Categorization”. Information 16 no. 2 (2025), https://doi.org/10.3390/info16020130 (преузето 15. 7. 2025).

2. Andonovski, Jelena. „Mreža otvorenih podataka i jezički resursi u procesu izgradnje srpsko-nemačkog literarnog korpusa: doktorska disertacija”. Beograd: J. Andonovski, 2019. https://phaidrabg.bg.ac.rs/o:22874 (преузето 3. 4. 2025). (na ćirilici)

3. Aum, S. and S. Choe. “srBERT: Automatic Article Classification Model for Systematic Review Using BERT”. Syst Rev 10 (2021): 285. https://doi.org/10.1186/s13643-021-01763-w. (преузето 28. 8. 2025).

4. Berners-Lee, Tim, James Hendler and Ora Lassila. “The Semantic Web”. Scientific American 284 no. 5 (2001): 34–43. https://doi.org/10.1038/scientificamerican0501-34. (преузето 4. 4. 2025).

5. Beckett, David. RDF 1.2 N-Triples. https://www.w3.org/TR/rdf12-n-triples/ (преузето 24. 8. 2025).

6. Bogdanović, Miloš, Jelena Kocić and Leonid Stoimenov. “SRBerta–A Transformer Language Model for Serbian Cyrillic Legal Texts”. Information 15 (2024): DOI:10.3390/info15020074. (преузето 28. 8. 2025).

7. Borovic, Mladen, Ojstersek, Milan and Strnad, Damjan. “A Hybrid Approach to Recommending Universal Decimal Classification Codes for Cataloguing in Slovenian Digital Libraries”. IEEE Access 10 (n.d.): 85595–605. https://www.academia.edu/111230005/A_Hybrid_Approach_to_Recommending_Universal_Decimal_Classification_Codes_for_Cataloguing_in_Slovenian_Digital_Libraries, doi:10.1109/ACCESS.2022.3198706 (преузето 3. 4. 2025).

8. GeeksforGeeks. “An Introduction to MultiLabel Classification”, 2025. https://www.geeksforgeeks.org/machine-learning/an-introduction-to-multilabel-classification/ (преузето 20. 8. 2025).

9. Heaton, Jeff. “Review of Deep Learning, by Ian Goodfellow, Yoshua Bengio and Aaron Courville”. Genetic Programming and Evolvable Machines 19 (2018): 305–307. https://doi.org/10.1007/s10710-017-9314-z (преузето 24. 8. 2025).

10. Ikonomakis, Emmanouil, Sotiris Kotsiantis and V. Tampakas. “Text Classification Using Machine Learning Techniques”. WSEAS Transactions on Computers 4 (2005): 966–974, https://www.researchgate.net/publication/228084521_Text_Classification_Using_Machine_Learning_Techniques (преузето 15. 7. 2025).

11. Joorabchi, Arash and Abdulhussain E. Mahdi. “An Unsupervised Approach to Automatic Classification of Scientific Literature Utilizing Bibliographic Metadata”. Journal of Information Science 37 no. 5 (2011): 499–514. https://doi.org/10.1177/0165551511417785 (преузето 4. 4. 2025).

12. Kragelj, Matjaž and Mirjana Kljajić Borštnar. “Automatic Classification of Older Electronic Texts into the Universal Decimal Classification-UDC”. Journal of Documentation 77 no. 3 (2021): 755–76. https://doi.org/10.1108/JD-06-2020-0092/full/html (преузето 2. 4. 2025).

13. K. Means Clustering – Introduction, https://www.geeksforgeeks.org/machine-learning/k-means-clustering-introduction (преузето 25. 8. 2025).

14. LangChain. “BM25”. https://python.langchain.com/docs/integrations/retrievers/bm25/ (преузето 4. 4. 2025).

15. Ljubesic, Nikola and Davor Lauc. “BERTić - The Transformer Language Model for Bosnian, Croatian, Montenegrin and Serbian”. CoRR abs/2104.09243 (2021). https://arxiv.org/abs/2104.09243. (преузето 28. 8. 2025).

16. Nađ, Žolt. Osnove veštačke inteligencije i mašinskog učenja. Beograd: Kompjuter biblioteka, 2019. (na ćirilici)

17. Oxford University Press. “Natural-Language Processing”. In Oxford Reference, 2024. https://www.oxfordreference.com/display/10.1093/oi/authority.20110803100225333 (преузето 23. 5. 2025).

18. Roy, Aditi and Saptarshi, Ghosh. “Automated Subject Identification Using the Universal Decimal Classification: The ANN Approach”. Journal of Information and Knowledge 60 (2): 69–76. (2023). https://doi.org/10.17821/srels/2023/v60i2/170963. (преузето 2. 4. 2025).

19. Slavic, Aida, Ronald Siebes and Andrea Scharnhorst. “Publishing a Knowledge Organization System as Linked Data: The Case of the Universal Decimal Classification”. ArXiv 2205 no. 01395 (2022). https://doi.org/10.5771/9783956506611-69 (преузето 27. 3. 2025).

20. Tidake, Vaishali S. and Shirish S. Sane. „Multi-label Classification: A Survey”. International Journal of Engi-neering & Technology 7 no. 4.19 (2018). https://doi.org/10.14419/ijet.v7i4.19.28284. (преузето 24. 7. 2025).

21. Trtovac Aleksandra i Dakić Nataša, „Baza CONOR.SR u sistemu COBISS.SR”. Infotheca: Journal for Digital Humanities v. 20, n. 1–2a (feb. 2021): 75–88. https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2020.20.1_2.5_sr (преузето 4. 4. 2025).

22. Univerzitet Union, Računarski fakultet. „Šta je mašinsko učenje i šta su inteligentni algoritmi?” https://raf.edu.rs/citaliste/najnoviji-it-dogadjaji/sta-je-masinsko-ucenje-i-sta-su-inteligentni-algoritmi/ (преузето 3. 4. 2025).

23. Xiao, Tong and Jingbo Zhu. “Foundations of Large Language Models”. NLP Lab, Northeastern University & NiuTrans Research, 2025. https://github.com/NiuTrans/NLPBook/tree/main (преузето 20. 6. 2025).

24. World Wide Web Consortium (W3C). https://www.w3.org (преузето 4. 4. 2025).

Cover Image
Published
2026-02-13
Section
Practice
License

Copyright (c) 2026 The Librarian: The Journal of Theory and Practice of Librarianship

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

How to Cite

APPLICATION OF ARTIFICIAL INTELLIGENCE IN AUTOMATIC UDC CLASSIFICATION: A REVIEW OF PUBLISHED RESEARCH. (2026). Bibliotekar (The Librarian): The Journal of Theory and Practice of Librarianship, 67(2), 113-132. https://bibliotekar.bds.rs/index.php/1/article/view/273

Similar Articles

1-10 of 13

You may also start an advanced similarity search for this article.