Ontology of Terms Present in the Titles of Scientific Articles with the Usage Relationship between Them

Document Type : Research َ Article

Author

Assistant Professor, Department of Computer Engineering, NT.C., Islamic Azad University, Tehran, Iran

10.30484/nastinfo.2025.3876.2351

Abstract

Purpose: The question of what methods have been used by researchers to solve a given scientific ‎problem is an important question for a researcher who intends to address that scientific ‎problem. Important concepts and methods in any field of science are mentioned in the ‎technical terms of that field. Therefore, the "usage" relationship between technical terms is ‎of considerable importance. The purpose of this research is to establish a method that ‎recognizes the usage relationship between terms present in the titles of scientific articles and ‎to include those terms and the usage relationship between them in an ontology. Identifying ‎and recording the usage relationship between terms, when implemented on a large scale, ‎reveals what approaches are examined by researchers to address a given task.‏
Method: This research is applied in purpose and qualitative in nature. The primary data are the titles of articles in a selected scientific journal, a specified time period. We propose a method which detects technical terms present in the title of scientific articles, finds two relations between those terms, namely the hyponym-hypernym and usage relations, and inserts the result into an ontology. First, the noun phrases present in the title of the article are detected. These noun phrases are the technical terms, with some reservations. Our method focuses on titles in which two technical terms are related through a connector, which semantically indicates usage. Such cases are inserted into the ontology after detection. In addition to the usage relation, a hyponymy relation is also proposed based solely on the syntactic structure of each found noun phrase. Appropriate inference rules are designed in the ontology, based on which the usage relation between terms is inherited from the hyponym to the hypernym. Although the usage relationship between the article title components can also be extracted using large language models, performing this for a large number of article titles is costly. In addition, it will be necessary to analyze the response of the language models. Also, the need for inclusion in the ontology will remain.
Findings: To evaluate the method, the titles of one year of articles from a scientific journal were used. After implementing the method, the ontology was examined. In 69% of the titles which contained the pattern of this study, both noun phrase, i.e. the used term and the using term, were completely extracted. In 17% of the titles with the pattern, one or both noun phrase, i.e. at least one of the two terms, were incompletely extracted, so that the extracted part is semantically correct but does not contain the entire meaning of the title. In 14% of the titles with the pattern, at least one of the two noun phrase, i.e., the technical used term or the using term, was extracted incorrectly.
Conclusion: Since the proposed method is automatic, it can be applied to a large number of scientific articles. To increase accuracy, it is suggested that other connectives be considered in the analysis of the article title, in addition to the connectives that reflect the usage relationship.

Keywords

Main Subjects


منابع
اسماعیلی آدرگانی، زهرا و  فراهی، احمد (1392). استخراج کلیدواژه‌های مقالات فارسی بر اساس تحلیل معنایی. هفتمین کنفرانس ملی مهندسی برق با محوریت انرژی های نو. https://civilica.com/doc/244182.
بهاری ورزنه، حسین (1401). مقایسه عملکرد بازیابی اطلاعات موتورهای جستجوی معنایی و کلیدواژه‌ای بر اساس جستجوی عبارتی. فصلنامه مطالعات دانش پژوهی. https://civilica.com/doc/1568953.
درزی خلردی، صغرا و رضوی، علی‌اصغر (1397). همخوانی کلیدواژه‌های مقاله‌های مجلات دانشگاه علوم کشاورزی و منابع طبیعی ساری با اصطلاحنامه کب. مجله دانش‌شناسی. https://civilica.com/doc/1603395.
شریف، عاطفه (1388). مهندسی خودکار هستی‌شناسی: امکان‌سنجی استخراج روابط معنایی از متون فارسی و تعیین میزان پیدایی آن‌ها. کتابداری و اطلاع‌رسانی ۱۲(۲)، ۲۶۳-۲۴۳.
شمس‌فرد، مهرنوش و جعفری‌نژاد، فاطمه (1391). استخراج روابط معنایی میان فعل و وابسته‌های آن از متون زبان فارسی. فصلنامه پازند ۸(۳۰)، ۷۱-۵۳.
غنی پور تفرشی، مریم و حاجی زین العابدینی، محسن (1397). تاثیر مستندسازی کلیدواژه‌ها با استفاده از اصطلاحنامۀ پزشکی فارسی بر بازیابی مقالات از پایگاه‌های ایران مدکس و مدلیب. پژوهش های کتابخانه های دیجیتالی و هوشمند. https://civilica.com/doc/1015990.
مبشری، سروش (۱۴۰۳). مدلی برای بازنمایی فراعلم جهت کار با هستان‌شناسی‌های علمی. پایان‌نامه دکتری. گروه هوش مصنوعی، رباتیک و رایانش شناختی. دانشکده مهندسی و علوم کامپیوتر. دانشگاه شهید بهشتی.
هدایتی، سعید و حسنی آهنگر، محمدرضا (1400). استخراج تعابیر روشی جهت استخراج کلیدواژه ترکیبی در اسناد متنی با استفاده از شبکه عصبی کانولوشن. هشتمین کنفرانس ملی پژوهش های کاربردی در علوم برق، کامپیوتر و مهندسی پزشکی. https://civilica.com/doc/1234162.
References
Bahari-Varzaneh, H. (2022). Comparing the Information Retrieval Performance of Semantic and Keyword Search Engines based on Phrase Search, Quarterly Journal of Scholarly Studies. https://civilica.com/doc/1568953. [In Persian]
Cowie, J., & Lehnert, W. (1996). Information extraction. Communications of the ACM39(1), 80-91.
Darzi-Khalardi, S. & Razavi, A. (2019). Consistency of keywords in journal articles of Sari University of Agricultural Sciences and Natural Resources with the Kab Thesaurus, Journal of Knowledge Studies. https://civilica.com/doc/1603395. [In Persian]
Ding, H., & Luo, X. (2021). Attentionrank: Unsupervised keyphrase extraction using self and cross attentions. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 1919–1928.
Esmaeli-Adargai, Z. & Farahi, A. (2013). Extracting Keywords from Persian Articles based on Semantic Analysis, The 7th National Conference on Electrical Engineering with a Focus on New Energies. https://civilica.com/doc/244182. [In Persian]
Fader, A., Soderland, S., & Etzioni, O. (2011). Identifying relations for open information extraction. In Proceedings of the 2011 conference on empirical methods in natural language processing (pp. 1535-1545).
Ghanipour-Tafreshi, M. and Haji-Zeinolabedini, M. (2019). The effect of documenting keywords using Persian medical thesaurus on retrieving articles from Iran MedEx and MedLib databases, Digital and Smart Libraries Researches. https://civilica.com/doc/1015990. [In Persian]
Hedayati, S. and Hasani-Ahangar, M. (2021). Extracting Interpretations: A Method for Extracting Compound Keywords in Text Documents Using Convolutional Neural Networks, 8th National Conference on Applied Research in Electrical, Computer and Medical Engineering. https://civilica.com/doc/1234162. [In Persian]
Mobasheri, S. (2025). A Meta-science Representation Model for Handling Scientific Ontologies. PhD thesis. Department of Artificial Intelligence, Robotics and Cognitive Computing. Faculty of Computer Science and Engineering. Shahid Beheshti University. [In Persian]
Nomoto, T. (2022). Keyword extraction: a modern perspective. SN Computer Science, 4(1), 92.
OpenAI. (2025). ChatGPT (GPT-5) [Large language model]. OpenAI. https://chat.openai.com/
RiahiNia, N., Shadanpour, F., Borna, K., & Montazer, G. A. (2022). Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users’ evaluation. Human Information Interaction, 9(3). http://hii.khu.ac.ir/article-1-3069-fa.html [In Persian]
Shamsfard, M. and Jafarinejad, F. (2012). Extracting Semantic Relations between Verbs and their Arguments from Persian Texts. Pazand Quarterly8(30), 53-71. [In Persian]
Sharif, A. (1388). Automated Ontology Engineering: Feasibility study of extracting semantic relationships from Persian texts and determining their occurrence rate. Library and Information Sciences. [In Persian]
Xu, Z., & Zhang, J. (2021). Extracting keywords from texts based on word frequency and association features. Procedia Computer Science, 187, 77–82.
CAPTCHA Image