Metadata Ontology of Theses and Dissertations: Designing a Model

Document Type : Research َ Article

Author

Assistant Professor of the National Library and Archives and Library of Iran

10.30484/nastinfo.2024.3498.2247

Abstract

Purpose: Designing metadata ontology model for semantic representation of Theses and Dissertations by using the SPAR (Semantic Publishing and Referencing) Ontologies.

Methods: This study was an applied form and two methods was used, Content Analysis and mapping. The metadata of 69 theses and dissertations on National library and Archive of Iran in three Databases: 1) Digital Library of National library and Archive of Iran. 2) Rasa Software and 3) Ganj in Irannian Research Institue for Information Science and Technology were selected and modified and completed by mapping. On the other hand, by analyzing the entities of each SPAR Ontologies and suggesting another entities by researcher, the checklist was formed. This checklist included classes, properties and individuals. At last by entering them into Protégé software version 5.5, the model of metadata ontology, MdOntTDs, was drawn.

Findings: Findings: Findings identified deficiencies in the existence of four important metadata elements (subject, supervisor, advisor and abstract) in RASA and NLAI Digital Library. Among the 18 SPAR Ontologies, the most entities were selected from FaBiO, FRAPO and CiTO respectively. All entities of BiDO, BiRO, C4O, Fivestar, FR, FRBR, PO, PRO, PSO, and PWO were suitable for theses. 195 individuals from 6 SPAR Ontologies, 292 individuals labeled with MdTDs from theses and 100 individuals labeled with SUNMdTDs were selected by the researcher and entered into the software. 1558 entities categorized by class, Properties (object, data and Annotation) and individuals along with the description and definition of each entity were placed in the software, in the form of hierarchical and determining axioms for classes. and specifying domain and range for relationships. Finally, the RDF graph was drawn using the OntoGraf plugin and the final Model, MdOntTDs was developed.

in this research has proposed three new types of metadata: 1) Except for the existing keywords, topics have been categorized and modeled up to three level including 4 main categories, 16 subcategories and many units. Each of these final topics has been related with “hasSubject” and “isSubjectOf“ properties. 2) The research methods of Theses that were connected with “hasMethod” and “usedIn” properties. 3) The papers taken from Theses were also searched, as far as possible, and were connected with “hasJournalArticle” and “journalArticleOf” properties.

Conclusion: This model, if implemented, can overcome keyword search limitations, the problem of linking and Data sharing in the web, and the inconsistency of data. In the software, classes and its related individuals are clearly visible in the form of a hierarchical network in RDF triples, and the connection between entities with increasing of access points promise deeper semantic searches. However, due to the absence or lack of tagged and linked data, usage of the some of selected entities, are not possible.

Keywords: SPAR Ontologies; Metadata Ontologies; Semantic Publishing; Thesis; Dissertations; National Library and Archives of Iran; Ganj; Digital Library; Rasa Software.

Keywords

Main Subjects


CAPTCHA Image

Articles in Press, Accepted Manuscript
Available Online from 28 February 2024
  • Receive Date: 08 October 2023
  • Revise Date: 10 January 2024
  • Accept Date: 28 February 2024