Using terms and informal definitions to classify domain entities into top-level ontology concepts: An approach based on language models

Knowledge-Based Systems - Tập 265 - Trang 110385 - 2023
Alcides Lopes1, Joel Carbonera1, Daniela Schmidt1, Luan Garcia1, Fabricio Rodrigues1, Mara Abel1
1Institute of Informatics, Universidade Federal do Rio Grande do Sul, Porto Alegre, 91501970, Brazil

Tài liệu tham khảo

Garcia, 2020, The GeoCore ontology: A core ontology for general use in Geology, Comput. Geosci., 135, 10.1016/j.cageo.2019.104387 Cicconeto, 2022, GeoReservoir: An ontology for deep-marine depositional system geometry description, Comput. Geosci., 159, 10.1016/j.cageo.2021.105005 Degtyarenko, 2007, ChEBI: a database and ontology for chemical entities of biological interest, Nucleic Acids Res., 36, D344, 10.1093/nar/gkm791 Mabee, 2007, Phenotype ontologies: the bridge between genomics and evolution, Trends Ecol. Evol., 22, 345, 10.1016/j.tree.2007.03.013 Gene Ontology Consortium, 2019, The gene ontology resource: 20 years and still GOing strong, Nucleic Acids Res., 47, D330, 10.1093/nar/gky1055 Junior, 2022, Predicting the top-level ontological concepts of domain entities using word embeddings, informal definitions, and deep learning, Expert Syst. Appl., 203 Devlin, 2018 Liu, 2019 Lan, 2019 Clark, 2020 Gangemi, 2002, Sweetening ontologies with DOLCE, 166 Gangemi, 2003, The OntoWordNet project: Extension and axiomatization of conceptual relations in WordNet, 820 Miller, 1995, WordNet: a lexical database for English, Commun. ACM, 38, 39, 10.1145/219717.219748 N. Mahmoud, H. Elbeh, H.M. Abdlkader, Ontology Learning Based on Word Embeddings for Text Big Data Extraction, in: 2018 14th International Computer Engineering Conference, ICENCO, 2018, pp. 183–188. L.F. Garcia, F.H. Rodrigues, A. Lopes, R.d.S.A. Kuchle, M. Perrin, M. Abel, What Geologists Talk About: Towards a Frequency-Based Ontological Analysis of Petroleum Domain Terms, in: ONTOBRAS, 2020, pp. 190–203. Chen, 2021, ADOL: a novel framework for automatic domain ontology learning, J. Supercomput., 77, 152, 10.1007/s11227-020-03261-7 Lamurias, 2019, BO-LSTM: classifying relations via long short-term memory networks along biomedical ontologies, BMC Bioinformatics, 20, 1, 10.1186/s12859-018-2584-5 Jullien, 2022 Francis, 1965, A standard corpus of edited present-day American English, Coll. Engl., 26, 267, 10.2307/373638 Prechelt, 1998, Early stopping-but when?, 55 Mikolov, 2017