Combining information on structure and content to automatically annotate natural science spreadsheets

International Journal of Human-Computer Studies - Tập 103 - Trang 63-76 - 2017
Martine de Vos1, Jan Wielemaker1, Hajo Rijgersberg2, Guus Schreiber1, Bob Wielinga1, Jan Top1,2
1Computer Science, Network Institute, VU University Amsterdam, De Boelelaan 1081, 1081HV Amsterdam, The Netherlands
2Wageningen University and Research Centre, Food and Biobased Research, P.O. Box 17, NL-6700 AA Wageningen, The Netherlands

Tài liệu tham khảo

Abraham, R., Erwig, M., 2006. Inferring Templates from Spreadsheets. In: Proceedings of the 28th International Conference on Software Engineering. ACM, Shanghai, China, pp. 182–191. Assem, M.V., Rijgersberg, H., Wigham, M., Top, J., 2010. Converting and annotating quantitative data. In: Patel-Schneider, P. (Ed.), ISWC2010. pp. 16–31. Bellon-Maurel, 2014, Streamlining life cycle inventory data generation in agriculture using traceability data and information and communication technologies - Part I: Concepts and technical basis, J. Cleaner Prod., 69, 60, 10.1016/j.jclepro.2014.01.079 Cafarella, 2008, WebTables, Proceedings of the VLDB Endowment, 1, 538, 10.14778/1453856.1453916 Chen, Z., Cafarella, M., 2013. Automatic web spreadsheet data extraction. In: Proceedings of the 3rd International Workshop on Semantic Search Over the Web - SS@ ’13. pp. 1–8. URL 〈http://dl.acm.org.prox.lib.ncsu.edu/citation.cfm?id=2509908.2509909〉. Connor, M.J.O., Halaschek-wiener, C., Musen, M.A., 2010. Mapping master : a flexible approach for mapping spreadsheets to OWL. In: The Semantic WebISWC. Springer, Berlin, Heidelberg, pp. 194–208. De Vos, M., Van Hage, W.R., Ros, J., Schreiber, A., 2012. Reconstructing semantics of scientific models: a case study. In: Proceedings of the OEDW Workshop on Ontology Engineering in a Data Driven World, EKAW 2012, Galway, Ireland. De Vos, M.G., Wielemaker, J., Wielinga, B., Schreiber, G., Top, J., 2015. A methodology for constructing the calculation model of scientific spreadsheets. In: Proceedings of the 8th International Conference on Knowledge Capture. Delbridge, 2013, A whole-farm profitability analysis of organic and conventional cropping systems, Agricult. Syst., 122, 1, 10.1016/j.agsy.2013.07.007 Fisher, M., Rothermel, G., 2005. The EUSES spreadsheet corpus: a shared resource for supporting experimentation with spreadsheet dependability mechanisms. In: ACM SIGSOFT Software Engineering Notes. vol. 1. pp. 1–5 URL 〈http://doi.acm.org/10.1145/1082983.1083242%5Cnhttp://dl.acm.org/citation.cfm?id=1083242〉. Garcia-silva, A., Gomez-perez, A., Suarez-figueroa, M. C., Villazon-terrazas, B., 2008. A Pattern based approach for re-engineering non-ontological resources into ontologies. In: The Semantic Web. No. 2. Springer Berlin Heidelberg, pp. 167–181. González-Beltrán, A., Maguire, E., Sansone, S.-A., Rocca-Serra, P., 2014. linkedISA: semantic representation of ISA-Tab experimental metadata. BMC bioinformatics 15 Suppl 1 (Suppl 14), S4 URL 〈http://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-15-S14-S4%5Cnhttp://www.biomedcentral.com/1471-2105/15/S14/S4〉. Han, L., Finin, T., Parr, C., Sachs, J., Joshi, A., 2008. RDF123: From Spreadsheets to RDF. In: The Semantic Web-ISWC 2008. Springer, Berlin, Heidelberg, pp. 451–466. Hermans, F., Pinzger, M., Deursen, A.V., 2010. Automatically Extracting Class Diagrams from Spreadsheets. In: 24th European Conference on Object-Oriented Programming (ECOOP), Lecture Notes in Computer Science, Springer-Verlag, Maribor, Slovenia, pp. 52–75. Hipfl, S., 2004. Using Layout Information for Spreadsheet Visualization. In: Proceedings of the European Spreadsheet Risks Interest Group 5th Annual Conference. Klagenfurt, Austria. Hodgson, R., Keller, P.J., Hodges, J., Spivak, J., 2014. QUDT – Quantities, Units, Dimensions and Data Types in OWL and XML; Version 1.1 URL 〈http://qudt.org/〉. Ibarra, 2013, Enhancing the potential value of environmental services in urban wetlands, Cities, 31, 438, 10.1016/j.cities.2012.08.002 Langegger, A., Wöß, W., 2009. XLWrap–querying and integrating arbitrary spreadsheets with SPARQL. International Semantic Web Conference. Springer, Berlin, Heidelberg. 〈http://dx.doi.org/10.1007/978-3-642-04930-9_23〉. Limaye, G., Sarawagi, S., Chakrabarti, S., 2010. Annotating and searching web tables using entities, types and relationships. In: Proceedings of the VLDB Endowment, vol. 3. pp. 1338–1347. URL 〈http://portal.acm.org/citation.cfm?id=1921005〉. Maguire, 2013, OntoMaton, Bioinformatics, 29, 525, 10.1093/bioinformatics/bts718 Malcolm, 2015, Energy and greenhouse gas analysis of northeast U.S. dairy cropping systems, Agricul. Ecosyst. Environ., 199, 407, 10.1016/j.agee.2014.10.007 McDonald, 2012, The Biological Observation Matrix (BIOM) format or, GigaScience, 1, 7, 10.1186/2047-217X-1-7 Meroño-Peñuela, A., Ashkpour, A., Rietveld, L., Hoekstra, R., Schlobach, S., 2013. Linked Humanities Data : The next frontier? A case-study in historical census data. In: The Semantic Web: Semantics and Big Data. Springer, Berlin, Heidelberg, pp. 645–649. Mittermeir, R., Clermont, M., 2002. Finding High-Level Structures in Spreadsheet Programs. In: Proceedings of the 9th Working Conference on Reverse Engineering, Richmond, VA, USA, pp. 221–232. Mulwad, V., Finin, T., Joshi, A., 2012. A Domain Independent Framework for Extracting Linked Semantic Data from Tables. In: Search Computing. Springer, Berlin, Heidelberg, pp. 16–33. Plevin, 2009, Modeling corn ethanol and climate, J. Ind. Ecol., 13, 495, 10.1111/j.1530-9290.2009.00138.x Rayner, 2006, A simple spreadsheet-based, MIAME-supportive format for microarray data, BMC Bioinform., 7, 489, 10.1186/1471-2105-7-489 Rijgersberg, 2011, How semantics can improve engineering processes, Adv. Eng. Inform., 25, 276, 10.1016/j.aei.2010.07.008 Rocha Bernardo, 2013, Extracting and semantically integrating implicit schemas from multiple spreadsheets of biology based on the recognition of their nature, J. Inf. Database Manage., 4, 104 Sansone, 2012, Toward interoperable bioscience data, Nat. Genet., 44, 121, 10.1038/ng.1054 Segal, 2008, Developing scientific software, IEEE Softw., 25, 18, 10.1109/MS.2008.85 Shu, 2015, A semantic approach to data translation, Knowl.-Based Syst., 75, 104, 10.1016/j.knosys.2014.11.023 Stoilos, G., Stamou, G., Kollias, S., 2005. A String Metric for Ontology Alignment. In: The Semantic WebISWC, 2005. pp. 624–637. Venetis, P., Halevy, A., Madhavan, J., 2011. Recovering semantics of tables on the web. In: Proceedings of the VLDB Endowment, vol. 4. pp. 528–538. URL 〈http://dl.acm.org/citation.cfm?id=2002939〉. Wolstencroft, K., Owen, S., Horridge, M., Krebs, O., Mueller, W., Snoep, J. L., du Preez, F., Goble, C., 2011. RightField: embedding ontology annotation in spreadsheets. Bioinformatics Oxford, England 27 (July (14)), 2021-2. URL 〈http://www.ncbi.nlm.nih.gov/pubmed/21622664〉.