Reproducible experiments on Three-Dimensional Entity Resolution with JedAI

Information Systems - Tập 102 - Trang 101830 - 2021
George Mandilaras1, George Papadakis1, Luca Gagliardelli2, Giovanni Simonini2, Emmanouil Thanos3, George Giannakopoulos4, Sonia Bergamaschi2, Themis Palpanas5, Manolis Koubarakis1, Alicia Lara-Clares6, Antonio Fariña7
1National and Kapodistrian University of Athens, Greece
2University of Modena and Reggio Emilia, Italy
3KU Leuven, Belgium
4NCSR Demokritos, Greece
5University of Paris & French University Institute (IUF), France
6NLP & IR Research Group, Universidad Nacional de Educación a Distancia (UNED), Spain
7University of A Coruña, CITIC, Database Lab, Spain

Tài liệu tham khảo

Papadakis, 2021 Christen, 2012 Dong, 2015 Christophides, 2015 Elmagarmid, 2007, Duplicate record detection: A survey, IEEE Trans. Knowl. Data Eng., 19, 1, 10.1109/TKDE.2007.250581 Papadakis, 2020, Blocking and filtering techniques for entity resolution: A survey, ACM Comput. Surv., 53, 31:1 Christophides, 2020, An overview of end-to-end entity resolution for big data, ACM Comput. Surv., 53, 10.1145/3418896 Getoor, 2012, Entity resolution: Theory, practice & open challenges, Proc. VLDB Endow., 5, 2018, 10.14778/2367502.2367564 Stefanidis, 2014, Entity resolution in the web of data, 203 G. Papadakis, T. Palpanas, Web-scale, schema-agnostic, end-to-end entity resolution, in: The Web Conference (WWW), Lyon, France, 2018. Papadakis, 2020, Entity resolution: Past, present and yet-to-come, 647 Papadakis, 2020, Three-dimensional entity resolution with JedAI, Inf. Syst., 93, 10.1016/j.is.2020.101565 Papadakis, 2020 J. Euzenat, A. Ferrara, C. Meilicke, J. Pane, F. Scharffe, P. Shvaiko, H. Stuckenschmidt, O. Sváb-Zamazal, V. Svátek, C.T. dos Santos, Results of the Ontology Alignment Evaluation Initiative 2010, in: Proceedings of the 5th International Workshop on Ontology Matching (OM-2010), 2010. 2010 Köpcke, 2010, Evaluation of entity resolution approaches on real-world match problems, Proc. VLDB Endow., 3, 484, 10.14778/1920841.1920904 2010 Gokhale, 2014, Corleone: hands-off crowdsourcing for entity matching, 601 S. Das, A. Doan, G.C.P. Suganthan, C. Gokhale, P. Konda, Y. Govind, D. Paulsen, The Magellan Data Repository, https://sites.google.com/site/anhaidgroup/projects/data. Papadakis, 2011, Efficient entity resolution for large heterogeneous information spaces, 535 G. Papadakis, Blocking Framework, https://sourceforge.net/projects/erframework/. McCallum, 2000, Efficient clustering of high-dimensional data sets with application to reference matching, 169 Repeatability Datasets, https://hpi.de/naumann/projects/repeatability/datasets.html. U. Draisbach, F. Naumann, A comparison and generalization of blocking and windowing algorithms for duplicate detection, in: Proceedings of the International Workshop on Quality in Databases (QDB), 2009, pp. 51–56. Kenig, 2013, Mfiblocks: An effective blocking algorithm for entity resolution, Inf. Syst., 38, 908, 10.1016/j.is.2012.11.008 Konda, 2016, Magellan: Toward building entity matching management systems, Proc. VLDB Endow., 9, 1197, 10.14778/2994509.2994535 Mudgal, 2018, Deep learning for entity matching: A design space exploration, 19