A negotiation agent for distributed heterogeneous statistical databases

S. McClean1, R. Pairceir1, B. Scotney1, K. Greer1
1School of Information and Software Engineering, University of Ulster, UK

Tóm tắt

The World-Wide Web provides an ever-increasing source of diverse information. We focus on query agents, in particular the matching and negotiation agents that are responsible for pre-integration where the matching agent decomposes the query into sub-queries, and then searches metadata to find datasets that match the query fragments. In the case of heterogeneous data, the matching agent utilises a negotiation agent to find datasets that match the query fragments, provides mappings from the data to the query, and constructs the appropriate (sub-)query re-writing rules. Such matching is done by generalising the data and testing if the (sub) query is matchable to the generalised (meta) data: we call this g-matchable; if it is then we can construct an operator stack to transform the data to match the (sub) query. Such an approach provides a capability of automating the process of executing queries on heterogeneous statistical databases that are distributed over the Internet. The novelty lies in the provision of automated methods for statistical aggregates, where the heterogeneity essentially resides in the classification schemes of categorical data, including both heterogeneity of nomenclature and heterogeneity of granularity. In addition, our solution permits queries to be specified in a goal-driven query-by-example format. Rather than impose an a priori global standard, the user can query through a unified interface where integration is done at run-time.

Từ khóa

#Distributed databases #Intelligent agent #Aggregates #Statistics #Distributed computing #Internet #Testing #Runtime #Software agents #Artificial intelligence

Tài liệu tham khảo

heflin, 1999, Coping with Changing Ontologies in a Distributed Environment, Proceedings of the AAAI-99 Workshop on Ontology Management heflin, 2000, Dynamic Ontologies on the Web, Proceedings of the 17th National Conference on Artificial Intelligence (AAAI-00), 443 10.1109/4236.643942 10.1007/978-3-662-03678-5_12 hulliger, 1998, Linking of Classifications by Linear Mappings, Journal of Official Statistics, 14, 255 levy, 1996, Querying Heterogeneous Information Sources using Source Descriptions, Proc VL'96 2000, METANET mcclean, 0, Adding Context to the Retrieval of Aggregate Data from Distributed Databases via the Internet, International Journal of Information Systems mcclean, 2002, A Scalable Approach to Integrating Heterogeneous Aggregate Views of Distributed Databases, IEEE Transactions on Data and Knowledge Engineering forthcoming 10.1142/S0218213002000782 duschka, 1997, Query Optimisation using Local Completeness, Proc AAAI-97, 249 10.1145/170035.170118 10.1145/354756.354853 farquhar, 1996, The Onto lingua Server: a Tool for Collaborative Ontology Construction, Proc of KAW96 duschka, 1997, Recursive Plans for Information Gathering, Proc IJCAI-97, 778 10.1006/knac.1993.1008 10.1145/290593.290605 doan, 2000, Learning Source Description for Data Integration, Proc WebDB, 81 häder, 1999, The Intrinsic Problems of Structural Heterogeneity and an Approach to their Solution, The VLDB Journal, 8, 25, 10.1007/s007780050072 10.1007/978-3-7091-7557-6_67 musen, 1998, Modern Architectures for Intelligent Systems: Reusable Ontologies and Problem-Solving Methods, AMIA Annual Symposium, 46 noy, 2000, PROMPT: Algorithms and Tools for Automated Ontology Merger and Alignment, Proceedings of AAAI-00 2000 pongas, 2001, RAMON: Eurostat's Classification Server, Proc NTTS'2001 papageorgiou, 0, A Statistical Metadata Model for Simultaneous Manipulation of both Data and Metadata, International Journal of Intelligent Systems Forthcoming 10.1017/S0269888900007797 uschold, 2000, Creating, Integrating and Maintaining Local and Global Ontologies, Proc ECAI