Knowledge based collection selection for distributed information retrieval

Information Processing & Management - Tập 54 - Trang 116-128 - 2018
Baoli Han1, Ling Chen1, Xiaoxue Tian1
1College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China

Tài liệu tham khảo

Aly, 2013, Taily: Shard selection using the tail of score distributions, 673 Arguello, 2008, Document representation and query expansion models for blog recommendation, 2008, 1 Baillie, 2011, A multi-collection latent topic model for federated search, Information Retrieval, 14, 390, 10.1007/s10791-010-9147-3 Bizer, 2009, DBpedia-A crystallization point for the web of data, Web Semantics: Science, Services and Agents on the World Wide Web, 7, 154, 10.1016/j.websem.2009.07.002 Blei, 2003, Latent Dirichlet allocation, Journal of Machine Learning Research, 3, 993 Bollacker, 2008, Freebase: A collaboratively created graph database for structuring human knowledge, 1247 Borgatti, 2005, Centrality and network flow, Social Networks, 27, 55, 10.1016/j.socnet.2004.11.008 Callan, 2002, Distributed information retrieval, 127 Callan, 2001, Query-based sampling of text databases, ACM Transactions on Information Systems, 19, 97, 10.1145/382979.383040 Callan, 1995, Searching distributed collections with inference networks, 21 Carpineto, 2012, A survey of automatic query expansion in information retrieval, ACM Computing Surveys, 44, 1, 10.1145/2071389.2071390 Chowdhury, 2010 Cilibrasi, 2007, The Google similarity distance, IEEE Transactions on Knowledge and Data Engineering, 19, 10.1109/TKDE.2007.48 Crestani, 2013, Distributed information retrieval and applications, 865 Dang, 2010, Query reformulation using anchor text, 41 D'Souza, 2004, Collection selection for managed distributed document databases, Information Processing and Management, 40, 527, 10.1016/S0306-4573(03)00008-6 Eiron, 2003, Analysis of anchor text for web search, 459 Francès, 2014, Improving the efficiency of multi-site web search engines, 3 Gabrilovich, 2007, Computing semantic relatedness using Wikipedia-based explicit semantic analysis, 1606 Gravano, 1995, Generalizing GlOSS to vector-space databases and broker hierarchies, 78 Guisado-Gámez, 2014, Massive query expansion by exploiting graph knowledge bases for image retrieval, 33 Hoffart, 2013, YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia, Artificial Intelligence, 194, 28, 10.1016/j.artint.2012.06.001 Hulpus, 2013, Unsupervised graph-based topic labelling using dbpedia, 465 Kim, 2016, Load-balancing in distributed selective search, 905 Kulkarni, 2010, Document allocation policies for selective searching of distributed indexes, 449 Kulkarni, 2015, Selective search: Efficient and effective search of large textual collections, ACM Transactions on Information Systems, 33, 17, 10.1145/2738035 Kulkarni, 2012, Shard ranking and cutoff estimation for topically partitioned collections, 555 Mendes, 2011, DBpedia spotlight: Shedding light on the web of documents, 1 Mendoza, 2016, Reducing hardware hit by queries in web search engines, Information Processing and Management, 52, 1031, 10.1016/j.ipm.2016.04.008 Navigli, 2012, BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network, Artificial Intelligence, 193, 217, 10.1016/j.artint.2012.07.001 Ni, 2016, Semantic documents relatedness using concept graph representation, 635 Paltoglou, 2011, Modeling information sources as integrals for effective and efficient source selection, Information Processing and Management, 47, 18, 10.1016/j.ipm.2010.02.004 Schuhmacher, 2014, Knowledge-based graph document modeling, 543 Shokouhi, 2007, Central-rank-based collection selection in uncooperative distributed information retrieval, 160 Shokouhi, 2006, Capturing collection size for distributed non-cooperative retrieval, 316 Shokouhi, 2007, Using query logs to establish vocabularies in distributed information retrieval, Information Processing and Management, 43, 169, 10.1016/j.ipm.2006.04.003 Si, 2003, Relevant document distribution estimation method for resource selection, 298 Strohman, 2005, Indri: A language model-based search engine for complex queries, 2 Thomas, 2009, SUSHI: Scoring scaled samples for server selection, 419 Wauer, 2011, Integrating explicit semantic analysis for ontology-based resource selection, 519 Witten, 2008, An effective, low-cost measure of semantic relatedness obtained from Wikipedia links, 25 Xu, 1999, Cluster-based language models for distributed retrieval, 254 Yuwono, 1997, Server ranking for distributed text retrieval systems on the internet, 41