Maintenance of top-k materialized views
Springer Science and Business Media LLC - Vol. 27 - pp. 95-137 - 2009
Eftychia Baikousi, Panos Vassiliadis
In this paper we present results on the problem of maintaining materialized top-k views and provide results in two directions. The first problem we tackle concerns the maintenance of top-k views in the presence of high deletion rates. We provide a principled method that complements the inefficiency of the state of the art independently of the statistical properties of the data and the characterist......
In-memory parallelization of join queries over large ontological hierarchies
Springer Science and Business Media LLC - Vol. 39 - pp. 545-582 - 2020
Dimitris Bilidas, Manolis Koubarakis
The Resource Description Framework (RDF) data model enables the construction of knowledge graphs over various domains, using ontologies in order to encode information about the domain, and simple statements in the form of subject-predicate-object triples for data representation, facilitating the interlinking and exchange of Web data. However, this simplicity comes with the cost of having to execut......
Speeding up AutoTuning of the Memory Management Options in Data Analytics
Springer Science and Business Media LLC - Vol. 38 - pp. 841-863 - 2020
Mayuresh Kunjir
Many solutions used towards building autonomous (or, self-driving) data processing systems today are trying to leverage the “black box” algorithm of Bayesian Optimization (BO) both due to its wider applicability and the theoretical guarantees provided on the quality of results produced. The black-box approach, however, could be time- and labor-intensive, or otherwise get stuck in a local minimum. We......
Multi-model query languages: taming the variety of big data
Springer Science and Business Media LLC - 2024
Qingsong Guo, Chao Zhang, Shuxun Zhang, Jiaheng Lu
A critical issue in Big Data management is to address the variety of data: data are produced by disparate sources, presented in various formats, and hence inherently involve multiple data models. Multi-Model DataBases (MMDBs) have emerged as a promising approach for dealing with this task as they are capable of accommodating multi-model data in a single sys......
In-network data acquisition and replication in mobile sensor networks
Springer Science and Business Media LLC - Vol. 29 - pp. 87-112 - 2010
Panayiotis Andreou, Demetrios Zeinalipour-Yazti, Panos K. Chrysanthis, George Samaras
This paper assumes a set of n mobile sensors that move in the Euclidean plane as a swarm. Our objectives are to explore a given geographic region by detecting and aggregating spatio-temporal events of interest and to store these events in the network until the user requests them. Such a setting finds applications in mobile environments where the user (i.e., the sink) is infrequently within communi......
Data challenges of time domain astronomy
Springer Science and Business Media LLC - Vol. 30 - pp. 371-384 - 2012
Matthew J. Graham, S. G. Djorgovski, Ashish Mahabal, Ciro Donalek, Andrew Drake, Giuseppe Longo
Astronomy has been at the forefront of the development of the techniques and methodologies of data intensive science for over a decade with large sky surveys and distributed efforts such as the Virtual Observatory. However, it faces a new data deluge with the next generation of synoptic sky surveys which are opening up the time domain for discovery and exploration. This brings both new scientific ......
Algorithms and framework for computing 2-body statistics on GPUs
Springer Science and Business Media LLC - Vol. 37 - pp. 587-622 - 2018
Napath Pitaksirianan, Zhila Nouri Lewis, Yi-Cheng Tu
Various types of two-body statistics (2-BS) are regarded as essential components of low-level data analysis in scientific database systems. In relational algebraic terms, a 2-BS is essentially a Cartesian product between two datasets (or two instances of the same dataset) followed by a user-defined aggregate. The quadratic complexity of these computations hinders timely processing of data. Use of ......