Ontological databases with faceted queriesThe VLDB Journal - Tập 32 - Trang 103-121 - 2022
Tadeusz Pankowski
The success of the use of ontology-based systems depends on efficient and user-friendly methods of formulating queries against the ontology. We propose a method to query a class of ontologies, called facet ontologies (fac-ontologies), using a faceted human-oriented approach. A fac-ontology has two important features: (a) a hierarchical view of it can be defined as a nested facet over this ontology...... hiện toàn bộ
Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systemsThe VLDB Journal -
Lijie Xu, Shuang Qiu, Binhang Yuan, Jiawei Jiang, Cédric Renggli, Shaoduo Gan, Kaan Kara, Guoliang Li, Ji Liu, Wentao Wu, Jieping Ye, Ce Zhang
AbstractModern machine learning (ML) systems commonly use stochastic gradient descent (SGD) to train ML models. However, SGD relies on random data order to converge, which usually requires a full data shuffle. For in-DB ML systems and deep learning systems with large datasets stored on block-addressable secondary storage such as HDD and S...... hiện toàn bộ
Space efficiency in group recommendationThe VLDB Journal - Tập 19 - Trang 877-900 - 2010
Senjuti Basu Roy, Sihem Amer-Yahia, Ashish Chawla, Gautam Das, Cong Yu
Imagine a system that gives you satisfying recommendations when you want to rent a movie with friends or find a restaurant to celebrate a colleague’s farewell: at the core of such a system is what we call group recommendation. While computing individual recommendations have received lots of attention (e.g., Netflix prize), group recommendation has been confined to studying users’ satisfaction with...... hiện toàn bộ
SW-Store: a vertically partitioned DBMS for Semantic Web data managementThe VLDB Journal - Tập 18 - Trang 385-406 - 2009
Daniel J. Abadi, Adam Marcus, Samuel R. Madden, Kate Hollenbach
Efficient management of RDF data is an important prerequisite for realizing the Semantic Web vision. Performance and scalability issues are becoming increasingly pressing as Semantic Web technology is applied to real-world applications. In this paper, we examine the reasons why current data management solutions for RDF data scale poorly, and explore the fundamental scalability limitations of these...... hiện toàn bộ
SQL extension for spatio-temporal dataThe VLDB Journal - - 2006
Jose R. Rios Viqueira, Nikos A. Lorentzos
An SQL extension is formalized for the management of spatio-temporal data, i.e. of spatial data that evolves with respect to time. The extension is dedicated to applications such as topography, cartography, and cadastral systems, hence it considers discrete changes both in space and in time. It is based on the rigid formalization of data types and of SQL constructs. Data types are defined in terms...... hiện toàn bộ
IntroductionThe VLDB Journal - Tập 7 Số 4 - Trang 205-205 - 1998
M. Tamer Özsu, Stavros Christodoulakis
PSoup: a system for streaming queries over streaming dataThe VLDB Journal - Tập 12 - Trang 140-156 - 2003
Sirish Chandrasekaran, Michael J. Franklin
Recent work on querying data streams has focused on systems where newly arriving data is processed and continuously streamed to the user in real time. In many emerging applications, however, ad hoc queries and/or intermittent connectivity also require the processing of data that arrives prior to query submission or during a period of disconnection. For such applications, we have developed PSoup, a...... hiện toàn bộ
A unified framework for string similarity search with edit-distance constraintThe VLDB Journal - Tập 26 - Trang 249-274 - 2016
Minghe Yu, Jin Wang, Guoliang Li, Yong Zhang, Dong Deng, Jianhua Feng
String similarity search is a fundamental operation in data cleaning and integration. It has two variants: threshold-based string similarity search and top-
$$k$$
string similarity search. Existing algorithms are efficient for either the former or...... hiện toàn bộ