We consider the problem of organizing large scale earth science raster data to
efficiently handle queries for identifying regions whose parameters fall within
certain range values specified by the queries. This problem seems to be critical
to enabling basic data mining tasks such as determining associations between
physical phenomena and spatial factors, detecting changes and trends, and
content b... hiện toàn bộ
#Geoscience #Large-scale systems #Data mining #Information retrieval #Content based retrieval #Data structures #Computational modeling #Educational institutions #Organizing #Tree data structures
D. Shasha, J.T.L. Wang, Huiyuan Shan, Kaizhong Zhang
An unordered labeled tree is a tree in which each node has a string label and
the parent-child relationship is significant, but the order among siblings is
unimportant. This paper presents an approach to the nearest neighbor search
problem for these trees. Given a database D of unordered labeled trees and a
query tree Q, the goal is to find those trees in D that "approximately" contain
Q. Our appr... hiện toàn bộ
Since the last two decades, image database management has been practiced using
different image representation methods. In the literature, images are
represented using two paradigms: the metadata-based and the content-based
representations. Image retrieval using the metadata is done using the
traditional database operations. However, image retrieval by its low-level
features requires similarity-bas... hiện toàn bộ
A.C. Jones, J.S. Robinson, W.A. Gray, J.P. Giddy, N.J. Fiddian
In the GRAB (Grid and Biodiversity) project we are developing a prototype to
illustrate some aspects of the GRID's potential for collaborative biodiversity
research. A catalogue of life, two species information systems (SISs) and a
climate database are made available in a problem solving environment that
demonstrates how bioclimatic modelling can be performed by bringing together
such resources. W... hiện toàn bộ
Summary form only given. We deal with the uncertainty in spatial and
spatiotemporal databases. Due to lack of accurate measurements, or rapid changes
in time, spatial and spatiotemporal data are often uncertain. Our work presents
new abstract and discrete models for uncertain spatial and spatiotemporal
information. The models are based on the principle that one knows that the
uncertain object, reg... hiện toàn bộ
This paper presents an approach to answering queries over an ontology modelled
using a description logic. The ontology acts as a global schema, providing a
declarative description of the concepts of the domain, the instances of which
are stored in (potentially many) object-wrapped sources. Queries are expressed
using terms from the rich vocabulary of the ontology, and are translated into an
equiva... hiện toàn bộ
We investigate the problem of searching similar multiattribute time sequences.
Such sequences arise naturally in a number of medical, financial, video, weather
forecast, and stock market databases where more than one attribute is of
interest at a time instant. We first solve the simple case in which the distance
is defined as the Euclidean distance. Later we extend it to shift and scale
invariance... hiện toàn bộ
Scientific research relies as much on the dissemination and exchange of data
sets as on the publication of conclusions. Accurately tracking the lineage
(origin and subsequent processing history) of scientific data sets is thus
imperative for the complete documentation of scientific work. However, the lack
of a definitive data model for lineage, and the poor fit between current data
management tool... hiện toàn bộ
The performance of nearest neighbor (NN) queries degrades noticeably with
increasing dimensionality of the data due to reduced selectivity of
high-dimensional data and an increased number of seek operations during NN-query
execution. If the NN-radii were known in advance, the disk accesses could be
reordered such that seek operations are minimized. We therefore propose a new
way of estimating the ... hiện toàn bộ