Internet search engines have popularized the keyword-based search paradigm. While traditional database management systems offer powerful query languages, they do not allow keyword-based search. In this paper, we discuss DBXplorer, a system that enables keyword-based search in relational databases. DBXplorer has been implemented using a commercial relational(More)
Euclidean spanners are important data structures in geometric algorithm design, because they provide a means of approximating the complete Euclidean graph with only O(n) edges, so that the shortest path length between each pair of points is not more than a constant factor longer than the Euclidean distance between the points. In many applications of(More)
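To make the "constant factor" in the spanner abstract above concrete, the following minimal sketch computes the stretch factor of a geometric graph: the largest ratio of shortest-path length to straight-line distance over all point pairs. The function name `stretch_factor`, the edge-list representation, and the unit-square example are illustrative assumptions, not taken from the paper, and the cubic-time all-pairs computation is chosen for brevity rather than efficiency.

```python
import math
from itertools import combinations

def stretch_factor(points, edges):
    """Largest ratio of graph distance to Euclidean distance over all point pairs.

    Assumes the points are distinct and `edges` is a list of index pairs (i, j).
    Returns math.inf if the graph is disconnected.
    """
    n = len(points)
    dist = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        dist[i][i] = 0.0
    # Each edge is weighted by the Euclidean length of its segment.
    for i, j in edges:
        d = math.dist(points[i], points[j])
        dist[i][j] = dist[j][i] = min(dist[i][j], d)
    # Floyd-Warshall all-pairs shortest paths (O(n^3), fine for small inputs).
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
    return max(
        dist[i][j] / math.dist(points[i], points[j])
        for i, j in combinations(range(n), 2)
    )

# Hypothetical example: the four corners of a unit square joined in a cycle.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]
print(stretch_factor(square, cycle))
```

In this example the worst pair is a diagonal: the path along two sides has length 2 while the straight-line distance is the square root of 2, so the cycle is a spanner with stretch factor equal to the square root of 2.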
We consider the problem of finding rules relating patterns in a time series to other patterns in that series, or patterns in one series to patterns in another series. A simple example is a rule such as "a period of low telephone call activity is usually followed by a sharp rise in call volume". Examples of rules relating two or more time series are "if the(More)
Matrix decomposition methods represent a data matrix as a product of two factor matrices: one containing basis vectors that represent meaningful concepts in the data, and another describing how the observed data can be expressed as combinations of the basis vectors. Decomposition methods have been studied extensively, but many methods return real-valued(More)
The problem of obtaining efficient answers to top-k queries has attracted a lot of research attention. Several algorithms and numerous variants of the top-k retrieval problem have been introduced in recent years. The general form of this problem requests the k highest ranked values from a relation, using monotone combining functions on(More)
Chew and Dobkin et al. have shown that the Delaunay triangulation and its variants are sparse approximations of the complete graph, in that the shortest distance between two sites within the triangulation is bounded by a constant multiple of their Euclidean separation. In this paper, we show that other classical triangulation algorithms,(More)
For the discovery of similar patterns in 1D time series, it is typical to normalize the data (for example, with a transformation that gives the data zero mean and unit standard deviation). Such transformations can reveal latent patterns and are very commonly used in data mining applications. However, when dealing with multidimensional(More)
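The normalization mentioned in the abstract above (zero mean, unit standard deviation) is commonly called z-normalization. The following is a minimal sketch for the 1D case only; the function name `z_normalize`, the use of the population standard deviation, and the handling of constant series are assumptions made for illustration, not details from the paper.

```python
import math

def z_normalize(series):
    """Return a copy of `series` shifted to zero mean and scaled to unit standard deviation."""
    n = len(series)
    mean = sum(series) / n
    # Population standard deviation (divides by n); this choice is an assumption.
    std = math.sqrt(sum((x - mean) ** 2 for x in series) / n)
    if std == 0:
        # A constant series has no shape to preserve; map it to all zeros.
        return [0.0 for _ in series]
    return [(x - mean) / std for x in series]

# Hypothetical example: two series with the same shape at different scales.
a = [1.0, 2.0, 3.0, 4.0]
b = [10.0, 20.0, 30.0, 40.0]
za, zb = z_normalize(a), z_normalize(b)
print(za)
print(zb)
```

After normalization the two series coincide up to floating-point rounding, which is how the transformation can expose a shared pattern that differences in offset and amplitude would otherwise hide.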
Overview: The goal of this paper is to rank the answers to a database query when many tuples are returned. The probabilistic IR model is extended to structured data to rank the results. The proposed solution is domain independent and makes use of workload statistics and correlation.
• The first step is to divide the attributes in a table into specified (the(More)