• Publications
  • Influence
Monitoring k-nearest neighbor queries over moving objects
TLDR
We propose two methods to monitork-NN queries over moving objects. Expand
  • 342
  • 28
  • PDF
LSH Ensemble: Internet-Scale Domain Search
TLDR
We present a new index structure, Locality Sensitive Hashing (LSH) Ensemble, that solves the domain search problem using set containment at Internet scale. Expand
  • 50
  • 10
  • PDF
Keyword query cleaning
TLDR
We introduce the problem of query cleaning for keyword search queries in a database context and propose a set of effective and efficient solutions. Expand
  • 63
  • 5
  • PDF
Table Union Search on Open Data
TLDR
We define the table union search problem and present a probabilistic solution for finding tables that are unionable with a query table within massive repositories. Expand
  • 49
  • 5
  • PDF
Discovering Linkage Points over Web Data
TLDR
We present a framework consisting of a library of efficient lexical analyzers and similarity functions, and a set of search algorithms for effective and efficient identification of linkage points over Web data. Expand
  • 34
  • 5
  • PDF
Concise descriptions of subsets of structured sets
TLDR
We study the problem of economical representation of subsets of structured sets, that is, sets equipped with a set cover. Expand
  • 15
  • 3
  • PDF
Modeling and control of discrete-event systems with hierarchical abstraction
  • K. Pu
  • Computer Science
  • 2000
TLDR
We show that the high-level supervisor of Figure 2.19 (b) is in fact not controllable, and must be adjusted. Expand
  • 20
  • 2
Making Open Data Transparent: Data Discovery on Open Data
TLDR
We present new table join and table union search solutions that provide interactive search speed even over massive collections of millions of attributes with heavily skewed cardinality distributions. Expand
  • 12
  • 2
  • PDF
Scalable Distributed Processing of K Nearest Neighbor Queries over Moving Objects
TLDR
We propose a new index structure called Dynamic Strip Index (DSI), which can better adapt to different data distributions than exiting grid indexes. Expand
  • 50
  • 1
  • PDF
Data Lake Management: Challenges and Opportunities
TLDR
We consider how data lakes are introducing new problems including dataset discovery and how they are changing the requirements for classic problems including data extraction, data cleaning, data integration, data versioning, and metadata management. Expand
  • 29
  • 1
  • PDF
...
1
2
3
4
5
...