- Yufei Tao, Ke Yi, Cheng Sheng, Panos Kalnis
- SIGMOD Conference
- 2009

Nearest neighbor (NN) search in high dimensional space is an important problem in many applications. Ideally, a practical solution (i) should be implementable in a relational database, and (ii) its…

- Yufei Tao, Ke Yi, Cheng Sheng, Panos Kalnis
- ACM Trans. Database Syst.
- 2010

Nearest Neighbor (NN) search in high-dimensional space is an important problem in many applications. From the database perspective, a good solution needs to have two properties: (i) it can be easily…

- Yufei Tao, Stavros Papadopoulos, Cheng Sheng, Kostas Stefanidis
- SIGMOD Conference
- 2011

This paper studies the nearest keyword (NK) problem on XML documents. In general, the dataset is a tree where each node is associated with one or more keywords. Given a node q and a keyword w, an NK…

- Yufei Tao, Cheng Sheng, Jian Pei
- SIGMOD Conference
- 2011

Given two vertices <i>s</i>, <i>t</i> in a graph, let <i>P</i> be the shortest path (SP) from <i>s</i> to <i>t</i>, and <i>P*</i> a subset of the vertices in <i>P</i>. <i>P*</i> is a <i>k</i>-skip shortest path from…

- Yufei Tao, Cheng Sheng, Jianzhong Li
- SIGMOD Conference
- 2010

An <i>(edge) hidden graph</i> is a graph whose edges are not explicitly given. Detecting the presence of an edge requires expensive <i>edge-probing</i> queries. We consider the <i>k</i> most connected…

- Cheng Sheng, Nan Zhang, Yufei Tao, Xin Jin
- PVLDB
- 2012

A hidden database refers to a dataset that an organization makes accessible on the web by allowing users to issue queries through a search interface. In other words, data acquisition from such a…

- Cheng Sheng, Yufei Tao
- PODS
- 2011

We consider the <i>skyline problem</i> (a.k.a. the <i>maxima problem</i>), which has been extensively studied in the database community. The input is a set <i>P</i> of <i>d</i>-dimensional points. A…

- Cheng Sheng, Yufei Tao
- PODS
- 2011

We consider the <i>orthogonal range aggregation</i> problem. The dataset <i>S</i> consists of <i>N</i> axis-parallel rectangles in R<sup>2</sup>, each of which is associated with an integer…

- Cheng Sheng, Yufei Tao
- ACM Trans. Database Syst.
- 2012

We consider the <i>skyline problem</i> (aka the <i>maxima problem</i>), which has been extensively studied in the database community. The input is a set <i>P</i> of <i>d</i>-dimensional points. A…

- Cheng Sheng, Yufei Tao
- PODS
- 2012

In the <i>top-K range reporting</i> problem, the dataset contains <i>N</i> points in the real domain ℜ, each of which is associated with a real-valued <i>score</i>. Given an interval…