Holistic twig joins: optimal XML pattern matching
- Nicolas Bruno, N. Koudas, D. Srivastava
- Computer ScienceACM SIGMOD Conference
- 3 June 2002
This paper proposes a novel holistic twig join algorithm, TwigStack, that uses a chain of linked stacks to compactly represent partial results to root-to-leaf query paths, which are then composed to obtain matches for the twig pattern.
Structural joins: a primitive for efficient XML query pattern matching
- S. Al-Khalifa, H. Jagadish, J. Patel, Yuqing Wu, N. Koudas, D. Srivastava
- Computer ScienceProceedings / International Conference on Data…
- 7 August 2002
It is shown that, in some cases, tree-merge algorithms can have performance comparable to stack-tree algorithms, in many cases they are considerably worse, and this behavior is explained by analytical results that demonstrate that, on sorted inputs, the stack- tree algorithms have worst-case I/O and CPU complexities linear in the sum of the sizes of inputs and output, while the tree-MERge algorithms do not have the same guarantee.
Approximate String Joins in a Database (Almost) for Free
- L. Gravano, Panagiotis G. Ipeirotis, H. Jagadish, N. Koudas, S. Muthukrishnan, D. Srivastava
- Computer ScienceVery Large Data Bases Conference
- 11 September 2001
This paper develops a technique for building approximate string join capabilities on top of commercial databases by exploiting facilities already available in them, and demonstrates experimentally the benefits of the technique over the direct use of UDFs.
TwitterMonitor: trend detection over the twitter stream
- M. Mathioudakis, N. Koudas
- Computer ScienceSIGMOD Conference
- 6 June 2010
TwitterMonitor, a system that performs trend detection over the Twitter stream and provides meaningful analytics that synthesize an accurate description of each topic on Twitter in real time, is presented.
Optimal Histograms with Quality Guarantees
- H. Jagadish, N. Koudas, S. Muthukrishnan, V. Poosala, K. Sevcik, Torsten Suel
- Computer ScienceVery Large Data Bases Conference
- 24 August 1998
Algorithms for computing optimal bucket boundaries in time proportional to the square of the number of distinct data values, for a broad class of optimality metrics and an enhancement to traditional histograms that allows us to provide quality guarantees on individual selectivity estimates are presented.
MRShare: Sharing Across Multiple Queries in MapReduce
- Tomasz Nykiel, Michalis Potamias, Chaitanya Mishra, G. Kollios, N. Koudas
- Computer ScienceProceedings of the VLDB Endowment
- 1 September 2010
A sharing framework tailored to MapReduce is proposed that transforms a batch of queries into a new batch that will be executed more efficiently, by merging jobs into groups and evaluating each group as a single query.
Monitoring k-nearest neighbor queries over moving objects
- Xiaohui Yu, K. Pu, N. Koudas
- Computer ScienceIEEE International Conference on Data Engineering
- 5 April 2005
This work proposes two efficient and scalable algorithms using grid indices based on indexing objects and queries for k-nearest neighbor queries over moving objects within a geographic area, and shows that these algorithms significantly outperform R-tree-based solutions.
Aggregate Query Answering on Anonymized Tables
- Qing Zhang, N. Koudas, D. Srivastava, Ting Yu
- Computer ScienceIEEE International Conference on Data Engineering
- 15 April 2007
A general framework of permutations-based anonymization to support accurate answering of aggregate queries is presented and it is shown that, for the same grouping, permutation-based techniques can always answer aggregate queries more accurately than generalization-based approaches.
PREFER: a system for the efficient execution of multi-parametric ranked queries
- Vagelis Hristidis, N. Koudas, Y. Papakonstantinou
- Computer ScienceACM SIGMOD Conference
- 1 May 2001
The results indicate that the proposed algorithms are superior in performance compared to other approaches, both in preprocessing (preparation of materialized views) as well as execution time.
Keyword proximity search in XML trees
- Vagelis Hristidis, N. Koudas, Y. Papakonstantinou, D. Srivastava
- Computer Science, EconomicsIEEE Transactions on Knowledge and Data…
- 1 April 2006
XML keyword, proximity queries are defined to return the (possibly heterogeneous) set of minimum connecting trees (MCTs) of the matches to the individual keywords in the query to efficiently execute keyword proximity queries on labeled trees (XML).
...
...