A comparison of adaptive radix trees and hash tables

@inproceedings{Alvarez2015ACO,
  title={A comparison of adaptive radix trees and hash tables},
  author={V. {\'A}lvarez and Stefan Richter and Xiao Chen and J. Dittrich},
  booktitle={2015 IEEE 31st International Conference on Data Engineering},
  year={2015},
  pages={1227--1238}
}
With main-memory prices constantly decreasing, people are increasingly interested in performing their computations in main memory, leaving the high I/O costs of traditional disk-based systems out of the equation. [...]
Key Result: The authors of ART presented experiments indicating that ART was clearly a better choice than other recent tree-based data structures such as FAST and B+-trees. However, ART was not the first adaptive radix tree. To the best of our knowledge, the first was the Judy Array (Judy for…
Efficient Processing of Range Queries in Main Memory
TLDR
Proposes the cache-sensitive skip list, a cache-optimized, updatable main-memory index structure that targets range queries on single database columns, and the BB-Tree, a novel, fast, and space-efficient main-memory index structure that supports multidimensional range and point queries and provides a parallel search operator that leverages the multi-threading capabilities of modern CPUs.
Cache-Sensitive Skip List: Efficient Range Queries on Modern CPUs
TLDR
This work presents Cache-Sensitive Skip Lists (CSSL), a novel index structure that is optimized for range queries and exploits modern CPUs. CSSL is based on a cache-friendly data layout and traversal algorithm that minimizes cache misses and branch mispredictions and allows SIMD instructions to be exploited for search.
Parallelizing Approximate Search on Adaptive Radix Trees
TLDR
This work uses the edit distance to compare two search keys in the tree and select appropriate values, and proposes several variations of the CPU algorithm, such as fixed vs. dynamic memory layouts and pointer vs. pointer-less data structures.
S3: A Scalable In-memory Skip-List Index for Key-Value Store
TLDR
Experiments show that S3 achieves performance comparable to other new memory indexing schemes and can replace the current in-memory skip list of LevelDB and RocksDB to support huge volumes of data.
START — Self-Tuning Adaptive Radix Tree
TLDR
This work introduces START, a self-tuning variant of ART that uses nodes spanning multiple key bytes and performs on average 85% faster than a regular ART on a wide variety of read-only workloads and 45% faster on read-mostly workloads.
A Six-dimensional Analysis of In-memory Aggregation
TLDR
The results show that the ideal approach in a given situation depends on the input and the workload, and sorting algorithms are faster in holistic aggregate queries, whereas hash tables perform better in distributive queries.
Efficient indexing for big data in Hadoop MapReduce and main memory databases
TLDR
This study indicates that choosing the right hashing method and configuration can make an order of magnitude difference in insert and lookup performance. It identifies seven key factors that influence hashing performance, evaluates their impact, and discusses the implications for hashing in modern databases.
Modern Multi-Dimensional In-Memory Index Structures
Simple ideas for building multi-dimensional indices only work for certain settings or are limited in their performance. For example, linearizing multiple dimensions onto a single-dimensional index…
Parallel Index-based Stream Join on a Multicore CPU
TLDR
This paper introduces an index data structure, called the partitioned in-memory merge tree, to address the challenges that arise when indexing highly dynamic data, which are common in streaming settings, and proposes a low-cost and effective concurrency control mechanism to meet the demands of high-rate update queries.
A Hybrid B+-tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms
TLDR
Proposes the "HB+-tree," a novel B-tree design based on the heterogeneous computing platform and the hybrid memory architecture found in GPUs. It targets use cases that need high search throughput, and it is unique in its joint and simultaneous use of the computing and memory resources of CPU-GPU systems.

References

The adaptive radix tree: ARTful indexing for main-memory databases
Main memory capacities have grown up to a point where most databases fit into RAM. For main-memory database systems, index structure performance is a critical bottleneck. Traditional in-memory data…
Making B+- trees cache conscious in main memory
Previous research has shown that cache behavior is important for main memory index structures. Cache conscious index structures such as Cache Sensitive Search Trees (CSS-Trees) perform lookups much…
FAST: fast architecture sensitive tree search on modern CPUs and GPUs
TLDR
FAST is an extremely fast architecture sensitive layout of the index tree logically organized to optimize for architecture features like page size, cache line size, and SIMD width of the underlying hardware, and achieves a 6X performance improvement over uncompressed index search for large keys on CPUs.
Space Efficient Hash Tables with Worst Case Constant Access Time
TLDR
This is the first dictionary that has worst case constant access time and expected constant update time, works with (1 + ε)n space, and supports satellite information.
Balanced Allocation and Dictionaries with Tightly Packed Constant Size Bins
TLDR
It is shown that ε > (2/e)^(d-1) is sufficient to guarantee that, with high probability, each ball can be put into one of the two bins assigned to it without any bin overflowing.
On risks of using cuckoo hashing with simple universal hash classes
TLDR
It is proved that the failure probability is high when cuckoo hashing is run with the multiplicative class or with the very common class of linear hash functions over a prime field, even if space 4n is provided.
Cuckoo hashing
TLDR
A simple dictionary with worst-case constant lookup time, equaling the theoretical performance of the classic dynamic perfect hashing scheme of Dietzfelbinger et al., that is competitive with the best known dictionaries having an average-case (but no nontrivial worst-case) guarantee on lookup time.
A Reliable Randomized Algorithm for the Closest-Pair Problem
TLDR
In the course of solving the duplicate-grouping problem, a new universal class of hash functions of independent interest is described, and it is shown that both of the foregoing problems can be solved by randomized algorithms that use O(n) space and finish in O(n) time with probability tending to 1 as n grows to infinity.
The art of computer programming: sorting and searching (volume 3)
Some Open Questions Related to Cuckoo Hashing
The purpose of this brief note is to describe recent work in the area of cuckoo hashing, including a clear description of several open problems, with the hope of spurring further research.