Optimizing database architecture for the new bottleneck: memory access

@article{Manegold2000OptimizingDA,
  title={Optimizing database architecture for the new bottleneck: memory access},
  author={Stefan Manegold and Peter A. Boncz and Martin L. Kersten},
  journal={The VLDB Journal},
  year={2000},
  volume={9},
  pages={231-246}
}
Abstract. In the past decade, advances in the speed of commodity CPUs have far out-paced advances in memory latency. Main-memory access is therefore increasingly a performance bottleneck for many computer applications, including database systems. In this article, we use a simple scan test to show the severe impact of this bottleneck. The insights gained are translated into guidelines for database architecture, in terms of both data structures and algorithms. We discuss how vertically fragmented… Expand
Efficient Processing of Range Queries in Main Memory
TLDR
A cache-optimized, updateable main-memory index structure, the cache-sensitive skip list, is proposed, which targets the execution of range queries on single database columns, and a novel, fast and space-efficient, main- memory based index structure is devised, the BB-Tree, which supports multidimensional range and point queries and provides a parallel search operator that leverages the multi-threading capabilities of modern CPUs. Expand
Analytical Query Execution Optimized for all Layers of Modern Hardware
TLDR
This thesis focuses on the design and implementation of highly efficient database systems by optimizing analytical query execution for all layers of modern hardware, and introduces advanced SIMD vectorization techniques generalizable across multiple operators. Expand
RC-NVM: Enabling Symmetric Row and Column Memory Accesses for In-memory Databases
  • Peng Wang, Shuo Li, +6 authors Tao Zhang
  • Computer Science
  • 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA)
  • 2018
TLDR
A dual-addressable memory architecture based on non-volatile memory, called RC-NVM, to support both row-oriented and column-oriented accesses is proposed and a group caching technique that combines the IMDB knowledge with the memory architecture to further optimize the system is proposed. Expand
Accelerators for Data Processing
TLDR
This thesis provides a dynamic software acceleration scheme for exploiting inter-lookup parallelism to hide the memory access latency despite the irregularities across lookups, and proposes a programmable hardware accelerator to maximize the efficiency of the data structure lookups. Expand
Generic Database Cost Models for Hierarchical Memory Systems
TLDR
A generic technique to create accurate cost functions for database operations and provides insight to tune algorithms not only in a main-memory DBMS, but also in a disk-based DBMS with a large main- memory buffer cache. Expand
Performance Characterization of Modern Databases on Out-of-Order CPUs
TLDR
It is observed that performance of modern databases is severely limited by poor cache/memory performance, and it is demonstrated that dynamic execution techniques are still effective in hiding a significant fraction of the stalls, thereby improving performance. Expand
Cache-Aware Spatial Indices on Chip Multi-Processors: Limitations and Opportunities
  • Minhui Lv, Wei Xiong
  • Computer Science
  • 2016 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)
  • 2016
TLDR
This paper evaluates the performance of typical spatial indices on commodity chip multiprocessor using an analytical model incorporating memory hierarchical access cost and derives a list of advises for future spatial indices design to reach most performance on chip multip rocessor. Expand
Rethinking SIMD Vectorization for In-Memory Databases
TLDR
This paper presents novel vectorized designs and implementations of database operators, based on advanced SIMD operations, such as gathers and scatters, and highlights the impact of efficient vectorization on the algorithmic design of in-memorydatabase operators, as well as the architectural design and power efficiency of hardware. Expand
Compiling Database Queries into Machine Code
TLDR
This paper shows how queries can be brought into a form suitable for efficient translation, and how the underlying code generation can be orchestrated, by carefully abstracting away the necessary plumbing infrastructure to build a query compiler that is both maintainable and efficient. Expand
O2-tree: a shared memory resident index in multicore architectures
TLDR
Analysis and comparative experimental study show that the performance of the O2-Tree is superior to other tree-based index structures with respect to various query operations for large datasets and outperforms popular key-value stores such as BerkelyDB and TreeDB of Kyoto Cabinet for various workloads. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 28 REFERENCES
Database Architecture Optimized for the New Bottleneck: Memory Access
TLDR
A simple scan test is used to show the severe impact of main-memory access bottleneck, and radix algorithms for partitioned hash-join are introduced, using a detailed analytical model that incorporates memory access cost. Expand
Optimizing Main-Memory Join on Modern Hardware
TLDR
The partitioned hash-join is refined with a new partitioning algorithm called radix-cluster, which is specifically designed to optimize memory access, and the effect of implementation techniques that optimize CPU resource usage is investigated. Expand
What Happens During a Join? Dissecting CPU and Memory Optimization Effects
TLDR
This work presents a calibration tool that automatically extracts the relevant parameters about the memory subsystem from any hardware and demonstrates how a database system equipped with this calibrator can automatically tune memory-conscious database algorithms to their optimal settings. Expand
Cache Conscious Algorithms for Relational Query Processing
TLDR
It is shown that there are significant benefits in redesigning traditional query processing algorithms so that they can make better use of the cache, and new algorithms run 8%-200% faster than the traditional ones. Expand
Tolerating latency through software-controlled data prefetching
TLDR
This dissertation proposes and evaluates a new compiler algorithm for inserting prefetches into code that attempts to minimize overheads by only issuing prefetched for references that are predicted to suffer cache misses, and investigates the architectural support necessary to make prefetching effective. Expand
Query optimization in a memory-resident domain relational calculus database system
TLDR
This paper addresses aspects of query optimization in memory-resident database systems and presents practical solutions to them and presents results of performance measurements, which prove to be excellent in the current state of the art. Expand
A Study of Index Structures for a Main Memory Database Management System
TLDR
This paper proposes a new index structure, the T Tree, and it is compared to existing index structures in a main memory database environment, and results indicate that the T tree provides good overall performance in main memory. Expand
Monet: An Impressionist Sketch of an Advanced Database System
TLDR
This paper gives the goals and motivation of Monet, and outline its architectural features, including its use of the Decomposed Storage Model (DSM), emphasis on bulk operations, use of main virtual-memory and server customization, and some issues on how to build a GIS on top of Monets. Expand
AlphaSort: a RISC machine sort
TLDR
A new sort algorithm, called AlphaSort, demonstrates that commodity processors and disks can handle commercial batch workloads and proposes two new benchmarks: Minutesort: how much can you sort in a minute, and DollarSort: how to sort for a dollar. Expand
Flattening an object algebra to provide performance
TLDR
It is shown how flattening enabled us to implement a query algebra, using only a very limited set of simple operations, and was evaluated on the 1-GByte TPC-D (Transaction-processing Performance Council's Benchmark D), showing that the divide-and-conquer approach yields excellent results. Expand
...
1
2
3
...