• Publications
  • Influence
LIRS: an efficient low inter-reference recency set replacement policy to improve buffer cache performance
TLDR
LIRS effectively addresses the limits of LRU by using recency to evaluate Inter-Reference Recency (IRR) for making a replacement decision, and significantly outperforms LRU, and outperforms other existing replacement algorithms in most cases. Expand
Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce
TLDR
Hadoop-GIS - a scalable and high performance spatial data warehousing system for running large scale spatial queries on Hadoop and integrated into Hive to support declarative spatial queries with an integrated architecture is presented. Expand
Understanding intrinsic characteristics and system implications of flash memory based solid state drives
TLDR
This study reveals several unanticipated aspects in the performance dynamics of SSD technology that must be addressed by system designers and data-intensive application users in order to effectively place it in the storage hierarchy. Expand
CAFTL: A Content-Aware Flash Translation Layer Enhancing the Lifespan of Flash Memory based Solid State Drives
TLDR
A Content-Aware Flash Translation Layer (CAFTL) is proposed to enhance the endurance of SSDs at the device level to reduce write traffic to flash memory by removing unnecessary duplicate writes and extend available free flash memory space by coalescing redundant data in SSDs. Expand
LDPC-in-SSD: making advanced error correction codes work effectively in solid state drives
TLDR
A strong ECC alternative can be used in NAND flash memory to retain its reliability to respond the continuous cost reduction, and its relatively small increase of response time delay is acceptable to mainstream application users, considering a huge gain in SSD capacity, its reliability, and the price reduction. Expand
Measurements, analysis, and modeling of BitTorrent-like systems
TLDR
An analysis of representative Bit-Torrent traffic provides several new findings regarding the limitations of BitTorrent systems: due to the exponentially decreasing peer arrival rate in reality, service availability in such systems becomes poor quickly, after which it is difficult for the file to be located and downloaded. Expand
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
TLDR
This paper has comprehensively evaluated several representative cache partitioning schemes with different optimization objectives, including performance, fairness, and quality of service (QoS) and provides new insights into dynamic behaviors and interaction effects. Expand
RCFile: A fast and space-efficient data placement structure in MapReduce-based warehouse systems
TLDR
This paper presents a big data placement structure called RCFile (Record Columnar File) and its implementation in the Hadoop system and shows the effectiveness of RCFile in satisfying the four requirements. Expand
Hystor: making the best use of solid state drives in high performance storage systems
TLDR
The system study shows that in a highly effective hybrid storage system, SSDs should play a major role as an independent storage where the best suitable data are adaptively and timely migrated in and retained, and it can also be effective to serve as a write-back buffer. Expand
A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality
TLDR
It is shown that the permutation based scheme dramatically increases the hit rates on DRAM row-buffers and reduces memory stall time of the SPEC95 and TPC-C workloads. Expand
...
1
2
3
4
5
...