Zhihao Huang

As data grows exponentially within data centers, cluster deduplication storage systems face challenges in providing high throughput, a high deduplication ratio, and load balance. As the key technique, the data routing algorithm has a strong impact on the deduplication ratio, throughput, and load balance of cluster deduplication storage systems. In this paper, we …
The data warehouse system Hive has emerged as an important facility for data computing and storage. In particular, RCFile is a tailor-made data placement structure implemented in Hive and designed for data processing efficiency. In this paper, we propose several optimized schemes based on RCFile and introduce EStore, which is an …
Recently, erasure codes such as the Reed-Solomon (RS) code and the Cauchy Reed-Solomon (CRS) code have been widely used in distributed file systems to reduce the large storage overhead incurred by replication schemes. A new erasure code, the Binary Reed-Solomon (BRS) code, can achieve better performance than the RS and CRS codes and is …