• Publications
  • Influence
BigDataBench: A big data benchmark suite from internet services
TLDR
We choose 19 big data benchmarks from dimensions of application scenarios, operations/ algorithms, data types, data sources, and application types, and they are comprehensive for fairly measuring and evaluating big data systems and architecture. Expand
  • 489
  • 50
  • PDF
RCFile: A fast and space-efficient data placement structure in MapReduce-based warehouse systems
  • Y. He, R. Lee, +4 authors Z. Xu
  • Computer Science
  • IEEE 27th International Conference on Data…
  • 11 April 2011
TLDR
We present a big data placement structure called RCFile (Record Columnar File) and its implementation in the Hadoop system. Expand
  • 262
  • 25
  • PDF
YSmart: Yet Another SQL-to-MapReduce Translator
TLDR
We propose and develop a system called Y Smart, a correlation aware SQL-to-MapReduce translator. Expand
  • 153
  • 21
  • PDF
A Dynamic MapReduce Scheduler for Heterogeneous Workloads
TLDR
We give a new view of the MapReduce model and classify the workloads into three categories based on their CPU and I/O utilization. Expand
  • 177
  • 4
  • PDF
BigDataBench: a Big Data Benchmark Suite from Web Search Engines
TLDR
This paper presents our joint research efforts on big data benchmarking with several industrial partners. Expand
  • 57
  • 4
  • PDF
RWSNet: a semantic segmentation network based on SegNet combined with random walk for remote sensing
TLDR
We propose Random-Walk-SegNet, a semantic segmentation network based on SegNet combined with random walk, which achieves high-performance segmentation of remote sensing images. Expand
  • 4
Four styles of parallel and net programming
TLDR
This paper reviews the programming landscape for parallel and network computing systems, focusing on four styles of concurrent programming models, and example languages/libraries. Expand
  • 4
  • PDF
A Switch Criterion for Hybrid Datasets Merging on Top of Map Reduce
TLDR
This paper proposes a novel hybrid datasets merging algorithm on top of Map Reduce, HDMA. Expand
  • 3
Application of Data Mining on Students' Quality Evaluation
  • Y. He, Shunli Zhang
  • Computer Science
  • 3rd International Workshop on Intelligent Systems…
  • 28 May 2011
TLDR
A decision support system for students'comprehensive evaluation based on data mining. Expand
  • 8
...
1
2
3
4
...