PCS: Predictive Component-Level Scheduling for Reducing Tail Latency in Cloud Online Services

@article{Han2015PCSPC,
  title={PCS: Predictive Component-Level Scheduling for Reducing Tail Latency in Cloud Online Services},
  author={Rui Han and Junwei Wang and Siguang Huang and Chenrong Shao and Shulin Zhan and Jianfeng Zhan and Jos{\'e} Luis V{\'a}zquez-Poletti},
  journal={2015 44th International Conference on Parallel Processing},
  year={2015},
  pages={490-499}
}
Modern latency-critical online services often rely on composing results from a large number of server components. Hence the tail latency (e.g. The 99th percentile of response time), rather than the average, of these components determines the overall service performance. When hosted on a cloud environment, the components of a service typically co-locate with short batch jobs to increase machine utilizations, and share and contend resources such as caches and I/O bandwidths with them. The highly… CONTINUE READING

References

Publications referenced by this paper.
Showing 1-10 of 23 references

McKee . Characterizing and subsetting big data workloads

  • Jianfeng Zhan
  • 2014

Similar Papers

Loading similar papers…