SparkR: Scaling R Programs with Spark

Abstract

R is a popular statistical programming language with a number of extensions that support data processing and machine learning tasks. However, interactive data analysis in R is usually limited because the R runtime is single-threaded and can only process data sets that fit in a single machine's memory. We present SparkR, an R package that provides a frontend to…
DOI: 10.1145/2882903.2903740

Statistics

[Figure: Citations per Year. x-axis: year (2016 to 2017); y-axis: citations (0 to 200)]

Citation Velocity: 83

Averaging 83 citations per year over the last 2 years.

Cite this paper

@inproceedings{Venkataraman2016SparkRSR,
  title={SparkR: Scaling R Programs with Spark},
  author={Shivaram Venkataraman and Zongheng Yang and Davies Liu and Eric Liang and Hossein Falaki and Xiangrui Meng and Reynold Xin and Ali Ghodsi and Michael J. Franklin and Ion Stoica and Matei Zaharia},
  booktitle={SIGMOD Conference},
  year={2016}
}