The Comprehensive R Archive Network

  title={The Comprehensive R Archive Network},
  author={Kurt Hornik},
  journal={Wiley Interdisciplinary Reviews: Computational Statistics},
  • K. Hornik
  • Published 1 July 2012
  • Computer Science
  • Wiley Interdisciplinary Reviews: Computational Statistics
The Comprehensive R Archive Network (CRAN) is a network of sites acting as the primary web service distributing R sources and binaries, extension packages, and documentation. We discuss this functionality in more detail, with particular emphasis on the CRAN package repository, and its underlying design and operation principles. WIREs Comput Stat 2012 doi: 10.1002/wics.1212 

Best of both worlds: relational databases and statistics

This paper integrates the analytics-optimized database MonetDB and the R environment for statistical computing in a non-obtrusive, transparent and compatible way.

Building Packages in R

This paper serves as a tutorial for individuals to create their own packages and release those packages on the Comprehensive R Archive Network (CRAN).

Modeling the Impact of R Packages Using Dependency and Contributor Networks

It is found that there are common network and package attributes that are important in determining both the number of downloads and citations of a package, including degree, closeness and betweenness centralities, as well as package attributes such as number of authors and number of commits.

Simulating Supercomputer Workload with Hpcwld Package for R

  • A. Rumyantsev
  • Computer Science
    2014 15th International Conference on Parallel and Distributed Computing, Applications and Technologies
  • 2014
We present the hpcwld package that provides R [1] function for the workload (unfinished work) evaluation of a stochastic model of a supercomputer based on a modified Kiefer -- Wolfowitz recursion.

Poster: A Topic Analysis of the R Programming Language

This work uses Latent Dirichlet Allocation to analyze R source code from 10,051 R packages to identify concepts that are highly specific to scientific and high performance computing applications.

BoscoR: Extending R from the desktop to the Grid

A framework to execute R functions on remote resources from the desktop using Bosco, which is capable of managing many concurrent tasks submitted to remote resources while providing feedback to the interactive R environment and has a flat learning curve for any R user.

Modeling the impact of Python and R packages using dependency and contributor networks

It is found that the more derivative a package is (the more dependencies it has), the less likely it is to have a high impact, and introducing network features to a baseline model using only package features improves the performance of the models.

Enabling Distributed Scientific Computing on the Campus

A remote submission and overlay computational framework called Bosco, a data management framework for managing and provisioning storage resources on the campus, and a policy language to manage the data distribution are created, a unified framework for campus computing.

RDAVIDWebService: a versatile R interface to DAVID

SUMMARY The RDAVIDWebService package provides a class-based interface from R programs/scripts to fully access/control the database for annotation, visualization and integrated discovery, without the

Computational Cost Estimation using Volunteer Computing in R

A decision system is proposed for a Volunteer Computing (VC) platform for R software that, through estimations, selects the most suitable execution site for a given R script.



Prospects and challenges in R package development

This paper states that established quality assurance systems and collaborative infrastructures typically face several challenges, some of which will be discussed in this paper.

Collaborative Software Development Using R-Forge

R-Forge provides a set of tools for source code management and various web-based features that aims to provide a platform for collaborative development of R packages, R-related software or further projects.

Bioconductor: open software development for computational biology and bioinformatics

Details of the aims and methods of Bioconductor, the collaborative creation of extensible software for computational biology and bioinformatics, and current challenges are described.

Watch Your Spelling!

The results of spell checking all Rd files in all CRAN packages and how these can be employed for building a domain-specific statistical dictionary are analyzed.

Team, Writing R Extensions

  • R Foundation for Statistical Computing, Vienna, Austria,
  • 2011

The R FAQ , 2011

  • R : A Language and Environment for Statistical Computing . R Foundation for Statistical Computing , Vienna , Austria ,
  • 2011