Efficient management and analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr

@inproceedings{Priv2017EfficientMA,
  title={Efficient management and analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr},
  author={Florian Priv{\'e} and Hugues Aschard and Michael G. B. Blum},
  year={2017}
}
Genome-wide datasets produced for association studies have dramatically increased in size over the past few years, with modern datasets commonly including millions of variants measured in tens of thousands of individuals. This increase in data size is a major challenge that severely slows down genomic analyses. Specialized software has been developed for every part of the analysis pipeline to handle large genomic data. However, combining all these software tools into a single data analysis pipeline…