Corpus ID: 236447791

Bam-readcount - rapid generation of basepair-resolution sequence metrics

  title={Bam-readcount - rapid generation of basepair-resolution sequence metrics},
  author={Ajay Khanna and David E. Larson and Sridhar Nonavinkere Srivatsan and Matthew Mosior and Travis E. Abbott and Susanna Kiwala and Timothy J. Ley and Eric J. Duncavage and Matthew J. Walter and Jason R. Walker and Obi L. Griffith and Malachi Griffith and Christopher A. Miller},
Summary: Bam-readcount is a utility for generating low-level information about sequencing data at specific nucleotide positions. Originally designed to help filter genomic mutation calls, the metrics it outputs are useful as input for variant detection tools and for resolving ambiguity between variant callers1,2. In addition, it has found broad applicability in diverse fields including tumor evolution, single-cell genomics, climate change ecology, and tracking community spread of SARS-CoV-2.3–6… Expand

Figures from this paper

ViralFlow: an automated workflow for SARS-CoV-2 genome assembly, lineage assignment, mutations and intrahost variants detection
This workflow automates all the steps involved in SARS-CoV-2 genomic analysis: data processing, genome assembly, PANGO lineage assignment, mutation analysis and the screening of intrahost variants. Expand


CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones
CONICS is a software tool for mapping gene expression from scRNA-seq to tumor clones and phylogenies, with routines enabling: the quantitation of copy-number alterations in sc RNA-seq, robust separation of neoplastic cells from tumor-infiltrating stroma, inter-clone differential- expression analysis and intra-clone co-expression analysis. Expand
Using VarScan 2 for Germline Variant Calling and Somatic Mutation Detection
The identification of small sequence variants remains a challenging but critical step in the analysis of next‐generation sequencing data. Our variant‐calling tool, VarScan 2, employs heuristic andExpand
HTSlib - C library for reading/writing high-throughput sequencing data
HTSlib is a software library for providing programmatic access to sequencing alignment and variant formats and has been incorporated into Perl, Python, Rust and R, significantly expanding the number of uses via other languages. Expand
NeoMutate: an ensemble machine learning framework for the prediction of somatic mutations in cancer
The NeoMutate framework, which incorporates 7 supervised machine learning algorithms to exploit the strengths of multiple variant callers, optimizes somatic variant detection rates, leading to a potentially improved variant selection framework for the diagnosis and treatment of cancer. Expand
Optimizing cancer genome sequencing and analysis.
This dataset, representing the most comprehensively sequenced tumor described to date, will serve as an invaluable community resource (dbGaP accession id phs000159) and reassess optimal sequencing strategies. Expand
appreci8: a pipeline for precise variant calling integrating 8 tools
Appreci8, an automatic variant calling pipeline for calling single nucleotide variants and short indels by combining and filtering the output of eight open‐source variant calling tools, based on a novel artifact‐ and polymorphism score, showed superior performance compared to any evaluated alternative approach. Expand
Genome Modeling System: A Knowledge Management Platform for Genomics
The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Expand
SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA
With increased sequencing and mapping accuracy, SiNVICT might be utilized in clinical settings, making it possible to track the progress of point mutations and indels that are associated with resistance to cancer therapies and provide patients personalized treatment. Expand
Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia.
It is found that a complex interplay of genetic events contributes to AML pathogenesis in individual patients and the databases from this study are widely available to serve as a foundation for further investigations of AMl pathogenesis, classification, and risk stratification. Expand
A deep learning approach to automate refinement of somatic variant calling from cancer sequencing data
This model accurately recapitulated manual refinement labels for three independent testing sets and accurately predicted somatic variants confirmed by orthogonal validation sequencing data and reduces bias stemming from inter-reviewer variability. Expand