Bambino: a variant detector and alignment viewer for next-generation sequencing data in the SAM/BAM format

Michael N. Edmonson, Jinghui Zhang, Chunhua Yan, Richard P. Finney, Daoud M. Meerzaman, Kenneth H. Buetow
Volume 27, issue 6
Summary: Bambino is a variant detector and graphical alignment viewer for next-generation sequencing data in the SAM/BAM format that can pool data from multiple source files. The variant detector takes advantage of SAM-specific annotations and produces detailed output suitable for genotyping and identification of somatic mutations. The assembly viewer can display reads in the context of either a user-provided or automatically generated reference sequence, retrieve genome…

svviz: a read viewer for validating structural variants

Visualizing read alignments is the most effective way to validate candidate SVs with existing data. We present svviz, a sequencing read visualizer for structural variants (SVs) that sorts and

SNP calling using genotype model selection on high-throughput sequencing data

A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts for the errors that occur during the preparation of the genomic sample.

PyBamView: a browser-based application for viewing short read alignments

  • M. Gymrek
  • Computer Science, Biology
  • 2014
PyBamView, a lightweight Web application for visualizing short read alignments, provides an easy-to-use Web interface for viewing alignments across multiple samples, with a focus on accurate visualization of insertions.

DIVIS: Integrated and Customizable Pipeline for Cancer Genome Sequencing Analysis and Interpretation

DIVIS is a customizable pipeline based on GPyFlow that integrates read preprocessing, alignment, variant detection, and annotation of whole-genome sequencing, whole-exome sequencing and gene-panel sequencing and substantially facilitates complex cancer genome sequencing analyses.

A Bioinformatics Procedure to Identify and Annotate Somatic Mutations in Whole-Exome Sequencing Data

This work proposes a computational procedure for managing large-scale sequencing data in order to detect exonic somatic mutations in a tumor sample. The procedure comprises several steps based on open-source software and the R language: alignment, detection of mutations, annotation, functional classification and visualization of results.

VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data

VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes.

MiST: A new approach to variant detection in deep sequencing datasets

Compared with variant calls from the GATK platform, MiST showed better concordance with SNPs from dbSNP and genotypes determined by an exonic-SNP array, and is a valuable alternative tool to analyse variants in deep sequencing data.

Bioinformatics Basics for High-Throughput Hybridization-Based Targeted DNA Sequencing from FFPE-Derived Tumor Specimens: From Reads to Variants.

This chapter reviews common tools used to generate reads from Illumina-derived sequence data, align reads, and call variants from hybridization-based targeted NGS panel data generated from tumor FFPE-derived DNA specimens as well as basic quality metrics to assess for each assayed specimen.

Customisation of the Exome Data Analysis Pipeline Using a Combinatorial Approach

Different freely available tools used at the alignment and post-alignment stages are sampled, and the most suitable combination, determined by a simple framework of pre-existing metrics, is suggested for creating significant datasets.

VCF2CNA: A tool for efficiently detecting copy-number alterations in VCF genotype data and tumor purity

VCF2CNA is an accurate, efficient and platform-independent tool for CNA and tumor purity analyses without accessing raw sequence data and provides accurate tumor purity estimates for samples with sufficient CNAs.



The Sequence Alignment/Map format and SAMtools

Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by

The UCSC Genome Browser database: update 2011

New data highlights include seven new genome assemblies, a Neandertal genome data portal, phenotype and disease association data, a human RNA editing track, and a zebrafish Conservation track.

MagicViewer: integrated solution for next-generation sequencing data visualization and genetic variation detection and annotation

MagicViewer is a sophisticated assembly visualization and genetic variation annotation tool for next-generation sequencing data, which can be widely used in a variety of sequencing-based researches, including genome re-sequencing and transcriptome studies.

The UCSC Genome Browser database: update 2010

The University of California, Santa Cruz (UCSC) Genome Browser website provides a large database of publicly available sequence and annotation data along with an integrated

Consed: a graphical tool for sequence finishing.

Consed is a finishing tool that attempts to implement principles of shotgun sequencing by using error probabilities from phred and phrap as an objective criterion to guide the entire finishing process.

VarScan: variant detection in massively parallel sequencing of individual and pooled samples

VarScan is an open-source tool for variant detection that is compatible with several short-read aligners. It is shown to detect SNPs and indels with high sensitivity and specificity in both Roche/454 sequencing of individuals and deep Illumina/Solexa sequencing of pooled samples.

Reliable identification of large numbers of candidate SNPs from public EST data

The SNPpipeline, a polymorphism detection system that uses public-domain sequence data, has identified more than 3,000 candidate single-nucleotide polymorphisms (SNPs) and suggests that existing sequence resources may serve as a valuable source for identifying genetic variation.

Base-calling of automated sequencer traces using phred. I. Accuracy assessment.

The availability of massive amounts of DNA sequence information has begun to revolutionize the practice of biology. As a result, current large-scale sequencing output, while impressive, is not

The UCSC Genome Browser Database: update 2006

The University of California Santa Cruz Genome Browser Database (GBD) contains sequence and annotation data for the genomes of about a dozen vertebrate species and several major model organisms to support fast interactive performance with web tools that provide powerful visualization and querying capabilities for mining the data.

Fast and accurate short read alignment with Burrows–Wheeler transform

The Burrows–Wheeler Alignment tool (BWA) is a new read alignment package based on backward search with the Burrows–Wheeler Transform (BWT). It efficiently aligns short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.