Learn More
UNLABELLED Microarray technology has become a standard molecular biology tool. Experimental data have been generated on a huge number of organisms, tissue types, treatment conditions and disease states. The Gene Expression Omnibus (Barrett et al., 2005), developed by the National Center for Bioinformatics (NCBI) at the National Institutes of Health is a(More)
Enhancers regulate spatiotemporal gene expression and impart cell-specific transcriptional outputs that drive cell identity. Super-enhancers (SEs), also known as stretch-enhancers, are a subset of enhancers especially important for genes associated with cell identity and genetic risk of disease. CD4(+) T cells are critical for host defence and autoimmunity.(More)
Sequence polymorphisms linked to human diseases and phenotypes in genome-wide association studies often affect noncoding regions. A SNP within an intron of the gene encoding Interferon Regulatory Factor 4 (IRF4), a transcription factor with no known role in melanocyte biology, is strongly associated with sensitivity of skin to sun exposure, freckles, blue(More)
Circos is a Perl language based software package for visualizing similarities and differences of genome structure and positional relationships between genomic intervals. Running Circos requires extra data processing procedures to prepare plot data files and configure files from datasets, which limits its capability of integrating directly with other(More)
UNLABELLED The NCBI Gene Expression Omnibus (GEO) represents the largest public repository of microarray data. However, finding data in GEO can be challenging. We have developed GEOmetadb in an attempt to make querying the GEO metadata both easier and more powerful. All GEO metadata records as well as the relationships between them are parsed and stored in(More)
The Sequence Read Archive (SRA) is the largest public repository of sequencing data from the next generation of sequencing platforms including Illumina (Genome Analyzer, HiSeq, MiSeq, .etc), Roche 454 GS System, Applied Biosystems SOLiD System, Helicos Heliscope, PacBio RS, and others. SRAdb is an attempt to make queries of the metadata associated with SRA(More)
To understand the genetic mechanisms driving variant and IGHV4-34-expressing hairy-cell leukemias, we performed whole-exome sequencing of leukemia samples from ten affected individuals, including six with matched normal samples. We identified activating mutations in the MAP2K1 gene (encoding MEK1) in 5 of these 10 samples and in 10 of 21 samples in a(More)
Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes(More)
Like other retroviruses, human immunodeficiency virus type 1 (HIV-1) selectively packages genomic RNA (gRNA) during virus assembly. However, in the absence of the gRNA, cellular messenger RNAs (mRNAs) are packaged. While the gRNA is selected because of its cis-acting packaging signal, the mechanism of this selection is not understood. The affinity of Gag(More)