ArrayExpress update—an archive of microarray and high-throughput sequencing-based functional genomics experiments

Authors: Helen E. Parkinson, Ugis Sarkans, Nikolay Kolesnikov, Niran Abeygunawardena, Tony Burdett, Miroslaw Dylag, Ibrahim Emam, Anna Farne, Emma Hastings, Ele Holloway, Natalja Kurbatova, Margus Lukk, James Malone, Roby Mani, Ekaterina Pilicheva, Gabriella Rustici, Anjan Chandra Sharma, Eleanor Williams, Tomasz Adamusiak, Marco Brandizi, Nataliya Sklyar and Alvis Brazma
Journal: Nucleic Acids Research
Pages: D1002-D1004
The ArrayExpress Archive is one of the three international public repositories of functional genomics data supporting publications. It includes data generated by sequencing or array-based technologies. Data are submitted by users and imported directly from the NCBI Gene Expression Omnibus. The ArrayExpress Archive is closely integrated with the Gene Expression Atlas and the sequence databases at the European Bioinformatics Institute. Advanced queries provided…

ArrayExpress update—trends in database growth and links to data analysis tools

The ArrayExpress Archive of Functional Genomics Data is one of three international functional genomics public data repositories, alongside the Gene Expression

NCBI GEO: archive for functional genomics data sets—update

The Gene Expression Omnibus is an international public repository for high-throughput microarray and next-generation sequence functional genomic data sets submitted by the research community and supports archiving of raw data, processed data and metadata which are indexed, cross-linked and searchable.

ArrayExpress update – from bulk to single-cell expression data

With an increasing number of studies combining different assay modalities (multi-omics experiments), a new, more general archival resource, the BioStudies Database, has been developed; it will eventually supersede ArrayExpress.

BioXpress: an integrated RNA-seq-derived gene expression database for pan-cancer analysis

BioXpress is a gene expression and cancer association database in which the expression levels are mapped to genes using RNA-seq data obtained from The Cancer Genome Atlas, International Cancer Genome

ProfileChaser: searching microarray repositories based on genome-wide patterns of differential expression

ProfileChaser is a web server that allows querying the Gene Expression Omnibus based on genome-wide patterns of differential expression, using a novel content-based approach.

From ArrayExpress to BioStudies

With BioStudies now fully functional, the archival data infrastructure at EMBL-EBI can be further harmonized: ArrayExpress is being migrated to BioStudies, and in future all functional genomics data will be archived there.

GEE: An Informatics Tool for Gene Expression Data Explore

Gene Expression data Explore (GEE) is introduced, described as the first powerful, flexible web and mobile application for searching whole-genome epigenetic data and microarray data in public databases such as GEO and ArrayExpress.

Leukemia Gene Atlas – A Public Platform for Integrative Exploration of Genome-Wide Molecular Data

The Leukemia Gene Atlas (LGA) is a public platform designed to support research and analysis of diverse genomic data published in the field of leukemia and provides extensive analysis and visualization tools for various types of molecular data.

arrayMap: A Reference Resource for Genomic Copy Number Imbalances in Human Malignancies

The arrayMap database provides a platform for meta-analysis and systems level data integration of high-resolution oncogenomic CNA data, which readily could be used for genomic feature mining, across a representative range of cancer entities.

Computer Microarray Database Bioinformatics usher Functional Genomics to unveil Biological Knowledge underlying Physiology

Low-level analysis of a leukemia dataset, including normalization and outlier detection using dChip and R software, identifies 604 significant genes out of 12600, with a false discovery rate of 50 permutations. This information can open the way to innovative opportunities for early diagnosis of malignancies.



ArrayExpress—a public repository for microarray gene expression data at the EBI

ArrayExpress is a public repository for microarray data that supports the MIAME (Minimum Information About a Microarray Experiment) requirements and stores well-annotated raw and normalized data. As

NCBI GEO: archive for high-throughput functional genomic data

The Gene Expression Omnibus at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data and offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives.

Gene Expression Atlas at the European Bioinformatics Institute

The Gene Expression Atlas is an added-value database providing information about gene expression in different cell types, organism parts, developmental stages, disease

Importing ArrayExpress datasets into R/Bioconductor

This work presents a tool, suitable for both interactive and automated use, for importing datasets from ArrayExpress, one of the largest public repositories of microarray datasets, into R/Bioconductor.

Design and implementation of microarray gene expression markup language (MAGE-ML)

MAGE will help microarray data producers and users to exchange information by providing a common platform for data exchange, and MAGE-STK will make the adoption of MAGE easier.

Ensembl 2009

Major additions and improvements to Ensembl since the previous report include a major redesign of the website; generation of multiple genome alignments and ancestral sequences using the new Enredo-Pecan-Ortheus pipeline and development of the software infrastructure.

Petabyte-scale innovations at the European Nucleotide Archive

A new repository for next-generation sequence data is presented, along with a brief summary of the contents of the ENA and details of major developments to submission pipelines, high-throughput rule-based validation infrastructure and data integration approaches.

MAGETabulator, a suite of tools to support the microarray data format MAGE-TAB

A suite of tools is presented to support MAGE-TAB generation and validation, conversion between existing formats for data exchange, visualization of the experiment designs encoded by MAGE-TAB documents and the mining of such documents for semantic content.

A simple spreadsheet-based, MIAME-supportive format for microarray data: MAGE-TAB

A simple tab-delimited, spreadsheet-based format, MAGE-TAB, will enable laboratories without bioinformatics experience or support to manage, exchange and submit well-annotated microarray data in a standard format using a spreadsheet.

Modeling sample variables with an Experimental Factor Ontology

The application of reference ontologies to data is a key problem, and this work presents guidelines on how community ontologies can be presented in an application ontology in a data-driven way.