ArrayWiki: an enabling technology for sharing public microarray data repositories and meta-analyses

@article{Stokes2008ArrayWikiAE,
  title={ArrayWiki: an enabling technology for sharing public microarray data repositories and meta-analyses},
  author={Todd H. Stokes and J. T. Torrance and Henry Li and May D. Wang},
  journal={BMC Bioinformatics},
  year={2008},
  volume={9},
  pages={S18}
}
Background: A survey of microarray databases reveals that most of the repository contents and data models are heterogeneous (i.e., data obtained from different chip manufacturers), and that the repositories provide only basic biological keywords linking to PubMed. As a result, it is difficult to find datasets using research context or analysis parameters information beyond a few keywords. For example, to reduce the "curse-of-dimension" problem in microarray analysis, the number of samples is…
Microarray meta-analysis database (M2DB): a uniformly pre-processed, quality controlled, and manually curated human clinical microarray database
TLDR
M2DB is a human curated microarray database designed for easy querying, based on clinical information and for interactive retrieval of either raw or uniformly pre-processed data, along with a set of quality-control metrics.
caCORRECT2: Improving the accuracy and reliability of microarray data in the presence of artifacts
TLDR
caCORRECT is shown to improve the accuracy of gene expression, and the reproducibility of experimental results in clinical application, and it is suggested that caCORRECT will be useful to clean up possible artifacts in new as well as archived microarray data.
Provenance of High Throughput Biomedical Experiments
The field of translational biomedical informatics seeks to integrate knowledge from basic science, directed research into diseases, and clinical insights into a form that can be used to discover…
Simplevisgrid: Grid services for visualization of diverse biomedical knowledge and molecular systems data
  • T. Stokes, May D. Wang
  • Computer Science, Medicine
  • 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society
  • 2009
TLDR
SimpleVisGrid builds on Cancer Biomedical Informatics Grid (caBIG) common infrastructure for cancer research, and clearly specifies and extends three standard data formats for inputs and outputs to grid services: comma-separated values (CSV), Portable Network Graphics (PNG), and Scalable Vector Graphics (SVG).
Automated seeding of specialised wiki knowledgebases with BioKb
TLDR
A software system, BioKb, implemented as a plugin for the TWiki engine, and designed to facilitate construction of a field-specific wiki containing collaborative and automatically generated content, which is seeded with data by use of RAASWiki.
A landmark in biomedical information: many ways are leading to PubMed - MediaWiki tags open remote literature access to PubMed
TLDR
An example is presented showing how automated ways of access can help to build customized special-purpose access portals that can be used by users who are not familiar with programming.
Diagnostic biomarkers for renal cell carcinoma: selection using novel bioinformatics systems for microarray data analysis.
TLDR
The tumor biomarkers have potential utility as a multiplex expression panel for classifying renal cell carcinoma with equivocal histology and are effective at identifying unknown biomarkers that could be verified by independent methods.
RiceWiki: a wiki-based database for community curation of rice genes
TLDR
RiceWiki bears the potential to make it possible to build a rice encyclopedia by and for the scientific community that harnesses community intelligence for collaborative knowledge curation, covers all aspects of biological knowledge and keeps evolving with novel knowledge.
The Gene Wiki: community intelligence applied to human gene annotation
TLDR
This work retrospectively examines the community usage and improvement of the Gene Wiki, providing evidence of a critical mass of users and editors and ongoing systematic improvements to these articles to increase their utility.
Clinical and biological data integration for biomarker discovery.
TLDR
A data integration strategy is described that implements a clinical and biological database and a wiki interface and provides examples to illustrate the utility of data integration to explore disease heterogeneity and develop predictive biomarkers.

References

SHOWING 1-10 OF 44 REFERENCES
ArrayExpress—a public repository for microarray gene expression data at the EBI
ArrayExpress is a public repository for microarray data that supports the MIAME (Minimum Information About a Microarray Experiment) requirements and stores well-annotated raw and normalized data. As…
ONCOMINE: a cancer microarray database and integrated data-mining platform.
TLDR
ONCOMINE is presented, a cancer microarray database and web-based data-mining platform aimed at facilitating discovery from genome-wide expression analyses and novel biomarkers and therapeutic targets are discovered.
chip artifact CORRECTion (caCORRECT): A Bioinformatics System for Quality Assurance of Genomics and Proteomics Array Data
TLDR
A web-based bioinformatics tool called caCORRECT for chip artifact detection, analysis, and CORRECTion, which removes systematic artifactual noises that are commonly observed in microarray gene expression data.
Bioinformatics approaches in the study of cancer.
A revolution is underway in the approach to studying the genetic basis of cancer. Massive amounts of data are now being generated via high-throughput techniques such as DNA microarray technology and…
NCBI GEO: mining millions of expression profiles—database and tools
TLDR
Recent database developments that facilitate effective mining and visualization of gene expression data are described, providing features to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise.
The Stanford Microarray Database accommodates additional microarray platforms and data formats
TLDR
Software is implemented that allows SMD to accept MAGE-ML documents from array manufacturers and to submit MIAME-compliant data in MAGE-ML format directly to ArrayExpress and GEO, greatly increasing the ease with which data from SMD can be published adhering to accepted standards and also increasing the accessibility of published microarray data to the general public.
The MGED Ontology: a resource for semantics-based description of microarray experiments
TLDR
The MGED Ontology was developed to provide terms for annotating experiments in line with the MIAME guidelines and provides a framework to reference terms in other ontologies and therefore facilitates the use of ontologies in microarray data annotation.
Oncomine 3.0: genes, pathways, and networks in a collection of 18,000 cancer gene expression profiles.
TLDR
Oncomine, a bioinformatics initiative aimed at collecting, standardizing, analyzing, and delivering cancer transcriptome data to the biomedical research community, provides an update on the initiative, describes the database and analysis modules, and highlights several notable observations.
Can we trust biomarkers? visualization and quantification of outlier probes in high density oligonucleotide microarrays
TLDR
This paper presents a more advanced method that quantifies and visualizes the direct impact of outlier probes on genes of interest (i.e., biomarkers), rescreens hundreds of oligo microarray datasets, and overlays previously published biomarkers so as to get rid of noisy ones.
The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results,…