Genome annotation: from sequence to biology

@article{Stein2001GenomeAF,
  title={Genome annotation: from sequence to biology},
  author={Lincoln Stein},
  journal={Nature Reviews Genetics},
  year={2001},
  volume={2},
  pages={493-503}
}
  • L. Stein
  • Published 1 July 2001
  • Biology, Medicine
  • Nature Reviews Genetics
The genome sequence of an organism is an information resource unlike any that biologists have previously had access to. But the value of the genome is only as good as its annotation. It is the annotation that bridges the gap from the sequence to the biology of the organism. The aim of high-quality annotation is to identify the key features of the genome — in particular, the genes and their products. The tools and resources for annotation are developing rapidly, and the scientific community is… Expand
Annotation, comparison and databases for hundreds of bacterial genomes.
TLDR
Advances in the development of bacterial annotation and comparative analysis tools, and progress in the design of novel database structures for the integration of heterogeneous biological information are reviewed. Expand
Bacterial genome annotation.
TLDR
Combining structural and functional annotation across genomes in a comparative manner promotes higher levels of accurate annotation as well as an advanced understanding of genome evolution. Expand
REVIEW OF TECHNIQUES FOR GENE SEQUENCING, ANNOTATION AND COMPARATIVE GENOMICS
TLDR
This work surveyed and presented an overview of common methods, techniques, tools and challenges of Gene Sequencing, Annotation and Comparative genomics. Expand
Annotating the Human Proteome
TLDR
The identification and functional annotation of the proteome is here of special interest and starts with the identification of genes and transcripts as a prerequisite of proteome annotation. Expand
Linking Genome Annotation Projects with Genetic Disorders using Ontologies
TLDR
A system that supports genome annotation processes by providing useful information about orthologous genes and the genetic disorders which can be associated with a gene identified in a sequence is described. Expand
Towards multidimensional genome annotation
TLDR
All four levels of genome annotation are discussed, with specific emphasis on two-dimensional annotation methods, and the study of changes in genome sequences that occur during adaptive evolution is studied. Expand
Genomics and Proteomics Using Computational Biology
TLDR
Proteomic mass spectrometry is a method that enables sequencing of gene product fragments, enabling the validation and refinement of existing gene annotation as well as the elucidation of novel protein coding regions, but the application of proteomics data to genome annotation is hindered by the lack of suitable tools and methods. Expand
Computer software to find genes in plant genomic DNA.
TLDR
This chapter discusses the use of different computer programs that identify protein-coding genes in large genomic sequences, and describes most commonly used gene prediction programs that are available on the World Wide Web. Expand
PLAZA: A Comparative Genomics Resource to Study Gene and Genome Evolution in Plants[W]
TLDR
PLAZA is described, an online platform for plant comparative genomics that integrates structural and functional annotation of published plant genomes together with a large set of interactive tools to study gene function and gene and genome evolution. Expand
Whole Genome Annotation: In Silico Analysis
TLDR
This chapter explores relevant ongoing research on genes and considers the gene as a basic mapping unit and considers genome publication in public-access sites as a pre-requisite for publication in scientific journals. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 93 REFERENCES
Gene Ontology: tool for the unification of biology
TLDR
The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. Expand
The COG database: a tool for genome-scale analysis of protein functions and evolution
TLDR
The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes. Expand
Life with 6000 Genes
TLDR
The genome of the yeast Saccharomyces cerevisiae has been completely sequenced through a worldwide collaboration and provides information about the higher order organization of yeast's 16 chromosomes and allows some insight into their evolutionary history. Expand
Genomic strategies to identify mammalian regulatory sequences
TLDR
In this review, several genomic approaches that are being used to identify regulatory sequences in mammalian genomes are highlighted. Expand
Saccharomyces Genome Database.
TLDR
The organization of the Saccharomyces Genome Database, the sources of the data stored in SGD, some methods for retrieving information from the database, connections SGD has with outside databases and non-yeast research communities, and SGD's repository of yeast community information are described. Expand
Genome annotation assessment in Drosophila melanogaster.
TLDR
This experiment presents the first assessment of promoter prediction techniques for a significant number of genes in a large contiguous region and discovered that the promoter predictors' high false-positive rates make their predictions difficult to use. Expand
Computational inference of homologous gene structures in the human genome.
TLDR
A new gene identification algorithm, GenomeScan, which combines exon-intron and splice signal models with similarity to known protein sequences in an integrated model, which shows an accurate and efficient automated approach for identifying genes in higher eukaryotic genomes and provide a first-level annotation of the draft human genome. Expand
The FlyBase database of the Drosophila genome projects and community literature.
  • Medicine
  • Nucleic acids research
  • 2003
TLDR
A complete revision of the annotations of the now-finished euchromatic genomic sequence has been completed and there are many points of entry to the genome within FlyBase, most notably through maps, gene products and ontologies, structured phenotypic and gene expression data, and anatomy. Expand
Initial sequencing and analysis of the human genome
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce andExpand
Initial sequencing and analysis of the human genome.
TLDR
The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence. Expand
...
1
2
3
4
5
...