Can Inferred Provenance and Its Visualisation Be Used to Detect Erroneous Annotation? A Case Study Using UniProtKB

  title={Can Inferred Provenance and Its Visualisation Be Used to Detect Erroneous Annotation? A Case Study Using UniProtKB},
  author={M. Bell and M. Collison and P. Lord},
  journal={PLoS ONE},
  • M. Bell, M. Collison, P. Lord
  • Published 2013
  • Computer Science, Biology, Medicine
  • PLoS ONE
  • A constant influx of new data poses a challenge in keeping the annotation in biological databases current. Most biological databases contain significant quantities of textual annotation, which often contains the richest source of knowledge. Many databases reuse existing knowledge; during the curation process annotations are often propagated between entries. However, this is often not made explicit. Therefore, it can be hard, potentially impossible, for a reader to identify where an annotation… CONTINUE READING
    8 Citations
    On patterns and re-use in bioinformatics databases
    • 2
    • PDF
    HAMAP in 2015: updates to the protein family classification and annotation system
    • 78
    • PDF


    An approach to describing and analysing bulk biological annotation quality: a case study using UniProtKB
    • 16
    • PDF
    Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation
    • 906
    • PDF
    Mining sequence annotation databanks for association patterns
    • 39
    • Highly Influential
    • PDF
    Evaluation of human-readable annotation in biomolecular sequence databases with biological rule libraries
    • 52
    • PDF
    Modeling the percolation of annotation errors in a database of protein sequences
    • 169
    • PDF
    Estimating the annotation error rate of curated GO database sequence annotations
    • 138
    UniProt Knowledgebase: a hub of integrated protein data
    • M. Magrane
    • Computer Science, Medicine
    • Database J. Biol. Databases Curation
    • 2011
    • 1,359
    • PDF
    Percolation of annotation errors through hierarchically structured protein sequence databases.
    • 71
    Gene Ontology annotations at SGD: new data sources and annotation methods
    • 246
    • PDF