An Overview of BioCreative II.5

@article{Leitner2010AnOO,
  title={An Overview of BioCreative II.5},
  author={F. Leitner and S. Mardis and Martin Krallinger and G. Cesareni and L. Hirschman and A. Valencia},
  journal={IEEE/ACM Transactions on Computational Biology and Bioinformatics},
  year={2010},
  volume={7},
  pages={385-399}
}
  • F. Leitner, S. Mardis, +3 authors A. Valencia
  • Published 2010
  • Computer Science, Medicine
  • IEEE/ACM Transactions on Computational Biology and Bioinformatics
We present the results of the BioCreative II.5 evaluation in association with the FEBS Letters experiment, where authors created Structured Digital Abstracts to capture information about protein-protein interactions. The BioCreative II.5 challenge evaluated automatic annotations from 15 text mining teams based on a gold standard created by reconciling annotations from curators, authors, and automated systems. The tasks were to rank articles for curation based on curatable protein-protein… Expand
Online assessment of protein interaction information extraction systems
Background. Manual database (DB) curation efforts extracting protein-protein interactions (PPIs) from publications are not able to cover the entire scientific literature on those interactions.Expand
The FEBS Letters/BioCreative II.5 experiment: making biological information accessible
TLDR
The FEBS Letters experiment and the BioCreative II.5 challenge generated the first quantitative data to support the debate on ways to supplement publications with structured information. Expand
Benchmarking of the 2010 BioCreative Challenge III text-mining competition by the BioGRID and MINT interaction databases
TLDR
NLP systems can in the first instance facilitate expert curation by refining the list of candidate publications that contain PPI data, and more ambitiously, NLP approaches may be able to directly extract relevant information from full-text articles for rapid inspection by expert curators. Expand
Overview of the BioCreative III Workshop
TLDR
It is concluded that the best performing systems for GN, P PI-ACT and PPI-IMT in realistic settings are not sufficient for fully automatic use and the importance of interactive systems is presented. Expand
Overview of BioCreative V BioC Track
BioC is a simple XML format for text, annotations and relations, and was developed to achieve interoperability for biomedical text processing. Following the success of BioC in BioCreative IV, theExpand
Detection of interaction articles and experimental methods in biomedical literature
TLDR
A system for document classification which crucially relies on the results of an advanced pipeline of natural language processing tools and a system which is capable of detecting all experimental methods mentioned in scientific literature, and listing them with a competitive ranking. Expand
The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text
TLDR
The results of the ACT task of BioCreative III indicate that classification of large unbalanced article collections reflecting the real class imbalance is still challenging, and text-mining tools that report ranked lists of relevant articles for manual selection can potentially reduce the time needed to identify half of the relevant articles to less than 1/4 of the time when compared to unranked results. Expand
Detecting experimental techniques and selecting relevant documents for protein-protein interactions from biomedical literature
TLDR
A novel approach that converts the multi-class, multi-label classification problem to a binary classification problem showed much promise in IMT, and concluded that contextual words surrounding named entities, as well as the MeSH headings associated with the documents were among the main contributors to the performance. Expand
Efficient Extraction of Protein-Protein Interactions from Full-Text Articles
TLDR
This paper presents approaches for protein-named entity recognition, including normalization, and for extraction of protein-protein interactions from full text, and proposes strategies to transfer document-level annotations to the sentence-level, which allows for the creation of a more fine-grained training corpus. Expand
Dynamic programming re-ranking for PPI interactor and pair extraction in full-text articles
TLDR
A new re-ranking algorithm that considers co-occurrence among identifiers in an article to improve INT and IPT ranking results and can boost the IPT performance is presented. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 27 REFERENCES
Overview of the protein-protein interaction annotation extraction task of BioCreative II
TLDR
The BioCreative II PPI task is the first attempt to compare the performance of text-mining tools specific for each of the basic steps of the PPI extraction pipeline, and challenges identified range from problems in full-text format conversion of articles to difficulties in detecting interactor protein pairs and then linking them to their database records. Expand
Overview of BioCreAtIvE task 1B: normalized gene lists
TLDR
This assessment demonstrates that multiple groups were able to perform a real biological task across a range of organisms, and holds out promise that the technology can provide partial automation of the curation process in the near future. Expand
MINT and IntAct contribute to the Second BioCreative challenge: serving the text-mining community with high quality molecular interaction data
TLDR
The development of text-mining tools to extract protein-protein interaction information may increase the literature coverage achieved by manual curation and improve the efficiency of the curation process. Expand
Evaluation of BioCreAtIvE assessment of task 2
TLDR
Concepts provided by GO are currently the most extended set of terms used for annotating gene products, thus they were explored to assess how effectively text mining tools are able to extract those annotations automatically. Expand
Overview of BioCreative II gene normalization
TLDR
Major advances for the BioCreative II gene normalization task include broader participation (20 versus 8 teams) and a pooled system performance comparable to human experts, at over 90% agreement, which show promise as tools to link the literature with biological databases. Expand
Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge
TLDR
A common characteristic observed in all three tasks was that the combination of system outputs could yield better results than any single system, including the development of the first text-mining meta-server. Expand
Linking entries in protein interaction database to structured text: The FEBS Letters experiment
TLDR
An experiment is presented aimed at integrating each manuscript with a structured summary precisely reporting, with database identifiers and predefined controlled vocabularies, the protein interactions reported in the manuscript. Expand
BioCreAtIvE Task 1A: gene mention finding evaluation
TLDR
The 80% plus F-measure results are good, but still somewhat lag the best scores achieved in some other domains such as newswire, due in part to the complexity and length of gene names, compared to person or organization names in newswire. Expand
Overview of BioCreative II gene mention recognition
TLDR
It is demonstrated that, by combining the results from all submissions, an F score of 0.9066 is feasible, and furthermore that the best result makes use of the lowest scoring submissions. Expand
MINT: a Molecular INTeraction database
TLDR
MINT, a database designed to store data on functional interactions between proteins, consists of entries extracted from the scientific literature by expert curators assisted by ‘MINT Assistant’, a software that targets abstracts containing interaction information and presents them to the curator in a user‐friendly format. Expand
...
1
2
3
...