The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers

@inproceedings{RebholzSchuhmann2010TheCS,
  title={The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers},
  author={Dietrich Rebholz-Schuhmann and Antonio Jimeno-Yepes and Erik M. van Mulligen and Ning Kang and Jan A. Kors and David Milward and Peter T. Corbett and Ekaterina Buyko and Katrin Tomanek and Elena Beisswanger and Udo Hahn},
  booktitle={LREC},
  year={2010}
}
The production of gold standard corpora is time-consuming and costly. We propose an alternative: the ‚silver standard corpus ̳ (SSC), a corpus that has been generated by the harmonisation of the annotations that have been delivered from a selection of annotation systems. The systems have to share the type system for the annotations and the harmonisation solution has use a suitable similarity measure for the pair-wise comparison of the annotations. The annotation systems have been evaluated… CONTINUE READING
Highly Cited
This paper has 26 citations. REVIEW CITATIONS
17 Citations
10 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 17 extracted citations

References

Publications referenced by this paper.
Showing 1-10 of 10 references

An overview of JCoRe, the JULIE Lab UIMA Component Repository, In Proceedings of the LREC'08 Workshop ̳Towards Enhanced Interoperability for Large HLT Systems: UIMA for NLP

  • U Hahn
  • 2008

Peregrine: Lightweight gene name normalization by dictionary lookup, Proceedings of the Biocreative 2 workshop

  • M. Schuemie
  • 2007

Similar Papers

Loading similar papers…