ScispaCy: Fast and Robust Models for Biomedical Natural Language Processing

@inproceedings{Neumann2019ScispaCyFA,
  title={ScispaCy: Fast and Robust Models for Biomedical Natural Language Processing},
  author={Mark Neumann and Daniel King and Iz Beltagy and Waleed Ammar},
  booktitle={BioNLP@ACL},
  year={2019}
}
Despite recent advances in natural language processing, many statistical models for processing text perform extremely poorly under domain shift. Processing biomedical and clinical text is a critically important application area of natural language processing, for which there are few robust, practical, publicly available models. This paper describes scispaCy, a new tool for practical biomedical/scientific text processing, which heavily leverages the spaCy library. We detail the performance of… CONTINUE READING
5
Twitter Mentions

Citations

Publications citing this paper.

References

Publications referenced by this paper.
SHOWING 1-10 OF 44 REFERENCES

Developing a Robust Part-of-Speech Tagger for Biomedical Text

  • Panhellenic Conference on Informatics
  • 2005
VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

Distant Supervision for Cancer Pathway Extraction from Text

  • Pacific Symposium on Biocomputing
  • 2014
VIEW 1 EXCERPT
HIGHLY INFLUENTIAL

Concept annotation in the CRAFT corpus

  • BMC Bioinformatics
  • 2011
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Ontonotes: A large training corpus for enhanced processing

Ralph Weischedel, Eduard Hovy, +4 authors Nianwen Xue.
  • Joseph Olive, Caitlin Christian-
  • 2011
VIEW 1 EXCERPT
HIGHLY INFLUENTIAL

Construction of the literature graph in semantic scholar

Peters, Joanna Power, +5 authors Oren Etzioni.
  • NAACLHLT.
  • 2018

Similar Papers