Named Entity Recognition in Biomedical Texts using an HMM Model

  title={Named Entity Recognition in Biomedical Texts using an HMM Model},
  author={Shaojun Zhao},
  • Shaojun Zhao
  • Published in NLPBA/BioNLP 2004
  • Computer Science
  • Although there exists a huge number of biomedical texts online, there is a lack of tools good enough to help people get information or knowledge from them. Named entity Recognition (NER) becomes very important for further processing like information retrieval, information extraction and knowledge discovery. We introduce a Hidden Markov Model (HMM) for NER, with a word similarity-based smoothing. Our experiment shows that the word similarity-based smoothing can improve the performance by using… CONTINUE READING
    Integrating linguistic knowledge into a conditional random fieldframework to identify biomedical named entities
    • 49
    • Highly Influenced
    • Open Access
    Evaluating Word Representation Features in Biomedical Named Entity Recognition Tasks
    • 105
    • Highly Influenced
    • Open Access
    Character-level neural network for biomedical named entity recognition
    • 72
    • Open Access
    Biomedical Named Entity Recognition: A Poor Knowledge HMM-Based Approach
    • 27
    • Open Access
    Biomedical Named Entity Recognition Based on Skip-Chain CRFS
    • 20
    • Highly Influenced
    Bio Named Entity Recognition Based on Co-training Algorithm
    • 16


    Publications referenced by this paper.
    Recognizing names in biomedical texts: a machine learning approach
    • 235
    • Highly Influential
    • Open Access
    Tuning support vector machines for biomedical named entity recognition
    • 251
    • Open Access
    Toward information extraction: identifying protein names from biological papers.
    • 571
    • Open Access
    Extracting the Names of Genes and Gene Products with a Hidden Markov Model
    • 282
    • Open Access
    Automatic Retrieval and Clustering of Similar Words
    • 1,867
    • Open Access
    Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL
    • 1,416
    • Open Access
    Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Extraction.
    • 161
    • Open Access
    The GENIA corpus: an annotated research abstract corpus in molecular biology domain
    • 280
    • Highly Influential
    • Open Access
    Two-Phase Biomedical NE Recognition based on SVMs
    • 100
    • Open Access
    An Algorithm that Learns What's in a Name
    • 813
    • Open Access