Analyzing the Moving Parts of a Large-Scale Multi-label Text Classification Pipeline: Experiences in Indexing Biomedical Articles

@article{Rios2015AnalyzingTM,
  title={Analyzing the Moving Parts of a Large-Scale Multi-label Text Classification Pipeline: Experiences in Indexing Biomedical Articles},
  author={Anthony Rios and Ramakanth Kavuluru},
  journal={2015 International Conference on Healthcare Informatics},
  year={2015},
  pages={1-7}
}
  • Anthony Rios, Ramakanth Kavuluru
  • Published in
    International Conference on…
    2015
  • Computer Science, Medicine
  • Medical subject headings (MeSH) is a controlled hierarchical vocabulary used by the National Library of Medicine (NLM) to index biomedical articles. In the 2014 version of MeSH terminology there are a total of 27,149 terms. Librarians at the NLM tag each biomedical article to be indexed for the PubMed literature search system with terms from MeSH. This means the human indexers look at each article's full text and index it with a small set of descriptors, 13 on average, from over 27,000… CONTINUE READING

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 26 REFERENCES

    Large scale multi-label classification via metalabeler

    VIEW 11 EXCERPTS
    HIGHLY INFLUENTIAL

    PubMed related articles: a probabilistic topic-based model for content similarity

    VIEW 14 EXCERPTS
    HIGHLY INFLUENTIAL

    Scikit-learn: Machine Learning in Python

    VIEW 6 EXCERPTS
    HIGHLY INFLUENTIAL

    A study of thresholding strategies for text categorization

    VIEW 6 EXCERPTS
    HIGHLY INFLUENTIAL

    Multi-label Classification Using Ensembles of Pruned Sets

    VIEW 7 EXCERPTS
    HIGHLY INFLUENTIAL