Document sublanguage clustering to detect medical specialty in cross-institutional clinical texts

@article{DoingHarris2013DocumentSC,
  title={Document sublanguage clustering to detect medical specialty in cross-institutional clinical texts},
  author={Kristina Doing-Harris and Olga V Patterson and Sean Igo and John F. Hurdle},
  journal={Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics},
  year={2013},
  volume={2013},
  pages={9-12}
}
This paper reports on a set of studies designed to identify sublanguages in documents for domain-specific processing across institutions. Psychological evidence indicates that humans use context-specific linguistic information when they read. Natural Language Processing (NLP) pipelines are successful within specific domains (i.e., contexts). To limit the number of domain-specific NLP systems, a natural focus would be on sublanguages. Sublanguages are identified by shared lexical and semantic…
