Corpus ID: 1557484

Efficient induction of probabilistic word classes with LDA

@inproceedings{Chrupaa2011EfficientIO,
  title={Efficient induction of probabilistic word classes with LDA},
  author={Grzegorz Chrupała},
  booktitle={IJCNLP},
  year={2011}
}
  • Grzegorz Chrupała
  • Published in IJCNLP 2011
  • Computer Science
  • Word classes automatically induced from distributional evidence have proved useful many NLP tasks including Named Entity Recognition, parsing and sentence retrieval. The Brown hard clustering algorithm is commonly used in this scenario. Here we propose to use Latent Dirichlet Allocation in order to induce soft, probabilistic word classes. We compare our approach against Brown in terms of efficiency. We also compare the usefulness of the induced Brown and LDA word classes for the semi-supervised… CONTINUE READING

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 26 CITATIONS

    Arabic Named Entity Recognition Using Topic Modeling

    VIEW 5 EXCERPTS
    CITES METHODS, RESULTS & BACKGROUND
    HIGHLY INFLUENCED

    Fine-Grained POS Tagging of German Tweets

    VIEW 4 EXCERPTS
    CITES METHODS
    HIGHLY INFLUENCED

    Text segmentation with character-level text embeddings

    VIEW 7 EXCERPTS
    CITES METHODS & BACKGROUND

    Concurrent Acquisition of Word Meaning and Lexical Categories

    VIEW 4 EXCERPTS
    CITES METHODS & BACKGROUND

    Hierarchical clustering of word class distributions

    VIEW 6 EXCERPTS
    CITES METHODS & BACKGROUND

    Semi-Supervised Methods for Out-of-Domain Dependency Parsing

    VIEW 5 EXCERPTS
    CITES BACKGROUND & METHODS
    HIGHLY INFLUENCED

    Investigation of features for extraction of named entities from texts in Russian

    VIEW 2 EXCERPTS
    CITES METHODS
    HIGHLY INFLUENCED

    Computational Modeling and Simulation of Language and Meaning: Similarity-Based Approaches

    VIEW 4 EXCERPTS
    CITES BACKGROUND & METHODS
    HIGHLY INFLUENCED

    Semi-supervised experiments at LORIA for the SPMRL 2014 Shared Task

    VIEW 3 EXCERPTS
    CITES METHODS & RESULTS
    HIGHLY INFLUENCED

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 40 REFERENCES

    Design Challenges and Misconceptions in Named Entity Recognition

    VIEW 5 EXCERPTS
    HIGHLY INFLUENTIAL

    Latent Dirichlet Allocation

    VIEW 7 EXCERPTS
    HIGHLY INFLUENTIAL

    FBK-IRST: Semantic Relation Extraction Using Cyc

    VIEW 3 EXCERPTS
    HIGHLY INFLUENTIAL

    Building a Treebank for French

    VIEW 1 EXCERPT
    HIGHLY INFLUENTIAL