Corpus ID: 2575762

Learning Morphology with Morfette

@inproceedings{Chrupaa2008LearningMW,
  title={Learning Morphology with Morfette},
  author={Grzegorz Chrupała and Georgiana Dinu and Josef van Genabith},
  booktitle={LREC},
  year={2008}
}
  • Grzegorz Chrupała, Georgiana Dinu, Josef van Genabith
  • Published in LREC 2008
  • Computer Science
  • Morfette is a modular, data-driven, probabilistic system which learns to perform joint morphological tagging and lemmatization from morphologically annotated corpora. The system is composed of two learning modules which are trained to predict morphological tags and lemmas using the Maximum Entropy classifier. The third module dynamically combines the predictions of the Maximum-Entropy models and outputs a probability distribution over tag-lemma pair sequences. The lemmatization module exploits…
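
The three-module design summarized in the abstract can be illustrated with a small sketch. It is not Morfette's implementation: the toy probability tables below merely stand in for the two trained Maximum Entropy models, and the beam search, the renormalization step, and the names `decode`, `toy_tag_model`, and `toy_lemma_model` are assumptions made for illustration. The sketch only shows one plausible way to combine per-token tag and lemma predictions into a distribution over tag-lemma pair sequences.

```python
import math
from typing import Callable, Dict, List, Tuple

# Hypothetical interfaces standing in for the two trained models:
#   tag_model(words, i, prev_tag) -> {tag: P(tag | context)}
#   lemma_model(word, tag)        -> {lemma: P(lemma | word, tag)}
TagModel = Callable[[List[str], int, str], Dict[str, float]]
LemmaModel = Callable[[str, str], Dict[str, float]]
Pairs = List[Tuple[str, str]]  # one (tag, lemma) pair per token


def decode(words: List[str], tag_model: TagModel, lemma_model: LemmaModel,
           beam_size: int = 3) -> List[Tuple[float, Pairs]]:
    """Beam search over (tag, lemma) pair sequences (an assumed combiner)."""
    beam: List[Tuple[float, Pairs]] = [(0.0, [])]  # (log-probability, pairs)
    for i, word in enumerate(words):
        candidates: List[Tuple[float, Pairs]] = []
        for logp, pairs in beam:
            prev_tag = pairs[-1][0] if pairs else "<s>"
            for tag, p_tag in tag_model(words, i, prev_tag).items():
                for lemma, p_lemma in lemma_model(word, tag).items():
                    if p_tag <= 0.0 or p_lemma <= 0.0:
                        continue  # skip impossible pairs, avoids log(0)
                    score = logp + math.log(p_tag) + math.log(p_lemma)
                    candidates.append((score, pairs + [(tag, lemma)]))
        # Keep only the highest-scoring partial sequences.
        candidates.sort(key=lambda h: h[0], reverse=True)
        beam = candidates[:beam_size]
    # Renormalize over the surviving hypotheses so the scores sum to 1.
    total = sum(math.exp(logp) for logp, _ in beam)
    return [(math.exp(logp) / total, pairs) for logp, pairs in beam]


def toy_tag_model(words: List[str], i: int, prev_tag: str) -> Dict[str, float]:
    # Made-up numbers, not learned from any corpus.
    if words[i] == "saw":
        return {"VERB": 0.8, "NOUN": 0.2}
    return {"PRON": 1.0}


def toy_lemma_model(word: str, tag: str) -> Dict[str, float]:
    # Made-up numbers, not learned from any corpus.
    if word == "saw" and tag == "VERB":
        return {"see": 0.9, "saw": 0.1}
    return {word: 1.0}


results = decode(["she", "saw"], toy_tag_model, toy_lemma_model)
for prob, pairs in results:
    print(f"{prob:.2f}", pairs)
```

With these toy tables the highest-ranked sequence tags "saw" as a verb with lemma "see". In a real system the two callables would wrap trained classifiers, and the beam width would trade accuracy against speed.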

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 113 CITATIONS

    FinnPos: an open-source morphological tagging and lemmatization toolkit for Finnish

    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    Hybrid algorithms for preprocessing agglutinative languages and less-resourced domains effectively

    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    Evaluating Lemmatization Models for Machine-Assisted Corpus-Dictionary Linkage

    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    Efficient Higher-Order CRFs for Morphological Tagging

    CITES METHODS
    HIGHLY INFLUENCED

    Efficient induction of probabilistic word classes with LDA

    CITES METHODS & RESULTS

    Code-switching in Irish tweets: a preliminary analysis

    CITES METHODS
    HIGHLY INFLUENCED

    CITATION STATISTICS

    • 28 Highly Influenced Citations

    • Averaged 12 Citations per year from 2017 through 2019

    • 56% Increase in citations per year in 2019 over 2018
