On the Advantages of Word Frequency and Contextual Diversity Measures Extracted from Subtitles: The Case of Portuguese

@article{Soares2015OnTA,
  title={On the Advantages of Word Frequency and Contextual Diversity Measures Extracted from Subtitles: The Case of Portuguese},
  author={A. Soares and J. Machado and A. Costa and {\'A}lvaro Iriarte and A. Sim{\~o}es and Jos{\'e} Jo{\~a}o de Almeida and M. Comesa{\~n}a and Manuel Perea},
  journal={Quarterly Journal of Experimental Psychology},
  year={2015},
  volume={68},
  pages={680--696}
}
  • A. Soares, J. Machado, +5 authors Manuel Perea
  • Published 2015
  • Computer Science, Medicine
  • Quarterly Journal of Experimental Psychology
  • We examined the potential advantage of lexical databases based on subtitles and present SUBTLEX-PT, a new lexical database for 132,710 Portuguese words obtained from a 78-million-word corpus of film and television series subtitles, offering word frequency and contextual diversity measures. Additionally, we validated SUBTLEX-PT with a lexical decision study involving 1920 Portuguese words (and 1920 nonwords) with different lengths in letters (M = 6.89, SD = 2.10) and syllables (M = 2.99, SD = 0…
    27 Citations

    SUBTLEX-CAT: Subtitle word frequencies and contextual diversity for Catalan
    The role of word frequency and contextual diversity in visual word recognition: a mini review
    Procura-PALavras (P-PAL): A Web-based interface for a new European Portuguese lexical database
    Disentangling the effects of word frequency and contextual diversity on serial recall performance
