Automatic Segmentation of Texts and Corpora

Authors: Cyril Labbé, Dominique Labbé, Pierre Hubert
Journal: Journal of Quantitative Linguistics
Segmenting large textual corpora is one of the major questions facing literary studies. We combine two relevant methods. First, vocabulary growth analysis highlights the main discontinuities in a work. Second, these results are supplemented by an analysis of variations in vocabulary diversity within corpora. A segmentation algorithm, paired with a validity test, identifies the optimal succession of distinct stages. This method is applied to Racine’s works and…
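
To make the two measurements in the abstract concrete, here is a minimal Python sketch. It is an illustration only, not the authors' algorithm: the function names (`vocabulary_growth`, `window_diversity`, `segment`) are hypothetical, and the segmentation step is a generic least-squares split by dynamic programming standing in for the paper's procedure. The validity test mentioned in the abstract is not reproduced.

```python
from typing import List, Sequence


def vocabulary_growth(tokens: Sequence[str]) -> List[int]:
    """Number of distinct word types seen after each successive token."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok.lower())
        growth.append(len(seen))
    return growth


def window_diversity(tokens: Sequence[str], window: int) -> List[int]:
    """Distinct types in each consecutive, non-overlapping window of tokens."""
    return [
        len({t.lower() for t in tokens[i:i + window]})
        for i in range(0, len(tokens) - window + 1, window)
    ]


def segment(series: Sequence[float], k: int) -> List[int]:
    """Split `series` into k contiguous segments (assumes 1 <= k <= len(series)).

    Minimizes the summed squared deviation of each segment from its own
    mean. Returns the start indices of segments 2..k. This is a generic
    stand-in (an assumption), not the method of the paper.
    """
    n = len(series)
    # Prefix sums give the squared-deviation cost of any slice in O(1).
    s1, s2 = [0.0], [0.0]
    for x in series:
        s1.append(s1[-1] + x)
        s2.append(s2[-1] + x * x)

    def cost(i: int, j: int) -> float:
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    inf = float("inf")
    # best[m][j]: minimal cost of splitting series[:j] into m segments.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                c = best[m - 1][i] + cost(i, j)
                if c < best[m][j]:
                    best[m][j] = c
                    back[m][j] = i

    # Walk the back-pointers from the end to recover segment starts.
    starts = []
    j = n
    for m in range(k, 0, -1):
        j = back[m][j]
        starts.append(j)
    return sorted(starts)[1:]  # drop the leading 0
```

In use, one would tokenize a text, compute `window_diversity` over fixed-size windows, and pass the resulting series to `segment` for a chosen number of stages. Choosing that number, and checking that adjacent stages really differ, is the role of the validity test, which this sketch leaves out.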



