Tools for historical corpus research , and a corpus of Latin

  title={Tools for historical corpus research , and a corpus of Latin},
  author={Adam Kilgarriff},
We present LatinISE, a Latin corpus for the Sketch Engine. LatinISE consists of Latin works comprising a total of 13 million words, covering the time span from the 2 nd century B. C. to the 21 st century A. D. LatinISE is provided with rich metadata mark-up, including author, title, genre, era, date and century, as well as book, section, paragraph and line of verses. We have automatically annotated LatinISE with lemma and part-of-speech information. The annotation enables the users to search… CONTINUE READING

From This Paper

Figures, tables, and topics from this paper.


Publications referenced by this paper.
Showing 1-5 of 5 references

Creating a parallel treebank of the old Indo - European Bible translations

  • Adam Rychly Kilgarriff, Pavel Smrz, David Pavel Tugwell
  • Calzolari , Nicoletta / Choukri , Khalid…
  • 2008

The Design and Use of a Latin Dependency Treebank

  • Bamman, DavidCrane, Gregory
  • Proceedings of the Fifth International Workshop…
  • 2006

The Sketch Engine

  • Kilgarriff, AdamRychly, PavelSmrz, PavelTugwell, David
  • Proceedings of the eleventh Euralex International…
  • 2004
1 Excerpt

Similar Papers

Loading similar papers…