The Icelandic Parsed Historical Corpus (IcePaHC)

  title={The Icelandic Parsed Historical Corpus (IcePaHC)},
  author={Eir{\'i}kur R{\"o}gnvaldsson and Anton Karl Ingason and Einar Freyr Sigurðsson and Joel Wallenberg},
We describe the background for and building of IcePaHC, a one million word parsed historical corpus of Icelandic which has just been finished. This corpus which is completely free and open contains fragments of 60 texts ranging from the late 12 century to the present. We describe the text selection and text collecting process and discuss the quality of the texts and their conversion to modern Icelandic spelling. We explain why we choose to use a phrase structure Penn style annotation scheme and… CONTINUE READING
Highly Cited
This paper has 31 citations. REVIEW CITATIONS


Publications referenced by this paper.
Showing 1-10 of 28 references

Annotation manual for the Penn historical corpora and the PCEEC

  • B. Santorini
  • 2010
Highly Influential
5 Excerpts

CorpusSearch 2 Users Guide. University of Pennsylvania, Philadelphia. (http:// Contents.html)

  • B. Randall
  • 2005
Highly Influential
3 Excerpts

The Penn-Helsinki Parsed Corpus of Middle English (PPCME2). Department of Linguistics, University of Pennsylvania. CD-ROM, second edition, (

  • A. Kroch, A. Taylor
  • 2000
Highly Influential
4 Excerpts

A Relative Pronoun in Old Norse? Paper presented at DiGS 13, University of Pennsylvania, Philadelphia

  • C. Sapp
  • June 5th,
  • 2011
1 Excerpt

Annotald, version 11.11. [Software for treebank annotation.] (

  • J. E. Beck, A. Ecay, A. K. Ingason
  • 2011

Coping with Variation in the Icelandic Parsed Historical Corpus (IcePaHC)

  • E. Rögnvaldsson, A. K. Ingason, E. F. Sigurðsson
  • Language Variation Infrastructure. Papers on…
  • 2011
1 Excerpt

Defining the Annotation Scheme of a Treebank: The End-Use Perspective

  • K. Muhonen, T. K. Purtonen
  • Proceedings of the 5th Language and Tech-
  • 2011
1 Excerpt

Similar Papers

Loading similar papers…