An attempt to develop a lemmatiser for the Historical Corpus of Hungarian Gabriella Kiss and

Abstract

For the project of the Historical Dictionary of Hungarian a carefully selected representative corpus was collected (24.5 million running words). The texts were chosen from three centuries. A morphological analyser programme was successfully run on the modern texts, but the analysis of the earlier texts was problematic. In our paper we will describe a method… (More)

Topics

  • Presentations referencing similar topics