Slavic Corpus and Computational Linguistics

  title={Slavic Corpus and Computational Linguistics},
  author={D. Divjak and Serge Sharoff and Toma{\v{z}} Erjavec},
  journal={Journal of Slavic Linguistics},
  pages={171--199}
Abstract: In this paper we focus on corpus-linguistic studies that address theoretical questions and on computational linguistic work on corpus annotation that makes corpora useful for linguistic analysis. First we discuss why the corpus-linguistic approach was discredited by generative linguists in the second half of the 20th century, how it made a comeback through advances in computing, and how it was finally adopted by usage-based linguistics at the beginning of the 21st century. Then we move on to…

Construction Grammar in the Service of Slavic Linguistics, and Vice Versa

Abstract: This paper explores the connection between Slavic languages and the theoretical tenets of construction grammar, a cognitively and functionally oriented approach to linguistic analysis. The

Impersonalization in Slavic: A Corpus-Based Study of Impersonalization Strategies in Six Slavic Languages

Abstract: This paper gives a comprehensive overview of how impersonalization is expressed in Slavic. It presents the results of a comparative corpus study, outlining all possible strategies for

Verb rhyme in Russian poetry: a quantitative analysis

This article provides a quantitative corpus-based investigation of the Russian verb rhyme and its change in the Russian poetic tradition from the beginning of the 19th century to the 1960s.

Lexical Diversity and Colour Hues in Russian Poetry: A Corpus-Based Study of Adjectives

This paper describes the distribution of colour adjectives in Russian poetry of the Silver Age and defines individual preferences with regard to poetic tradition, syllable structure, and metrical



Corpus Linguistics at Work

The book adopts and exemplifies the parameters of the corpus-driven approach and posits a new unit of linguistic description defined systematically in the light of corpus evidence.

Usage-based approaches in Cognitive Linguistics: A technical state of the art

A technical state of the art in usage-based linguistics as defined in the context of Cognitive Linguistics is presented, and two criteria emerge as essential for a genuine corpus-oriented usage-based linguistics, viz. the use of quantitative techniques and the systematic operationalization of research hypotheses.

Corpora in cognitive linguistics: corpus-based approaches to syntax and lexis

The volume adopts the methodological perspective of Corpus Linguistics, the rapidly evolving branch of linguistics based on the computerized analysis of language used in authentic settings, to pave the way for further integration of usage-based techniques of analysis within this exciting paradigm.

Cognitive Linguistics’ seven deadly sins

Cognitive Linguistics is an approach to language study based on three central premises: that the function of language is to convey meaning, that linguistic description must rely on constructs that

Grammatical Profiles and Aspect in Old Church Slavonic

We employ a new empirical approach to an enduring controversy concerning the development of a system of imperfective vs. perfective verbs in Slavic. While scholars once claimed that this is an

Towards the National Corpus of Polish

This paper presents a new corpus project, aiming at building a national corpus of Polish, with an eye on specific applications in various fields, including lexicography and natural language processing.

Frequency effects in language learning and processing

The volume contains a collection of studies on how the analysis of corpus and psycholinguistic data reveals the ways in which linguistic knowledge is affected by the frequency of linguistic elements/stimuli. The

What constructional profiles reveal about synonymy: A case study of Russian words for sadness and happiness

Abstract: We test two hypotheses relevant to the form-meaning relationship and offer a methodological contribution to the empirical study of near-synonymy within the framework of cognitive

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora

The paper presents the third release of the MULTEXT-East language resources, which brings together the first two, makes them available in TEI P4 XML, and introduces further extensions, e.g., the specification for Resian, a dialect of Slovene.