Making choices in Russian: pros and cons of statistical methods for rival forms

@article{Baayen2013MakingCI,
  title={Making choices in Russian: pros and cons of statistical methods for rival forms},
  author={R. Baayen and A. Endresen and L. Janda and A. Makarova and T. Nesset},
  journal={Russian Linguistics},
  year={2013},
  volume={37},
  pages={253--291}
}
Sometimes languages present speakers with choices among rival forms, such as the Russian forms ostrič’ vs. obstrič’ ‘cut hair’ and proniknuv vs. pronikši ‘having penetrated’. The choice of a given form is often influenced by various considerations involving the meaning and the environment (syntax, morphology, phonology). Understanding the behavior of rival forms is crucial to understanding the form-meaning relationship of language, yet this topic has not received as much attention as it…
Morphological variation and sensitivity to frequency of forms among native speakers of Czech
This article looks at inter-speaker variation in two environments: the genitive and locative singular cases of masculine ‘hard inanimate’ nouns in Czech, using a large-scale survey of native speakers…
The decade construction rivalry in Russian: Using a corpus to study historical linguistics
This article addresses the diachronic development of so-called rival forms, i.e., words or grammatical constructions that appear to be synonyms, based on a detailed empirical analysis of two…
A long birth
This article investigates the diachronic development of Russian numeral constructions consisting of a paucal numeral (dva ‘two’, tri ‘three’, četyre ‘four’) followed by an adjective and a…
Machine Meets Man: Evaluating the Psychological Reality of Corpus-based Probabilistic Models
TLDR
This paper addresses the question of how to evaluate such models and explicitly compare the performance of a statistical model derived from a corpus with that of native speakers in selecting one of six Russian TRY verbs.
Pitting corpus-based classification models against each other: a case study for predicting constructional choice in written Estonian
TLDR
Two distinct modelling techniques are applied – logistic regression and naïve discriminative learning – to predict the choice between two constructional alternatives in written Estonian.
Who needs it? Variation in experiencer marking in Estonian ‘need’-constructions
In this paper, we tackle the twin issues of obligatoriness of semantic arguments and variation in their expression through a study of Estonian constructions denoting need. The variation under…
From intrusive to resumptive: The acquisition of wh-dependencies by Behdini learners of English
TLDR
It was shown that Behdini features true resumption, but RPs are not truly optional, and they are less marked than in English and subject to complex variability patterns, which appear to be associated with the interaction of split ergativity and the higher subject restriction.
With or without articles? A comparison of article-like determiners in Estonian and Finnish
In this paper, we compare the use and functions of definite and indefinite article-like determiners in Estonian and Finnish. Our main aim is to explore whether the factors that explain the choice of…
Two subjunctives or three?
This paper examines the use of the three non-periphrastic subjunctives in Spanish in embedded clauses under obligatory subjunctive predicates in the past tense in three Spanish varieties:…
Frequency data from corpora partially explain native-speaker ratings and choices in overabundant paradigm cells
TLDR
It is shown that different instantiations of frequency help interpret the way variation is perceived and maintained by native speakers, and certain types of absolute frequency seem to have a dominant role in production tasks.

References

SHOWING 1-10 OF 119 REFERENCES
The grammar of words : an introduction to linguistic morphology
TLDR
The Grammar of Words by Geert Booij covers a broad range of topics from structural questions to psycholinguistic issues and problems of language change, and is especially strong at showing the significance of empirical facts for theoretical reasoning.
‘Nu-drop’ in Russian verbs: a corpus-based investigation of morphological variation and change
In the present article we are offering a corpus-based analysis of nu-drop in Russian verbs, the process whereby certain verbs with the suffix -nu- omit this morpheme in past tense forms. We will…
Introducing Linguistic Morphology
An expanded and updated new edition of this best-selling introduction to linguistic morphology. The text guides the reader from the very first principles of the internal structure of words through to…
Predicting the dative alternation
TLDR
Presenting a case study of work on the English dative alternation, it is shown that linguistic intuitions of grammaticality are deeply flawed and seriously underestimate the space of grammatical possibility, and that the four problems in the critique of usage data are empirical issues that can be resolved by using modern statistical theory and modeling strategies widely used in other fields.
Capturing correlational structure in Russian paradigms: A case study in logistic mixed-effects modeling
TLDR
This study addresses the statistical analysis of a phenomenon in Russian verbal paradigms, a suffix shift that is spreading through the paradigm and making it more regular, and shows how mixed-effects modeling can be used to do this in a principled way.
An amorphous model for morphological processing in visual comprehension based on naive discriminative learning.
TLDR
A 2-layer symbolic network model based on the equilibrium equations of the Rescorla-Wagner model (Danks, 2003) is proposed, showing that for pseudo-derived words no special morpho-orthographic segmentation mechanism is required and predicting that productive affixes afford faster response latencies for new words.
The effects of frequency and neighbourhood density on adult speakers’ productivity with Polish case inflections: an empirical test of usage-based approaches to morphology
An experiment testing adult Polish speakers’ ability to supply dative forms of unfamiliar nouns revealed strong effects of type frequency (performance was better on inflections that apply to…
Aspectual clusters of Russian verbs
The traditional “pair” model of Russian aspect fails to distinguish among Perfectives and ignores the fact that most verbs exist in larger clusters of three or more aspectually related forms. I…
Naive v. expert intuitions: An empirical study of acceptability judgments
Judgments about the grammaticality/acceptability of sentences are the most widely used data source in the syntactic literature. Typically, syntacticians rely on their own judgments, or those…
Models, forests, and trees of York English: Was/were variation as a case study for statistical practice
TLDR
It is concluded that variationist research can be substantially enriched by an expanded tool kit, including mixed-effects models, random forests, and conditional inference trees that may open additional possibilities for data exploration, analysis, and interpretation.