Knowledge and learning in natural language

  title={Knowledge and learning in natural language},
  author={Charles D. Yang},
The present dissertation is a study of language development in children. From a biological perspective, the development of language, as the development of any other organic systems, is an interaction between internal and external factors; specifically, between the child's internal knowledge of linguistic structures and the external linguistic experience he receives. Drawing insights from the study of biological evolution, we put forth a quantitative model of language acquisition that make this… 
Structures and distributions in morphology learning
This thesis investigates the acquisition of morphology and forms a cognitively-oriented computational framework for studying language acquisition that consists of four components: the linguistic representation, the statistical distribution of the input data, the observed behavior of the human learner, and the performance of the learning algorithm.
Multiple Grammars and the Logic of Learnability in Second Language Acquisition
It is argued that every language contains Multiple Grammars which may reflect different language families and a theory of how language modules interact to predict L2 acquisition paths is developed.
Language Acquisition Meets Language Evolution
It is argued that understanding the acquisition of any cultural form, whether linguistic or otherwise, during development, requires considering the corresponding question of how that cultural form arose through processes of cultural evolution, which helps resolve the "logical" problem of language acquisition.
Learning English Metrical Phonology:When Probability Distributions Are Not Enough
This paper focuses on the acquisition of the complex linguistic system of metrical phonology, using a model that learns from child-directed speech input and keeps in mind restrictions on the time course of acquisition and children’s cognitive limitations.
Language and Other Cognitive Systems. What Is Special About Language?
The traditional conception of language is that it is, in Aristotle's phrase, sound with meaning. The sound-meaning correlation is, furthermore, unbounded, an elementary fact that came to be
Learning and the language of thought
An inductive statistical model is presented over a compositionally structured representation system, a language of thought (LOT) (Fodor, 1975), that formalizes an optimal Bayesian trade-off between representational complexity and fit to the observed data.
Computational Models of First Language Acquisition Special Issue of Research on Language and Computation
This work evaluates a model of some aspect of language acquisition as a computational system and evaluating it on naturally occurring corpora to see whether naturally occurring distributions of examples in corpora provide sufficient information to support the studied claims across a divergent range of acquisition theories.
Combining Language Corpora With Experimental and Computational Approaches for Language Acquisition Research
Historically, first language acquisition research was a painstaking process of observation, requiring the laborious hand coding of children's linguistic productions, followed by the generation of
Evaluation measures and the study of language acquisition
Developmental linguistics finds a focus on diagnosing learners’ patterns of behavior as being due to grammatical knowledge or to factors of language use, and test a theory of linguistic universals and cross-linguistic variation by showing how that space constrains the hypotheses that learners consider during development.


A computational learning model for metrical phonology
Unsupervised Language Acquisition: Theory and Practice
This thesis presents various algorithms for the unsupervised machine learning of aspects of natural languages using a variety of statistical models, and examines the interaction between the various components to show how these algorithms can form the basis for a empiricist model of language acquisition.
Analogical Modeling of Language
This book addresses the general question of whether language is governed by rules or by analogy. Skousen's contribution to the long-standing debate on the subject is of primary importance, for he
The Selection of Syntactic Knowledge
This article develops the thesis that parameter setting is correctly viewed as nondeductive, and discusses three basic learnability properties that must characterize the learner's linguistic environment.
Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing
Embedded machine learning systems for natural language processing: Acquiring and updating hierarchical knowledge for machine translation based on a clustering technique and applying an existing machine learning algorithm to text categorization.
Analogy and the dual-route model of morphology
Learning language using genetic algorithms
Strict pattern-based methods of grammar induction are often frustrated by the apparently inexhaustible variety of novel word combinations in large corpora. Statistical methods offer a possible
ABL: Alignment-Based Learning
This ])al)er introdu(:es a new tyl)e of grammar learning algorithm, iilst)ired l)y sl;ring edit distance (Wagner and Fis(:her, 1974). The algor i thm takes a (:ortms of tlat S(~lltell(:es as input
Class-Based n-gram Models of Natural Language
This work addresses the problem of predicting a word from previous words in a sample of text and discusses n-gram models based on classes of words, finding that these models are able to extract classes that have the flavor of either syntactically based groupings or semanticallybased groupings, depending on the nature of the underlying statistics.
Ambiguity resolution in language learning - computational and cognitive models
This paper presents a meta-analyses of semantic categorisation in the context of Wikipedia, which aims to provide a scaffolding for the future categorisation of Wikipedia into categories of “food” and “people”.