The Proposition Bank: An Annotated Corpus of Semantic Roles

  title={The Proposition Bank: An Annotated Corpus of Semantic Roles},
  author={Martha Palmer and Paul R. Kingsbury and Daniel Gildea},
  journal={Computational Linguistics},
The Proposition Bank project takes a practical approach to semantic representation, adding a layer of predicate-argument information, or semantic role labels, to the syntactic structures of the Penn Treebank. The resulting resource can be thought of as shallow, in that it does not represent coreference, quantification, and many other higher-order phenomena, but also broad, in that it covers every instance of every verb in the corpus and allows representative statistics to be calculated. We… 

Generalizing the semantic roles in the Chinese Proposition Bank

This article describes the effort to “re-annotate” the CPB with a set of “global” semantic roles that are predicate-independent and investigates their impact on automatic semantic role labeling systems.

Persian Proposition Bank

This paper describes the procedure of semantic role labeling and the development of the first manually annotated Persian Proposition Bank (PerPB) which added a layer of predicate-argument information

Generalizing the semantic roles in the Chinese Proposition Bank

This article describes the effort to “re-annotate” the CPB with a set of “global” semantic roles that are predicate-independent and investigates their impact on automatic semantic role labeling systems.

From Treebank to Propbank: A Semantic-Role and VerbNet Corpus for Danish

This paper presents the first version of a Danish Propbank/VerbNet corpus, annotated at both the morphosyntactic, dependency and semantic levels, and provides a statistical break-down of frames and roles for both the corpus as a whole and across different text types.

A Study of Dictionary Based Korean Semantic Role Labeling

This work proposes a method for automatic semantic role labeling (SRL) using frame files included in the Korean version of Proposition Bank, which is one of the most widely used corpora.

Adding semantic roles to the Chinese Treebank

This discussion focuses on the syntactic variations in the realization of the arguments as well as the approach to annotating dislocated and discontinuous arguments, and the creation of a lexical database of frame files and its role in guiding predicate-argument annotation.

Automatic Semantic Role Labeling for Chinese Verbs

The results using hand-crafted parses are slightly higher than the results reported for the state-of-the-art semantic role labeling systems for English using the Penn English Proposition Bank data, even though the Chinese Proposition Bank is smaller in size.

PropBank: Semantics of New Predicate Types

This research focuses on expanding PropBank, a corpus annotated with predicate argument structures, with new predicate types; namely, noun, adjective and complex predicates, such as Light Verb Constructions, in order for PropBank to reach the same level of coverage and continue to serve as the bedrock for Abstract Meaning Representation.

Issues in Synchronizing the English Treebank and PropBank

The types of mismatches between the syntactic bracketing and the semantic role labeling that can be found are discussed, and the plans for reconciling them are discussed.

A PropBank for Portuguese: the CINTIL-PropBank

CINTIL-PropBank is reported on, which makes possible to obtain a complete PropBank with both syntactic and semantic levels of linguistic annotation, and presents itself as a great resource to train a semantic role labeller.



Proposition Bank II: Delving Deeper

An overview of the second phase of PropBank Annotation, PropBank II, which is being applied to English and Chinese, and includes (Neodavidsonian) eventuality variables, nominal references, sense tagging, and connections to the Penn Discourse Treebank (PDTB), a project for annotating discourse connectives and their arguments.

Automatic Verb Classification Based on Statistical Distributions of Argument Structure

This work reports on supervised learning experiments to automatically classify three major types of English verbs, based on their argument structure-specifically, the thematic roles they assign to participants, using linguistically-motivated statistical indicators extracted from large annotated corpora to train the classifier.

Annotating the Propositions in the Penn Chinese Treebank

It is described how diathesis alternation patterns can be used to make coarse sense distinctions for Chinese verbs as a necessary step in annotating the predicate-structure of Chinese verbs.

The Necessity of Parsing for Predicate Argument Recognition

The effect of parser accuracy on these systems' performance is quantified, and the question of whether a flatter "chunked" representation of the input can be as effective for the purposes of semantic role identification is examined.

The Notion of Argument in Prepositional Phrase Attachment

This article refine the formulation of the problem of prepositional phrase (PP) attachment as a four-way disambiguation problem, and introduces a method to learn arguments and adjuncts based on a definition of arguments as a vector of features.

The Berkeley FrameNet Project

This report will present the project's goals and workflow, and information about the computational tools that have been adapted or created in-house for this work.

Thematic proto-roles and argument selection

As a novel attack on the perennially vexing questions of the theoretical status of thematic roles and the inventory of possible roles, this paper defends a strategy of basing accounts of roles on

Supertagging: An Approach to Almost Parsing

Novel methods for robust parsing that integrate the flexibility of linguistically motivated lexical descriptions with the robustness of statistical techniques are proposed.

From Grammar to Lexicon: Unsupervised Learning of Lexical Syntax

  • M. Brent
  • Linguistics
    Comput. Linguistics
  • 1993
This paper describes an approach based on two principles, rely on local morpho-syntactic cues to structure rather than trying to parse entire sentences, and treat these cues as probabilistic rather than absolute indicators of syntactic structure.

Head-Driven Statistical Models for Natural Language Parsing

Three statistical models for natural language parsing are described, leading to approaches in which a parse tree is represented as the sequence of decisions corresponding to a head-centered, top-down derivation of the tree.