Predicting the Semantic Orientation of Adjectives
A log-linear regression model uses constraints from conjunctions to predict whether conjoined adjectives have the same or different orientations, achieving 82% accuracy on this task when each conjunction is considered independently.
Discourse Segmentation of Multi-Party Conversation
A domain-independent topic segmentation algorithm for multi-party speech that combines knowledge about content, using a text-based algorithm as a feature, with knowledge about form, using linguistic and acoustic cues about topic shifts extracted from speech.
Text generation - using discourse strategies and focus constraints to generate natural language text
- K. McKeown
- Sociology
- Studies in natural language processing
- 1 March 1985
Preface Introduction 2. Discourse structure 3. Focusing in discourse 4. TEXT system implementation 5. Discourse history 6. Related generation research 7. Summary and conclusions Appendices…
Extracting Social Networks from Literary Fiction
The method involves character name chunking, quoted speech attribution, and conversation detection given the set of quotes; the results provide evidence that the majority of novels in this time period do not fit two characterizations provided by literary scholars.
Inferring Strategies for Sentence Ordering in Multidocument News Summarization
A strategy for ordering information that combines constraints from chronological order of events and topical relatedness is implemented, and evaluation of the augmented algorithm shows a significant improvement of the ordering over two baseline strategies.
Translating Collocations for Bilingual Lexicons: A Statistical Approach
A program named Champollion is described which, given a pair of parallel corpora in two different languages and a list of collocations in one of them, automatically produces their translations. It provides a tool for compiling bilingual lexical information above the word level in multiple languages and for different domains.
Extracting Paraphrases from a Parallel Corpus
This work presents an unsupervised learning algorithm for identification of paraphrases from a corpus of multiple English translations of the same source text that yields phrasal and single word lexical paraphrasing as well as syntactic paraphrase.
A Survey of Text Summarization Techniques
This chapter gives a broad overview of existing approaches based on how representation, sentence scoring or summary selection strategies alter the overall performance of the summarizer, and points out some of the peculiarities of the task of summarization.
Introduction to the Special Issue on Summarization
This work focuses on automatic summarization of open-domain multiparty dialogues in diverse genres, and on the development of a robust practical text summarizer based on rhetorical structure extraction.
The Pyramid Method: Incorporating human content selection variation in summarization evaluation
This article proposes a method for analysis of multiple human abstracts into semantic content units, which serves as the basis for an evaluation method that incorporates the observed variation and is predictive of different equally informative summaries.