Corpus ID: 8015669

Automatic Generation of Story Highlights

@inproceedings{Woodsend2010AutomaticGO,
  title={Automatic Generation of Story Highlights},
  author={Kristian Woodsend and Mirella Lapata},
  booktitle={ACL},
  year={2010}
}
In this paper we present a joint content selection and compression model for single-document summarization. The model operates over a phrase-based representation of the source document which we obtain by merging information from PCFG parse trees and dependency graphs. Using an integer linear programming formulation, the model learns to select and combine phrases subject to length, coverage and grammar constraints. We evaluate the approach on the task of generating "story highlights"---a small… Expand
Title Generation with Quasi-Synchronous Grammar
TLDR
A model that simultaneously optimizes selection and rendering preferences is presented, based on an integer linear programming formulation, that learns to generate summaries that satisfy both types of preferences, while ensuring that length, topic coverage and grammar constraints are met. Expand
Improving Multi-documents Summarization by Sentence Compression based on Expanded Constituent Parse Trees
TLDR
An innovative sentence compression method by considering every node in the constituent parse tree and deciding its status – remove or retain is proposed, which has similar ROUGE-2 scores but better linguistic quality on TAC data. Expand
Abstractive Multi-Document Summarization via Phrase Selection and Merging
TLDR
This work proposes an abstraction-based multi-document summarization framework that can construct new sentences by exploring more fine-grained syntactic units than sentences, namely, noun/verb phrases, and achieves reasonably well results on manual linguistic quality evaluation. Expand
Jointly Learning to Extract and Compress
TLDR
A joint model of sentence extraction and compression for multi-document summarization and its jointly extracted and compressed summaries outperform both unlearned baselines and the authors' learned extraction-only system on both ROUGE and Pyramid, without a drop in judged linguistic quality. Expand
Improving Summarization Quality with Topic Modeling
TLDR
This paper considers two summarization models - supervised and unsupervised - enhanced by integrating the topic knowledge into their standard lexicon-based models, providing a new measure for the informaition coverage. Expand
Utilizing Microblogs for Automatic News Highlights Extraction
TLDR
A novel method to improve news highlights extraction by using microblogs based on the hypothesis that microblog posts, although noisy, are not only indicative of important pieces of information in the news story, but also inherently “short and sweet” resulting from the artificial compression effect due to the length limit. Expand
Topic modeling combined with classification technique for extractive multi-document text summarization
  • R. Roul
  • Computer Science
  • Soft Comput.
  • 2021
TLDR
The paper proposes a novel methodology for summarizing a corpus of documents to generate a coherent summary using topic modeling and classification technique and the outcomes show that the proposed model is more promising compared to the well-known text summarization models. Expand
Multilingual Summarization with Polytope Model
TLDR
This paper describes the linear programming-based global optimization model to rank and extract the most relevant sentences to a summary, and introduces three different objective functions being optimized. Expand
Neural Summarization by Extracting Sentences and Words
TLDR
This work develops a general framework for single-document summarization composed of a hierarchical document encoder and an attention-based extractor that allows for different classes of summarization models which can extract sentences or words. Expand
Iterative Document Representation Learning Towards Summarization with Polishing
In this paper, we introduce Iterative Text Summarization (ITS), an iteration-based model for supervised extractive text summarization, inspired by the observation that it is often necessary for aExpand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 36 REFERENCES
Headline Generation Based on Statistical Translation
TLDR
This paper presents results on experiments using this approach, in which statistical models of the term selection and term ordering are jointly applied to produce summaries in a style learned from a training corpus. Expand
Summarization with a Joint Model for Sentence Extraction and Compression
TLDR
This work proposes a one-step approach for document summarization that jointly performs sentence extraction and compression by solving an integer linear program. Expand
Syntactic Simplification for Improving Content Selection in Multi-Document Summarization
TLDR
It is shown how simplifying parentheticals by removing relative clauses and appositives results in improved sentence clustering, by forcing clustering based on central rather than background information. Expand
Summarization beyond sentence extraction: A probabilistic approach to sentence compression
TLDR
This paper focuses on sentence compression, a simpler version of this larger challenge, and aims to achieve two goals simultaneously: the authors' compressions should be grammatical, and they should retain the most important pieces of information. Expand
Multi-candidate reduction: Sentence compression as a tool for document summarization tasks
This article examines the application of two single-document sentence compression techniques to the problem of multi-document summarization-a ''parse-and-trim'' approach and a statisticalExpand
Using Hidden Markov Modeling to Decompose Human-Written Summaries
TLDR
A hidden Markov model solution to the decomposition problem of summary sentence decomposition is proposed, which can lead to better text generation techniques for summarization. Expand
Automatic Text Summarization of Newswire: Lessons Learned from the Document Understanding Conference
TLDR
An overview of the achieved results in the different types of summarization tasks, comparing both the broader classes of baselines, systems and humans, as well as individual pairs of summarizers (both human and automatic). Expand
A Noisy-Channel Model for Document Compression
TLDR
The system outperforms both a baseline and a sentence-based compression system that operates by simplifying sequentially all sentences in a text and supports the claim that discourse knowledge plays an important role in document summarization. Expand
Experimenting with Clause Segmentation for Text Summarization
TLDR
In this paper, experiments with clause segmentation in producing summaries for the TAC 2008 Update Summarization Track suggest that a slight trend in improvement was detected. Expand
Cut and Paste Based Text Summarization
TLDR
This work includes a statistically based sentence decomposition program that identifies where the phrases of a summary originate in the original document, producing an aligned corpus of summaries and articles which is used to develop the summarizer. Expand
...
1
2
3
4
...