A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization

@article{Nenkova2006ACC,
  title={A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization},
  author={A. Nenkova and Lucy Vanderwende and K. McKeown},
  journal={Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval},
  year={2006}
}
  • A. Nenkova, Lucy Vanderwende, K. McKeown
  • Published 2006
  • Computer Science
  • Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
The usual approach for automatic summarization is sentence extraction, where key sentences from the input documents are selected based on a suite of features. While word frequency often is used as a feature in summarization, its impact on system performance has not been isolated. In this paper, we study the contribution to summarization of three factors related to frequency: content word frequency, composition functions for estimating sentence importance from word frequency, and adjustment of… Expand
Measuring Importance and Query Relevance in Topic-focused Multi-document Summarization
TLDR
A principled comparison between the two most commonly used schemes for assigning importance to words in the context of query focused multi-document summarization finds that log-likelihood ratio is more suitable for query-focused summarization since, unlike raw frequency, it is more sensitive to the integration of the information need defined by the user. Expand
Content selection in multi-document summarization
TLDR
It is shown that a modular extractive summarizer using the estimates of word importance can generate summaries comparable to the state-of-the-art systems, and a new framework of system combination for multi-document summarization is presented. Expand
Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion
TLDR
This paper details the design of a generic extractive summarization system, which ranked first out of 22 systems in terms of overall mean Pyramid score; and in the human evaluation of summary responsiveness to the topic, the system ranked third out of 35 systems. Expand
Semantic argument frequency-based multi-document summarization
TLDR
This paper scores sentences according to their inclusion of frequent semantic phrases and form the summary using the top-scored sentences and compares this method with a term-based sentence scoring approach to investigate the effects of using semantic units instead of single words for sentence scoring. Expand
Multi-Document Summarization by Maximizing Informative Content-Words
We show that a simple procedure based on maximizing the number of informative content-words can produce some of the best reported results for multi-document summarization. We first assign a score toExpand
Summarization Approaches Based on Document Probability Distributions
TLDR
The research shows that the above summarizer, which is light and simple, can deliver good summaries comparable to other state-of-the-art systems. Expand
Multiple data document summarization
  • V. Kishore, P. Singh
  • Computer Science
  • 2017 Conference on Information and Communication Technology (CICT)
  • 2017
TLDR
A method for generating extraction based summaries using a stack based formulation that uses a sentence importance score calculator based on various semantic features and a semantic similarity score to select sentences that are the most representative of the document. Expand
Exploiting Surface, Content and Relevance Features for Learning-Based Extractive Summarization
TLDR
Various features under a learning-based classification framework are explored, including basic surface features, content features a sentence may represent and the features indicating the relevance among sentences, finding that relevance features are able to improve the summarization performance obviously. Expand
Multi-Document Summarization from First Principles
TLDR
The resulting system, SumBasic+, is a strong baseline system that is ideal for comparing with new summarization approaches, as it principally uses existing techniques and per- forms surprisingly well. Expand
A Survey of Text Summarization Techniques
TLDR
This chapter gives a broad overview of existing approaches based on how representation, sentence scoring or summary selection strategies alter the overall performance of the summarizer, and points out some of the peculiarities of the task of summarization. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 32 REFERENCES
Using N-Grams To Understand the Nature of Summaries
TLDR
Empirically characterize human-written summaries provided in a widely used summarization corpus and suggest that extraction-based techniques which have been successful for single-document summarization may not be sufficient when summarizing multiple documents. Expand
Creating and evaluating multi-document sentence extract summaries
TLDR
This paper examined how humans create multi-document summaries as well as the characteristics of such summaries and use these summaries to evaluate the performance of various multidocument summarization algorithms. Expand
Experiments in Multidocument Summarization
This paper describes a multidocument summarizer built upon research into the detection of new information. The summarizer uses several new strategies to select interesting and informative sentences,Expand
Sentence Fusion for Multidocument News Summarization
TLDR
This article introduces sentence fusion, a novel text-to-text generation technique for synthesizing common information across documents that moves the summarization field from the use of purely extractive methods to the generation of abstracts that contain sentences not found in any of the input documents and can synthesize information across sources. Expand
The use of MMR, diversity-based reranking for reordering documents and producing summaries
TLDR
This paper presents a method for combining query-relevance with information-novelty in the context of text retrieval and summarization, and preliminary results indicate some benefits for MMR diversity ranking in document retrieval and in single document summarization. Expand
Summarization beyond sentence extraction: A probabilistic approach to sentence compression
TLDR
This paper focuses on sentence compression, a simpler version of this larger challenge, and aims to achieve two goals simultaneously: the authors' compressions should be grammatical, and they should retain the most important pieces of information. Expand
Bayesian Multi-Document Summarization at MSE
We describe our entry into the Multilingual Summarization Evaluation (MSE) competition for evaluating generic multidocument summarization systems, where documents are drawn both from English data andExpand
Evaluation Challenges in Large-Scale Document Summarization
TLDR
A large-scale meta evaluation of eight evaluation measures for both single-document and multi-document summarizers is presented, showing the strengths and draw-backs of all evaluation methods and how they rank the different summarizers. Expand
Left-Brain / Right-Brain Multi-Document Summarization
Since we began participating in DUC in 2001, our summarizer has been based on an HMM (Hidden Markov Model) for sentence selection within a document and a pivoted QR algorithm to generate aExpand
A trainable document summarizer
TLDR
The trends in the results are in agreement with those of Edmundson who used a subjectively weighted combination of features as opposed to training the feature weights using a corpus, which suggests that even shorter extracts may be useful indicative summmies. Expand
...
1
2
3
4
...