A Survey of Text Summarization Techniques

@inproceedings{Nenkova2012ASO,
  title={A Survey of Text Summarization Techniques},
  author={A. Nenkova and K. McKeown},
  booktitle={Mining Text Data},
  year={2012}
}
Numerous approaches for identifying important content for automatic text summarization have been developed to date. [...] Key Method These indicators are combined, very often using machine learning techniques, to score the importance of each sentence. Finally, a summary is produced by selecting sentences in a greedy approach, choosing the sentences that will go in the summary one by one, or globally optimizing the selection, choosing the best set of sentences to form a summary. In this chapter we give a broad…Expand
Multi-Document Summarization and Semantic Relatedness
TLDR
This thesis presents novel ways of measuring the similarity between sentences, based on neural word embeddings and sentiment analysis, and shows that combining multiple sentence similarity scores, by multiplicative aggregation, helps in the process of creating better extractive summaries. Expand
Content selection in multi-document summarization
TLDR
It is shown that a modular extractive summarizer using the estimates of word importance can generate summaries comparable to the state-of-the-art systems, and a new framework of system combination for multi-document summarization is presented. Expand
Recent advances in document summarization
TLDR
Significant contributions made in recent years are emphasized, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. Expand
An Assessment of Sentence Simplification Methods in Extractive Text Summarization
TLDR
Four methods are used to simplify a corpus of news articles in English: a rule-based method, an optimization method, a supervised deep learning model and an unsupervisedDeep learning model, which suggests sentence compression with restrictions and models learned from large parallel corpora tend to perform better and yield gains over summarization without prior simplification. Expand
A Deep Look into Extractive Text Summarization
TLDR
The experimented results show a satisfactory and understandable summary and summaries were found to be able to run efficiently and quickly, showing which are the most important text sentences according to the threshold selected by the user. Expand
Multi-document extractive text summarization: A comparative assessment on features
TLDR
Results of experiments showed that the implemented neural model tended to incorrectly infer sentences that were not considered salient by human annotators, however, for distinguishing between summary-worthy and summary-unworthy sentences, the fuzzy inference systems performed better than the utilized neural network. Expand
Advancement of Text Summarization Using Machine Learning and Deep Learning: A Review
TLDR
This paper incorporated machine learning aspects to summarize the large text using topic models and presented a comparison of techniques over numerous performance measures. Expand
A solution to extractive summarization based on document type and a new measure for sentence similarity
TLDR
A new text summarization solution which identifies the document type before perform the summarization, and the creation of a new sentence similarity measure based on lexical, syntactic and semantic evaluation to deal with meaning and word order problems are proposed. Expand
Marathi Extractive Text Summarizer Using Graph Based Model
  • V. V. Sarwadnya, S. Sonawane
  • Computer Science
  • 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA)
  • 2018
TLDR
This paper describes a multi document marathi extractive summarizer based on an extractive concept implemented on the studied models to provide the meaning of text in less words and sentences. Expand
Automatically Determining a Proper Length for Multi-Document Summarization: A Bayesian Nonparametric Approach
TLDR
A Bayesian nonparametric model for multidocument summarization is proposed in order to automatically determine the proper lengths of summaries and the ”reconstruction” of an original document can be reconstructed by a Bayesian framework which selects sentences to form a good summary. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 113 REFERENCES
Document concept lattice for text understanding and summarization
TLDR
It is argued that the quality of a summary can be evaluated based on how many concepts in the original document(s) that can be preserved after summarization, and an optimized selection of a set of distinct and salient local topics that lead to maximal coverage of concepts with the given number of sentences is proposed. Expand
Document Summarization Using Conditional Random Fields
TLDR
A Conditional Random Fields (CRF) based framework is presented to keep the merits of the above two kinds of approaches while avoiding their disadvantages and can take the outcomes of previous methods as features and seamlessly integrate them. Expand
Generic text summarization using relevance measure and latent semantic analysis
TLDR
This paper proposes two generic text summarization methods that create text summaries by ranking and extracting sentences from the original documents, and uses the latent semantic analysis technique to identify semantically important sentences, for summary creations. Expand
A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization
TLDR
The research shows that a frequency based summarizer can achieve performance comparable to that of state-of-the-art systems, but only with a good composition function; context sensitivity improves performance and significantly reduces repetition. Expand
Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion
TLDR
This paper details the design of a generic extractive summarization system, which ranked first out of 22 systems in terms of overall mean Pyramid score; and in the human evaluation of summary responsiveness to the topic, the system ranked third out of 35 systems. Expand
A Hybrid Hierarchical Model for Multi-Document Summarization
TLDR
This paper forms extractive summarization as a two step learning problem building a generative model for pattern discovery and a regression model for inference based on the lexical and structural characteristics of the sentences. Expand
The Automated Acquisition of Topic Signatures for Text Summarization
TLDR
A method for automatically training topic signatures-sets of related words, with associated weights, organized around head topics, is described and illustrated with signatures the authors created with 6,194 TREC collection texts over 4 selected topics. Expand
Multidocument summarization: An added value to clustering in interactive retrieval
TLDR
This article proposes in addition to the classification capacity of clustering techniques, the possibility of offering a indicative extract about the contents of several sources by means of multidocument summarization techniques. Expand
Automatic Text Structuring and Summarization
TLDR
This study applies the ideas from the automatic link generation research to attack another important problem in text processing—automatic text summarization, and generates intra-document links between passages of a document. Expand
Towards Multidocument Summarization by Reformulation: Progress and Prospects
TLDR
The evaluation of system components shows that learning over multiple extracted linguistic features is more effective than information retrieval approaches at identifying similar text units for summarization and that it is possible to generate a fluent summary that conveys similarities among documents even when full semantic interpretations of the input text are not available. Expand
...
1
2
3
4
5
...