A method for evaluating modern systems of automatic text summarization

@article{Yatsko2007AMF,
  title={A method for evaluating modern systems of automatic text summarization},
  author={Viatcheslav Yatsko and T. N. Vishnyakov},
  journal={Automatic Documentation and Mathematical Linguistics},
  year={2007},
  volume={41},
  pages={93-103}
}
Four modern systems of automatic text summarization are tested on the basis of a model vocabulary composed by subjects. Distribution of terms of the vocabulary in the source text is compared with their distribution in summaries of different length generated by the systems. Principles for evaluation of the efficiency of the current systems of automatic text summarization are described. 

Figures, Tables, and Topics from this paper

Multilingual Summarization Approaches
TLDR
The various state-of-the-art multilingual summarization approaches have been grouped based on their characteristics and presented in this chapter. Expand
Multilingual Summarization Evaluation without Human Models
TLDR
This work applies a new content-based evaluation framework called Fresa to compute a variety of divergences among probability distributions in text summarization tasks including generic and focus-based multi-document summarization in English and generic single-document summary in French and Spanish. Expand
Summary Evaluation with and without References
TLDR
A new content–based method for the evaluation of text summarization systems without human models which is used to produce system rankings is studied and a variety of divergences among probability distributions are computed. Expand
A Keyphrase-Based Approach to Text Summarization for English and Bengali Documents
  • K. Sarkar
  • Computer Science
  • Int. J. Technol. Diffusion
  • 2014
TLDR
A keyphrase based approach to single document summarization that extracts first a set of keyphrases from a document, use the extracted keyphRases to choose sentences from the document and finally form an extractive summary with the chosen sentences. Expand
Statistical Automatic Summarization in Organic Chemistry
TLDR
An oriented numerical summarizer algorithm is presented that combines a specific document pre-processing with a sentence scoring method relying on the statistical properties of documents that achieves the best results among several other summarizers on a corpus of Organic Chemistry articles. Expand
ON SOME GRAPHS CONNECTED WITH TEXTS IN A NATURAL LANGUAGE, LINK GRAMMAR AND THE SUMMARIZATION PROCESS
TLDR
The paper describes the generalization of the summarization algorithm of Niraj Kumar and considers the possibility of applying this algorithm to estimate the relevance of posts published in the Internet to the selected articles published before. Expand
An Efficient Statistical Approach for Automatic Organic Chemistry Summarization
TLDR
An efficient strategy for summarizing scientific documents in Organic Chemistry that concentrates on numerical treatments is proposed named yachs (Yet Another Chemistry Summarizer) that combines a specific document pre-processing with a sentence scoring method relying on the statistical properties of documents. Expand
Finding the Best Approach for Multi-lingual Text Summarisation: A Comparative Analysis
TLDR
The evaluation carried out employing the JRC corpus shows that the approach which uses languagespecific resources is the most appropriate in the comparison framework, performing better than state-of-the-art multi-lingual summarisers. Expand
ASHuR: Evaluation of the Relation Summary-Content Without Human Reference Using ROUGE
TLDR
This work proposes a method for automatic text summarization to evaluate a summary (ASHuR) based on extraction of sentences based on sentences extraction, and assesses made to ASHuR show reliability to evaluate summaries written by users in collaborative sites or to review texts generated by students in online learning systems. Expand
Towards automatic tweet generation: A comparative study from the text summarization perspective in the journalism genre
TLDR
It was observed that although the original tweets were considered as model tweets with respect to their informativeness, they were not among the most interesting ones from a human viewpoint, and relying only on these tweets may not be the ideal way to communicate news through Twitter, especially if a more personalized and catchy way of reporting news wants to be performed. Expand
...
1
2
3
4
...

References

SHOWING 1-9 OF 9 REFERENCES
Concept Identification and Presentation in the Context of Technical Text Summarization
TLDR
A method of text summarization that produces indicative-informative abstracts for technical papers that indicates good performance in both tasks when compared with other summarization technologies. Expand
Automatic Text Structuring and Summarization
TLDR
This study applies the ideas from the automatic link generation research to attack another important problem in text processing—automatic text summarization, and generates intra-document links between passages of a document. Expand
Evaluating Content Selection in Summarization: The Pyramid Method
TLDR
It is argued that the method presented is reliable, predictive and diagnostic, thus improves considerably over the shortcomings of the human evaluation method currently used in the Document Understanding Conference. Expand
New Methods in Automatic Extracting
TLDR
New methods of automatically extracting documents for screening purposes, i.e. the computer selection of sentences having the greatest potential for conveying to the reader the substance of the document, indicate that the three newly proposed components dominate the frequency component in the production of better extracts. Expand
Extractive summarization of meeting recordings
TLDR
This work investigates how the summarization results might deteriorate when carried out on ASR output as opposed to manual transcripts, and contrasts feature-based approaches using prosodic and lexical features with maximal marginal relevance and latent semantic analysis approaches to summarization. Expand
New Term Weighting Formulas for the Vector Space Method in Information Retrieval
TLDR
The vector space model, which models documents and queries as vectors in the term space, is used and several new schemes that offer improved performance are presented. Expand
Recalling and Summarizing Complex Discourse
In this paper we investigate the properties of complex semantic information processing involved in the comprehension, (re-)production and summarizing of longer narrative discourse. In the theoreticalExpand
Summarization Evaluation: An Overview
TLDR
The new variety is distinct with its large, thick, coriaceous leaves, rust colored tomentose new growth, stout stems, single, white flowers, improved cold hardiness, and increased resistance to leaf spot and fireblight. Expand