PSO-Based Feature Selection for Arabic Text Summarization

Abstract

Feature-based approaches play an important role in extractive summarization and are widely applied. In this paper, we use particle swarm optimization (PSO) to evaluate the effectiveness of different state-of-the-art features used to summarize Arabic text. The PSO is trained on the Essex Arabic Summaries Corpus to determine the best particle, that is, the single feature or combination of features that is most appropriate among eight informative and structural features regularly used by Arab summarizers. In each PSO iteration, the input text sentences are scored and ranked according to the selected features and their weights, and the top-ranking sentences are extracted to form the output summary. The output summary is then compared with a reference summary, with cosine similarity serving as the fitness function. The experimental results indicate that Arab summarizers summarize texts simply, focusing on the first sentence of each paragraph.
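The loop described in the abstract (weight the features, score and rank sentences, extract a summary, compare it with a reference by cosine similarity) can be illustrated with a minimal sketch. This is not the authors' implementation. The two toy features, the English placeholder sentences, the bag-of-words cosine similarity, the continuous weights clamped to [0, 1], and the PSO constants (inertia 0.7, acceleration 1.5) are all assumptions made for illustration; the paper itself uses eight features and Arabic text.

```python
import math
import random
from collections import Counter

def cosine_similarity(text_a, text_b):
    """Cosine similarity between bag-of-words count vectors (assumed representation)."""
    a, b = Counter(text_a.split()), Counter(text_b.split())
    dot = sum(a[w] * b[w] for w in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def summarize(sentences, features, weights, k):
    """Score sentences as a weighted feature sum; keep the top k in document order."""
    scores = [sum(w * f for w, f in zip(weights, feats)) for feats in features]
    top = sorted(sorted(range(len(sentences)), key=lambda i: -scores[i])[:k])
    return " ".join(sentences[i] for i in top)

def fitness(weights, sentences, features, reference, k):
    """Fitness of one particle: similarity of its summary to the reference summary."""
    return cosine_similarity(summarize(sentences, features, weights, k), reference)

def pso(sentences, features, reference, k, n_particles=10, n_iter=30, seed=0):
    """Plain global-best PSO over feature weights (hypothetical constants)."""
    rng = random.Random(seed)
    dim = len(features[0])
    pos = [[rng.random() for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p, sentences, features, reference, k) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (0.7 * vel[i][d]
                             + 1.5 * r1 * (pbest[i][d] - pos[i][d])
                             + 1.5 * r2 * (gbest[d] - pos[i][d]))
                # Clamp each weight to [0, 1]; a weight near 0 effectively drops the feature.
                pos[i][d] = min(1.0, max(0.0, pos[i][d] + vel[i][d]))
            fit = fitness(pos[i], sentences, features, reference, k)
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = pos[i][:], fit
    return gbest, gbest_fit

# Toy data (hypothetical): feature 0 = "first sentence of its paragraph",
# feature 1 = a made-up relative-length score.
sentences = ["the cat sat on the mat", "it was a sunny day",
             "the dog barked loudly", "nobody answered the door"]
features = [[1.0, 0.6], [0.0, 0.5], [1.0, 0.4], [0.0, 0.4]]
reference = "the cat sat on the mat the dog barked loudly"

best_weights, best_fit = pso(sentences, features, reference, k=2)
print(best_weights, best_fit)
```

In this toy setup the reference summary consists of the two paragraph-initial sentences, so any particle that gives enough weight to the first feature reproduces it. A binary PSO variant that switches features on and off would be an equally plausible reading of "feature selection"; the continuous form is used here only because it also yields the feature weights mentioned in the abstract.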

7 Figures and Tables

Cite this paper

@article{AlZahrani2015PSOBasedFS, title={PSO-Based Feature Selection for Arabic Text Summarization}, author={Ahmed M. Al-Zahrani and Hassan Mathkour and Hassan Ismail Abdalla}, journal={J. UCS}, year={2015}, volume={21}, pages={1454-1469} }