Mixed method for extraction of domain terminology from text: Linguistic and statistical filtering

Abstract

Extraction of identifier terminology from a specific domain is an indispensable task in extracting information from text, In this work we propose a hybrid method of extracting complex terms from Arabic texts which combines between linguistic and statistical approach, which focuses on a linguistic and morph syntactic analysis of the Arabic language deep to introduce an linguistic filtering algorithm of complex terms.

DOI: 10.1109/CIST.2014.7016634

Cite this paper

@article{Lamrani2014MixedMF, title={Mixed method for extraction of domain terminology from text: Linguistic and statistical filtering}, author={El Khadir Lamrani and El Habib Ben Lahmar and Abdelaziz Marzak and Hammad Ballaoui}, journal={2014 Third IEEE International Colloquium in Information Science and Technology (CIST)}, year={2014}, pages={291-295} }