IGICA: A Hybrid Feature Selection Approach in Text Categorization

@article{Mojaveriyan2016IGICAAH,
  title={IGICA: A Hybrid Feature Selection Approach in Text Categorization},
  author={Mohammad Mojaveriyan and Hossein Ebrahimpour-Komleh and Seyed Jalaleddin Mousavirad},
  journal={International Journal of Intelligent Systems and Applications},
  year={2016},
  volume={8},
  pages={42-47}
}
Feature selection problem is one of the most important issues in machine learning and statistical pattern recognition. This problem is important in many applications such as text categorization because there are many redundant and irrelevant features in these applications which may reduce the classification performance. Indeed, feature selection is a method to select an appropriate subset of features for increasing the performance of learning algorithms. In the text categorization, there are… 

Tables from this paper

An Evolutionary Hybrid Feature Selection Approach for Biomedical Data Classification
TLDR
A hybrid algorithm according to simulated annealing (SA) and grey wolf optimizer (GWO) to be applied in feature selection for biomedical data and the results showed that the presented methods outperform their rivals.
Simulation of English text recognition model based on ant colony algorithm and genetic algorithm
TLDR
This paper combines ant colony algorithm and genetic algorithm to construct an English text recognition model based on machine learning, and based on the characteristics of ant colony intelligent algorithm optimization, a method of using ant colony algorithms to solve the central node of the core function.
Recognition of English information and semantic features based on SVM and machine learning
TLDR
This study analyzes the English information anaphora resolution based on SVM and machine learning algorithms and uses the CNN three-layer network as the basis to model the structure and shows that the performance of the system using the dual candidate model is better than that of the single candidate model system.
Test of English vocabulary recognition based on natural language processing and corpus system
  • Du Longjiang
  • Computer Science, Linguistics
    J. Intell. Fuzzy Syst.
  • 2021
TLDR
A multi-feature fusion adaptive Kernel-related filter tracking algorithm for the problems of kernel-related filtering algorithms, based on the KCF algorithm, which improves the algorithm from three parts: feature fusion, adaptive change of update rate, and scale detection.
Novel Multirole-Oriented Deep Learning Text Classification Model
  • Ting Luo
  • Computer Science
    Security and Communication Networks
  • 2022
TLDR
The research results show that the deep learning text classification model for multiple roles in novels proposed in this article has good effects on role analysis and text classification.
Intelligent English writing system based on fusion of herding effect and artificial intelligence
TLDR
An artificial intelligence-based English intelligent writing system, an improved algorithm for optimization of swarm particle walking paths, and a relative attractiveness to initialize the formation of small-scale groups based on the herd effect are constructed.
Computational Application on English Translation System Based on Intelligent Image Text Recognition
This work offers an enhanced intelligent picture text recognition algorithm based on the intelligent image text recognition method to increase the impact of English image text translation. Texture
Analysis of Poetry Style Based on Text Classification Algorithm
  • C. Wang
  • Computer Science
    Scientific Programming
  • 2022
TLDR
The effect of the poetry style analysis method based on the text classification algorithm proposed in this paper is very good, which meets the actual needs of poetry styleAnalysis.
A Novel Method for Discovering Process Based on the Network Analysis Approach in the Context of Social Commerce Systems
TLDR
The research objective is to present a new method for commercial process discovery that has not been considered before, based on network analysis methods, multi-layered networks (networks with heterogeneous relations), and attributed networks.
Simulation of English Word Order Sorting Based on Semionline Model and Artificial Intelligence
  • Zhang MengNan Li
  • Computer Science
    Computational intelligence and neuroscience
  • 2022
TLDR
To improve the word order ranking effect of English language retrieval, this paper combines a semionline model to construct an artificial intelligence ranking model for English word order based on a semIONline model and establishes a semisupervised ELM regression model.
...
...

References

SHOWING 1-10 OF 19 REFERENCES
Two-stage text feature selection method using fuzzy entropy measure and an t colony optimization
TLDR
A two-stage method for text feature selection using the k-nearest neighbor classifier on top 10 Retures-21578 categories is proposed and results show the efficiency of the proposed method.
Evaluation of Feature Selection Approaches for Urdu Text Categorization
Efficient feature selection is an important phase of designing an effective text categorization system. Various feature selection methods have been proposed for selecting dissimilar feature sets. It
Feature selection using modified imperialist competitive algorithm
TLDR
Results showed the features set selected by the imperialist competitive algorithm provide the better classification performance compared to the other methods.
Feature selection for text classification with Naïve Bayes
A Comparative Study on Feature Selection in Text Categorization
TLDR
This paper finds strong correlations between the DF IG and CHI values of a term and suggests that DF thresholding the simplest method with the lowest cost in computation can be reliably used instead of IG or CHI when the computation of these measures are too expensive.
Application of Imperialist Competitive Algorithm for Feature Selection: A Case Study on Bulk Rice Classification
TLDR
Results showed the feature set selected by the imperialist competition algorithm provide the better classification performance compared to that obtained by genetic algorithm technique.
An Introduction to Variable and Feature Selection
TLDR
The contributions of this special issue cover a wide range of aspects of variable selection: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.
Imperialist competitive algorithm: An algorithm for optimization inspired by imperialistic competition
TLDR
Applying the proposed algorithm for optimization inspired by the imperialistic competition to some of benchmark cost functions shows its ability in dealing with different types of optimization problems.
EhsanBasiri. "Text feature selection using ant colony optimization." Expert systems with applications
  • 2009
...
...