A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets
@article{Fernndez2017APE, title={A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets}, author={Alberto Fern{\'a}ndez and Crist{\'o}bal Jos{\'e} Carmona and Mar{\'i}a Jos{\'e} del Jes{\'u}s and Francisco Herrera}, journal={International journal of neural systems}, year={2017}, volume={27 6}, pages={ 1750028 } }
Imbalanced classification is related to those problems that have an uneven distribution among classes. [] Key Method Selection of instances from all classes will address the imbalance itself by finding the most appropriate class distribution for the learning task, as well as possibly removing noise and difficult borderline examples.
39 Citations
Multi-criteria analysis involving Pareto-optimal misclassification tradeoffs on imbalanced datasets
- Computer Science2020 International Joint Conference on Neural Networks (IJCNN)
- 2020
This work takes into account the existing conflict among the learning losses of the classes, and uses a deterministic multi-objective optimization method, called MONISE, to create a set of solutions with diverse misclassification tradeoffs among the classes.
Combined weighted multi-objective optimizer for instance reduction in two-class imbalanced data problem
- Computer ScienceEng. Appl. Artif. Intell.
- 2020
A multiclass classification using one-versus-all approach with the differential partition sampling ensemble
- Computer ScienceEng. Appl. Artif. Intell.
- 2021
Instance selection based on boosting for instance-based learners
- Computer SciencePattern Recognit.
- 2019
Imbalanced Classification with Multiple Classes
- Computer Science
- 2018
Dealing with multi-class problems is a hard issue, which becomes more severe in the presence of imbalance, and most of the techniques proposed for binary imbalanced classification are not directly applicable for multiple classes.
A memetic approach for training set selection in imbalanced data sets
- Computer ScienceInt. J. Mach. Learn. Cybern.
- 2019
The best training samples are selected from data samples with the goal of improving the performance of classifier when dealing with imbalanced data and some heuristic methods are presented which use local information to give a proper view about whether removing or retaining each sample of training set.
Improving the combination of results in the ensembles of prototype selectors
- Computer ScienceNeural Networks
- 2019
Revisiting data complexity metrics based on morphology for overlap and imbalance: snapshot, new overlap number of balls metrics and singular problems prospect
- Computer ScienceKnowl. Inf. Syst.
- 2021
This research work focuses on revisiting complexity metrics based on data morphology based on ball coverage by classes, which provide both good estimates for class overlap, and great correlations with the classification performance.
Joint feature and instance selection using manifold data criteria: application to image classification
- Computer ScienceArtificial Intelligence Review
- 2020
This paper targets the joint feature and instance selection by adopting feature subset relevance and sparse modeling representative selection, and evaluates the performance of the proposed schemes using image classification where classifiers are the nearest neighbor classifier and support vector machine classifier.
Ensemble Learning via Multimodal Multiobjective Differential Evolution and Feature Selection
- Computer ScienceBIC-TA
- 2019
A novel ensemble method, which utilizes a multimodal multiobjective differential evolution (MMODE) algorithm to select feature subsets and optimize base classifiers parameters, is proposed and experimental results on several benchmark classification databases evidence that the proposed algorithm is valid.
References
SHOWING 1-10 OF 90 REFERENCES
A proposal for evolutionary fuzzy systems using feature weighting: Dealing with overlapping in imbalanced datasets
- Computer ScienceKnowl. Based Syst.
- 2015
EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling
- Computer SciencePattern Recognit.
- 2013
Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems
- Computer ScienceAppl. Soft Comput.
- 2009
A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches
- Computer ScienceIEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)
- 2012
A taxonomy for ensemble-based methods to address the class imbalance where each proposal can be categorized depending on the inner ensemble methodology in which it is based is proposed and a thorough empirical comparison is developed by the consideration of the most significant published approaches to show whether any of them makes a difference.
Empowering difficult classes with a similarity-based aggregation in multi-class classification problems
- Computer ScienceInf. Sci.
- 2014
On the importance of the validation technique for classification with imbalanced datasets: Addressing covariate shift when data is skewed
- Computer ScienceInf. Sci.
- 2014
Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches
- Computer ScienceKnowl. Based Syst.
- 2013
An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics
- Computer ScienceInf. Sci.
- 2013
On the k-NN performance in a challenging scenario of imbalance and overlapping
- Computer SciencePattern Analysis and Applications
- 2007
This local model is compared to other machine learning algorithms, attending to how their behaviour depends on a number of data complexity features (global imbalance, size of overlap region, and its local imbalance) and several conclusions useful for classifier design are inferred.
Evolutionary rule-based systems for imbalanced data sets
- Computer ScienceSoft Comput.
- 2009
This paper adapts and analyzes LCSs for challenging imbalanced data sets and establishes the bases for further studying the combination of re-sampling technique and learner best suited to a specific kind of problem.