Self-paced Ensemble for Highly Imbalanced Massive Data Classification

  title={Self-paced Ensemble for Highly Imbalanced Massive Data Classification},
  author={Zhining Liu and Wei Cao and Zhifeng Gao and Jiang Bian and Hechang Chen and Yi Chang and Tie-Yan Liu},
  journal={2020 IEEE 36th International Conference on Data Engineering (ICDE)},
  • Zhining Liu, Wei Cao, Zhifeng Gao, Jiang Bian, Hechang Chen, Yi Chang, Tie-Yan Liu
  • Published 2020
  • Computer Science, Mathematics
  • 2020 IEEE 36th International Conference on Data Engineering (ICDE)
Many real-world applications reveal difficulties in learning classifiers from imbalanced data. The big data era has brought more classification tasks with large-scale but extremely imbalanced and low-quality datasets. Most existing learning methods suffer from poor performance or low computational efficiency in such scenarios. To tackle this problem, we conduct deep investigations into the nature of class imbalance, which reveal that not only the disproportion between…
MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler
Multiple Balance Subsets Stacking for Imbalanced Healthcare Datasets
DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis
Influence of Optimizing XGBoost to handle Class Imbalance in Credit Card Fraud Detection
  • C. Priscilla, D. Prabha
  • Computer Science
  • 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT)
  • 2020
Summary of PAKDD CUP 2020: From Organizers’ Perspective
Semi-supervised Optimal Transport with Self-paced Ensemble for Cross-hospital Sepsis Early Detection
  • Ruiqing Ding, Yu Zhou, +7 authors Man Huang
  • Computer Science
  • 2021
Harmonization Centered Ensemble For Small And Highly Imbalanced Medical Data Classification


Diversity analysis on imbalanced data sets by using ensemble models
  • S. Wang, X. Yao
  • Computer Science
  • 2009 IEEE Symposium on Computational Intelligence and Data Mining
  • 2009
Exploratory Undersampling for Class-Imbalance Learning
SMOTEBoost: Improving Prediction of the Minority Class in Boosting
RUSBoost: A Hybrid Approach to Alleviating Class Imbalance
Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning
SMOTE: Synthetic Minority Over-sampling Technique
Learning with Class Skews and Small Disjuncts
An Empirical Study of the Behavior of Classifiers on Imbalanced and Overlapped Data Sets