Corpus ID: 8718713

Learning to Classify Data Streams with Imbalanced Class Distributions

  title={Learning to Classify Data Streams with Imbalanced Class Distributions},
  author={Ryan Lichtenwalter and Nitesh V. Chawla},
Streaming data is pervasive in a multitude of data mining applications. One fundamental problem in the task of mining streaming data is distributional drift over time. Streams may also exhibit high and varying degrees of class imbalance, which can further complicate the task. In scenarios like these, class imbalance is particularly difficult to overcome and has not been as thoroughly studied. In this paper, we consider the issue of high class imbalacne in conjunction with data streams. We… CONTINUE READING

Figures and Tables from this paper.

RLS-A reduced labeled samples approach for streaming imbalanced data with concept drift
  • 10
  • Highly Influenced
An Efficient Ensemble Method for Classifying Skewed Data Streams
  • 5
  • Open Access
Learning Patterns from Imbalanced Evolving Data Streams
Two-layered ensemble Kohonen nets for imbalanced streaming data


Publications referenced by this paper.
Mining concept-drifting data streams using ensemble classifiers
  • 1,315
  • Open Access
SMOTE: Synthetic Minority Over-sampling Technique
  • 9,092
  • Open Access
A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions
  • 228
  • Highly Influential
  • Open Access
Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning
  • 1,296
  • Open Access
Mining time-changing data streams
  • 1,530
  • Open Access
Learning on the border: active learning in imbalanced data classification
  • 301
  • Open Access
Statistical Comparisons of Classifiers over Multiple Data Sets
  • 7,484
  • Open Access
Learning from Time-Changing Data with Adaptive Windowing
  • 783
  • Open Access
Learning in the Presence of Concept Drift and Hidden Contexts
  • 901
  • Open Access
The relationship between Precision-Recall and ROC curves
  • 3,137
  • Open Access