Highly Influenced

# Addressing data complexity for imbalanced data sets: analysis of SMOTE-based oversampling and evolutionary undersampling

@article{Luengo2011AddressingDC, title={Addressing data complexity for imbalanced data sets: analysis of SMOTE-based oversampling and evolutionary undersampling}, author={Juli{\'a}n Luengo and Alberto Fern{\'a}ndez and Salvador Garc{\'i}a and Francisco Herrera}, journal={Soft Comput.}, year={2011}, volume={15}, pages={1909-1936} }

- Published 2011 in Soft Comput.
DOI:10.1007/s00500-010-0625-8

In the classification framework there are problems in which the number of examples per class is not equitably distributed, formerly known as imbalanced data sets. This situation is a handicap when trying to identify the minority classes, as the learning algorithms are not usually adapted to such characteristics. An usual approach to deal with the problem of imbalanced data sets is the use of a preprocessing step. In this paper we analyze the usefulness of the data complexity measures in order… CONTINUE READING

Highly Cited

This paper has 88 citations. REVIEW CITATIONS

#### From This Paper

##### Figures, tables, and topics from this paper.

41 Citations

47 References

Similar Papers

#### Citations

##### Publications citing this paper.

#### Citation Statistics

#### 88 Citations

Citations per Year

Semantic Scholar estimates that this publication has

**88**citations based on the available data.See our **FAQ** for additional information.