Data pre-processing

Known as: Data preprocessing 
Data pre-processing is an important step in the data mining process. The phrase "garbage in, garbage out" is particularly applicable to data mining… (More)
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Review
2017
Review
2017
Data preprocessing and reduction have become essential techniques in current knowledge discovery scenarios, dominated by… (More)
  • figure 1
  • table 1
  • table 2
  • table 3
  • table 4
Is this relevant?
2015
2015
With the abundance of raw data generated from various sources, Big Data has become a preeminent approach in acquiring, processing… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
Is this relevant?
2010
2010
In the paper we present a new framework for improving classifiers learned from imbalanced data. This framework integrates the… (More)
  • table 1
  • table 2
  • table 3
  • table 4
  • table 5
Is this relevant?
2010
2010
The nearest neighbor (NN) classifier represents one of the most popular non-parametric classification approaches and has been… (More)
  • figure 1
  • figure 2
  • figure 3
  • table 1
  • table 2
Is this relevant?
Highly Cited
2006
Highly Cited
2006
KDD is a complex and demanding task. While a large number of methods has been established for numerous problems, many challenges… (More)
  • figure 1
  • figure 2
Is this relevant?
Highly Cited
2004
Highly Cited
2004
UNLABELLED The Weka machine learning workbench provides a general-purpose environment for automatic classification, regression… (More)
  • figure 1
Is this relevant?
2003
2003
UNLABELLED PreP is a versatile, powerful, standalone application that aims at pre-processing gene expression data. AVAILABILITY… (More)
  • table 1
  • table 2
Is this relevant?
Highly Cited
2003
Highly Cited
2003
ISSN: 0883-9514 (Print) 1087-6545 (Online) Journal homepage: http://www.tandfonline.com/loi/uaai20 Data preparation for data… (More)
Is this relevant?
Highly Cited
2002
Highly Cited
2002
We introduce a statistical model for microarray gene expression data that comprises data calibration, the quantification of… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
1999
Highly Cited
1999
The Ball-Pivoting Algorithm (BPA) computes a triangle mesh interpolating a given point cloud. Typically, the points are surface… (More)
  • figure 1
  • figure 3
  • figure 2
  • figure 4
  • figure 5
Is this relevant?