Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 226,930,433 papers from all fields of science
Search
Sign In
Create Free Account
Data pre-processing
Known as:
Data preprocessing
Data pre-processing is an important step in the data mining process. The phrase "garbage in, garbage out" is particularly applicable to data mining…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
6 relations
Data mining
Data quality
Feature extraction
Feature selection
Expand
Broader (1)
Machine learning
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2013
Highly Cited
2013
Comparative Analysis of Data Mining Tools and Classification Techniques using WEKA in Medical Bioinformatics
S. K. David
,
Khalid Al Rubeaan
2013
Corpus ID: 2812586
The availability of huge amounts of data resulted in great need of data mining technique in order to generate useful knowledge…
Expand
Highly Cited
2013
Highly Cited
2013
Census Data Mining and Data Analysis using WEKA
S. Jagtap
,
S. Vivekanand
arXiv.org
2013
Corpus ID: 15324286
Data mining (also known as knowledge discovery from databases) is the process of extraction of hidden, previously unknown and…
Expand
Highly Cited
2012
Highly Cited
2012
A wavelet‐based damage diagnosis algorithm using principal component analysis
K. Kesavan
,
A. Kiremidjian
2012
Corpus ID: 17000797
The applicability of the Haar and Morlet wavelet transforms of the vibration signals for structural damage diagnosis was…
Expand
2010
2010
Determination of Bloom's cognitive level of question items using artificial neural network
N. Yusof
,
C. Hui
International Conference on Intelligent Systems…
2010
Corpus ID: 14279719
We propose a classification model for the cognitive level of question items in examinations based on Bloom's taxonomy. The model…
Expand
2010
2010
Data Mining of Mass Storage Based on Cloud Computing
Jianzong Wang
,
Ji-guang Wan
,
Zhuo Liu
,
Peng Wang
Ninth International Conference on Grid and Cloud…
2010
Corpus ID: 18269989
Cloud computing is an elastic computing model that the users can lease the resources from the rentable infrastructure. Cloud…
Expand
Highly Cited
2005
Highly Cited
2005
Orthogonal locality preserving indexing
Deng Cai
,
Xiaofei He
Annual International ACM SIGIR Conference on…
2005
Corpus ID: 1623769
We consider the problem of document indexing and representation. Recently, Locality Preserving Indexing (LPI) was proposed for…
Expand
Highly Cited
2001
Highly Cited
2001
Music Database Retrieval Based on Spectral Similarity
Cheng Yang
2001
Corpus ID: 1957130
We present an efficient algorithm to retrieve similar music pieces from an audio database. The algorithm tries to capture the…
Expand
Highly Cited
1999
Highly Cited
1999
RoboLog Koblenz: Spatial Agents Implemented in a Logical Expressible Language
Frieder Stolzenburg
,
Oliver Obst
,
Jan Murray
,
B. Bremer
1999
Corpus ID: 8133630
In this paper, we present a multi-layered architecture for spatial and temporal agents. The focus is laid on the declarativity of…
Expand
Highly Cited
1989
Highly Cited
1989
Object recognition based on graph matching implemented by a Hopfield-style neural network
W. Li
,
N. Nasrabadi
International Joint Conference on Neural…
1989
Corpus ID: 9560295
A model-based object recognition technique is presented. For each model, distinct features such as curvature points are extracted…
Expand
Highly Cited
1976
Highly Cited
1976
The Automatic Recognition of Human Faces from Profile Silhouettes
Gerald J. Kaufman
,
K. J. Breeding
IEEE Transactions on Systems, Man and Cybernetics
1976
Corpus ID: 22606575
A pattern recognition system is described which is capable of identifying human faces from their full profile silhouettes. Each…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE