Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 233,382,666 papers from all fields of science
Search
Sign In
Create Free Account
Data pre-processing
Known as:
Data preprocessing
Data pre-processing is an important step in the data mining process. The phrase "garbage in, garbage out" is particularly applicable to data mining…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
6 relations
Data mining
Data quality
Feature extraction
Feature selection
Expand
Broader (1)
Machine learning
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2012
2012
An Efficient Algorithm for Data Cleaning of Log File using File Extensions
Surbhi Anand
,
R. Aggarwal
2012
Corpus ID: 18042082
Wide Web is a monolithic repository of web pages that provides the Internet users with heaps of information. With the growth in…
Expand
2011
2011
Implementation of Data Mining Techniques for Meteorological Data Analysis (A case study for Gaza Strip)
S. Kohail
,
A. El-Halees
2011
Corpus ID: 18695939
Meteorological data mining is a form of data mining concerned with finding hidden patterns inside largely available…
Expand
2011
2011
A Novel Technique for Web Log mining with Better Data Cleaning and Transaction Identification
J. Vellingiri
,
S. Pandian
2011
Corpus ID: 55240394
Problem statement: In the internet era web sites on the internet are useful source of information for almost every activity. So…
Expand
2009
2009
Research on Web Session Clustering
Chaofeng Li
Journal of Software
2009
Corpus ID: 46702063
The task of clustering web sessions is to group web sessions based on similarity and consists of maximizing the intra-group…
Expand
2008
2008
Pixel Level Fusion of Panchromatic and Multispectral Images Based on Correspondence Analysis
H. Cakir
,
S. Khorram
2008
Corpus ID: 13966737
A pixel level data fusion approach based on correspondence analysis (CA) is introduced for high spatial and spectral resolution…
Expand
2006
2006
A Multiple Neural Network System to Classify Solder Joints on Integrated Circuits
G. Acciani
,
G. Brunetti
,
G. Fornarelli
2006
Corpus ID: 16699730
The following paper introduces a diagnostic process to detect solder joint defects on Printed Circuit Boards assembled in Surface…
Expand
2006
2006
Modeling Meteorological Prediction Using Particle Swarm Optimization and Neural Network Ensemble
Jiansheng Wu
,
Long Jin
,
Mingzhe Liu
International Symposium on Neural Networks
2006
Corpus ID: 39544629
In this paper a novel optimization approach is presented. Network architecture and connection weights of neural networks (NN) are…
Expand
Highly Cited
2005
Highly Cited
2005
Categorization Rehab Duwairi Department of Computer Information Systems , Jordan University of Science and Technology , Jordan
R. Duwairi
2005
Corpus ID: 15302159
In this paper, we compare the performance of three classifiers for Arabic text categorization. In particular, the naïve Bayes, k…
Expand
2004
2004
Seafloor classification using echo-waveforms: a method employing hybrid neural network architecture
B. Chakraborty
,
V. Mahale
,
Carlyle de Sousa
,
P. Das
IEEE Geoscience and Remote Sensing Letters
2004
Corpus ID: 41117658
This letter presents seafloor classification study results of a hybrid artificial neural network architecture known as learning…
Expand
Highly Cited
2001
Highly Cited
2001
Music Database Retrieval Based on Spectral Similarity
Cheng Yang
2001
Corpus ID: 1957130
We present an efficient algorithm to retrieve similar music pieces from an audio database. The algorithm tries to capture the…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE