A Sampling Method Based on Gauss Kernel Learning and the Expanding Research

  title={A Sampling Method Based on Gauss Kernel Learning and the Expanding Research},
  author={Shunzhi Zhu and Kaibiao Lin and Zhi-qiang Zeng and Lizhao Liu and Wenxing Hong},
  journal={J. Comput.},
In this paper, the expansion of feature points of the linear scale space is transformed into the classification of multi-scale data set within the same scale, which belongs to the classification of scale invariant non-equilibrium .The paper presents a sample approach based on kernel learning to solve classification on imbalance dataset by Support Vector Machine (SVM). The method first preprocesses the data by oversampling the minority class in kernel space, and then the pre-images of the… 

Figures and Tables from this paper

A New Social Network Sampling Algorithm Based on Temperature Conduction Model

A new social network sampling algorithm based on the Temperature Conduction model is proposed that is able to effectively maintain the topological similarity between the sampled network and its original network.

Analysis of the Correlation between Football Education Environment and Students' Psychology Health Based on Gauss Characteristics

Campus football has become a core content of school physical education. Through football education, we can cultivate students' sound personality and promote students' all-round physical and mental

Voronoi Diagram Generation Algorithm based on Delaunay Triangulation

Theoretical analysis and experimental results show that the proposed algorithm based on Delaunay triangulation of randomly distributed points in the Euclidean plane is an efficient method of generating Voronoi diagram.



A Classification Method for Imbalance Data Set Based on Hybrid Strategy

A novel and effective classification method for imbalanced data sets by re-sample the imbalance data by using variable SOM clustering and cutting down the sampled data sets according to the K-NN rule to solve the problem of data confusion, which improves the generalization of SVM.

The pre-image problem in kernel methods

This paper addresses the problem of finding the pre-image of a feature vector in the feature space induced by a kernel and proposes a new method which directly finds the location of thePre-image based on distance constraints in thefeature space.

Cost-sensitive learning methods for imbalanced data

Two empirical methods that deal with class imbalance using both resampling and CSL are presented, one of which can reduce the misclassification costs, and the second can improve the classifier performance.

SMOTE: Synthetic Minority Over-sampling Technique

A combination of the method of oversampling the minority (abnormal) class and under-sampling the majority class can achieve better classifier performance (in ROC space) and a combination of these methods and the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy is evaluated.

Exploratory Undersampling for Class-Imbalance Learning

Experimental results show that the proposed EasyEnsemble and BalanceCascade algorithms have higher Area Under the ROC Curve, F-measure, and G-mean values than many existing class-imbalance learning methods.

Large margin cost-sensitive learning of conditional random fields

Applying Support Vector Machines to Imbalanced Datasets

An algorithm is proposed based on a variant of the SMOTE algorithm by Chawla et al, combined with Veropoulos et al's different error costs algorithm for overcoming problems of imbalanced datasets in which negative instances heavily outnumber the positive instances.

Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles

The integrated sampling technique combines both over-sampling and undersampling techniques and outperforms individual SVMs as well as several other state-of-the-art classifiers to improve the prediction performance.

Evolutionary rule-based systems for imbalanced data sets

This paper adapts and analyzes LCSs for challenging imbalanced data sets and establishes the bases for further studying the combination of re-sampling technique and learner best suited to a specific kind of problem.

Improving Academic Performance Prediction by Dealing with Class Imbalance

The purpose of this paper is to tackle the class imbalance for improving the prediction/classification results by over-sampling techniques as well as using cost-sensitive learning (CSL).