Corpus ID: 16797862

A Data Mining Approach for secure Cloud using Enhanced Random Forest

  title={A Data Mining Approach for secure Cloud using Enhanced Random Forest},
  author={Shikha Pathania and Rajdeep Kaur},
Data mining is the process of extracting and analyzing the large datasets to find out various hidden relationship patterns and much other useful information. Random forest is an ensemble method which is widely used is application having large datasets because of its interesting features like handling imbalanced data, identifying variable importance and detecting error rate. For building random forest randomness is established in two ways: Firstly by creating samples from original datasets… Expand

Figures and Tables from this paper


An Improvement to Feature Selection of Random Forests on Spark
This paper studied the problem of adopting the Random Forests algorithm to learn raw data from real usage scenario and proposed an improvement, which is stable, strict, high efficient, data-driven, problem independent and has no impact on algorithm performance. Expand
An Approach towards Optimizing Random Forest using Dynamic Programming Algorithm
It is shown that the problem of selection of optimal subset of random forest follows the dynamic programming paradigm and it is concluded that such subsets do exist and that they are not unique. Expand
Effective Learning and Classification using Random Forest Algorithm
An attempt is made to improve performance of Random Forest classifiers in terms of accuracy, and time required for learning and classification, to achieve this, five new approaches are proposed. Expand
Unbiased Feature Selection in Learning Random Forests for High-Dimensional Data
This work proposes a new RF algorithm, called xRF, to select good features in learning RFs for high-dimensional data, and shows that RFs with the proposed approach outperformed the existing random forests in increasing the accuracy and the AUC measures. Expand
Efficient Learning of Random Forest Classifier using Disjoint Partitioning Approach
Random Forest is an Ensemble Supervised Machine Learning technique. Research work in the area of Random Forest aims at either improving accuracy or improving performance. In this paper we areExpand
Random KNN feature selection - a fast and stable alternative to Random Forests
This work proposes RKNN-FS, an innovative feature selection procedure based on Random KNN (RKNN), a novel generalization of traditional nearest-neighbor modeling that is significantly more stable and more robust than Random Forests for feature selection when the input data are noisy and/or unbalanced. Expand
Random Forests of Very Fast Decision Trees on GPU for Mining Evolving Big Data Streams
This work presents a method for building Random Forests that use Very Fast Decision Trees for data streams on GPUs and shows how this method can benefit from the massive parallel architecture of GPUs, which are becoming an efficient hardware alternative to large clusters of computers. Expand
Research and implementation of the data mining algorithm based on cloud platform
The Hadoop platform technology and common data mining algorithm are studied and it is shown that combining traditional algorithm migration with the cloud platform for parallel improvement can effectively solve the problem of data mining. Expand
Improving Classification Accuracy based on Random Forest Model with Uncorrelated High Performing Trees
An attempt has been made to improve the performance of the model by including only uncorrelated high performing trees in a random forest to enhance the classification accuracy. Expand
Random Forest ( RF ) Machine Learning Algorithm to Detect Abnormal Behavior in Cloud – Based Mobile Services
-Cloud computing has become a realized computing phenomenon. In fact it is the new way of computing in which systems operate. Now mobile devices also can participate in cloud computing. MobileExpand