• Corpus ID: 235795173

Self-service Data Classification Using Interactive Visualization and Interpretable Machine Learning

@article{Wagle2021SelfserviceDC,
  title={Self-service Data Classification Using Interactive Visualization and Interpretable Machine Learning},
  author={Sridevi Narayana Wagle and Boris Kovalerchuk},
  journal={ArXiv},
  year={2021},
  volume={abs/2107.04971}
}
Machine learning algorithms often produce models considered as complex black-box models by both end users and developers. Such algorithms fail to explain the model in terms of the domain they are designed for. The proposed Iterative Visual Logical Classifier (IVLC) is an interpretable machine learning algorithm that allows end users to design a model and classify data with more confidence and without having to compromise on the accuracy. Such technique is especially helpful when dealing with… 

References

SHOWING 1-10 OF 23 REFERENCES

Reversible Data Visualization to Support Machine Learning

This paper shows that the proposed visual analytics approach enhances the ML classification models, and decreases the use of external and irrelevant-to-the-domain assumptions in the ML models.

Visual Knowledge Discovery and Machine Learning

This book vastly expands the class of reversible lossless 2-D and 3-D visualization methods which preserve the n-D information for the knowledge discovery, called the General Lines Coordinates (GLCs), which is accompanied by a set of algorithms for n- D data classification, clustering, dimension reduction, and Pareto optimization.

Interpretable Machine Learning with Boosting by Boolean Algorithm

  • Nathan NeuhausBoris Kovalerchuk
  • Computer Science
    2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR)
  • 2019
A new interpretable algorithm RPPR is proposed, to bridge the gap between the prediction accuracy of interpretable and non-interpretable algorithms on these data, by boosting DCP via discovering properties of misclassified cases.

Knowledge discovery from database Using an integration of clustering and classification

The results of the experiment show that integration of clustering and classification gives promising results with utmost accuracy rate and robustness even when the data set is containing missing values.

Enhancement of Cross Validation Using Hybrid Visual and Analytical Means with Shannon Function

This paper is improving the cross validation approach using the combined visual and analytical means in a hybrid setting and involves the adaptation of the Shannon function to obtain the worst case error estimate.

Auto-sklearn: Efficient and Robust Automated Machine Learning

A robust new AutoML system based on the Python machine learning package scikit-learn, which improves on existing AutoML methods by automatically taking into account past performance on similar datasets, and by constructing ensembles from the models evaluated during the optimization.

Breast Cancer Diagnosis on Three Different Datasets Using Multi-Classifiers

A comparison among the different classifiers decision tree (J48), Multi-Layer Perception (MLP), Naive Bayes (NB), Sequential Minimal Optimization (SMO), and Instance Based for K-Nearest neighbor (IBK) on three different databases of breast cancer by using classification accuracy and confusion matrix based on 10-fold cross validation method.

Automated Machine Learning: Methods, Systems, Challenges

This open access book presents the first comprehensive overview of general methods in Automatic Machine Learning (AutoML), collects descriptions of existing systems based on these methods, and

Human Interface and the Management of Information. Interaction, Visualization, and Analytics

  • H. Mori
  • Computer Science
    Lecture Notes in Computer Science
  • 2018
This work wants to show that dynamic diagrams, whose content is modified and adapted in realtime by monitoring developer’s actions can be of great benefit as their contents are perfectly suited to the developer current task.

An Empirical Comparison of Data Mining Classification Methods

A comparative study of the performance of C4.5, Naive Bayes, SVM and KNN Classification Algorithms is performed.