Corpus ID: 221397665

diproperm: An R Package for the DiProPerm Test

@article{Allmon2020dipropermAR,
  title={diproperm: An R Package for the DiProPerm Test},
  author={Andrew G. Allmon and J. S. Marron and Michael G. Hudgens},
  journal={ArXiv},
  year={2020},
  volume={abs/2009.00003}
}
High-dimensional low sample size (HDLSS) data sets emerge frequently in many biomedical applications. A common task for analyzing HDLSS data is to assign data to the correct class using a classifier. Classifiers which use two labels and a linear combination of features are known as binary linear classifiers. The direction-projection-permutation (DiProPerm) test was developed for testing the difference of two high-dimensional distributions induced by a binary linear classifier. This paper…
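The three steps named in the test (direction, projection, permutation) can be illustrated with a minimal Python sketch. This is not the `diproperm` R package's code or API. The function names, the choice of a mean-difference direction as the binary linear classifier, the projected mean difference as the test statistic, and the add-one p-value correction are all assumptions made for illustration.

```python
import random


def mean_vector(rows):
    """Coordinate-wise mean of a non-empty list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def projected_mean_difference(x, labels):
    """Direction and projection steps (hypothetical helper).

    Assumption: the linear classifier is the mean-difference direction,
    i.e. the unit vector pointing from the class -1 mean to the class +1 mean.
    """
    pos = [row for row, lab in zip(x, labels) if lab == 1]
    neg = [row for row, lab in zip(x, labels) if lab == -1]
    diff = [a - b for a, b in zip(mean_vector(pos), mean_vector(neg))]
    norm = sum(d * d for d in diff) ** 0.5
    if norm == 0.0:
        return 0.0  # identical class means: no separating direction
    direction = [d / norm for d in diff]
    # Project every observation onto the direction to get 1-D scores.
    scores = [sum(v * w for v, w in zip(row, direction)) for row in x]
    pos_scores = [s for s, lab in zip(scores, labels) if lab == 1]
    neg_scores = [s for s, lab in zip(scores, labels) if lab == -1]
    # Univariate statistic: difference of the projected class means.
    return sum(pos_scores) / len(pos_scores) - sum(neg_scores) / len(neg_scores)


def diproperm_sketch(x, labels, n_perm=200, seed=0):
    """Permutation step: refit the direction on shuffled labels each time."""
    rng = random.Random(seed)
    observed = projected_mean_difference(x, labels)
    shuffled = list(labels)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # keeps the class sizes, breaks the class link
        if projected_mean_difference(x, shuffled) >= observed:
            exceed += 1
    # Assumption: add-one correction so the p-value is never exactly zero.
    p_value = (exceed + 1) / (n_perm + 1)
    return observed, p_value
```

Labels are assumed to be coded as +1 and -1, and `x` is a list of equal-length feature vectors. The direction is re-estimated inside every permutation, so the null distribution reflects how much separation the classifier can find in high dimensions by chance alone.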
