# David N. Reshef

Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations, both functional and not, and for functional relationships provides a score that roughly …
• ArXiv
• 2015
As data sets grow in dimensionality, non-parametric measures of dependence have seen increasing use in data exploration due to their ability to identify non-trivial relationships of all kinds. One common use of these tools is to test a null hypothesis of statistical independence on all variable pairs in a data set. However, because this approach attempts to …
• Journal of Machine Learning Research
• 2016
For high-dimensional data sets, it is common to evaluate a measure of dependence on every variable pair and retain the highest-scoring pairs for follow-up. If the statistic used systematically assigns higher scores to some relationship types (e.g., linear, exponential, etc.) over others, important relationships may be overlooked because of …
Background: During an influenza pandemic, a substantial proportion of transmission is thought to occur in households. We used data on influenza progression in individuals and their contacts collected by the City of Milwaukee Health Department (MHD) to study the transmission of pandemic influenza A/H1N1 virus in 362 households in Milwaukee, WI, and the …
• ArXiv
• 2015
In exploratory data analysis, we are often interested in identifying promising pairwise associations for further analysis while filtering out weaker, less interesting ones. This can be accomplished by computing a measure of dependence on all possible variable pairs and examining the highest-scoring pairs, provided the measure of dependence used assigns …
• Proceedings of the National Academy of Sciences…
• 2014
Although we appreciate Kinney and Atwal’s interest in equitability and maximal information coefficient (MIC), we believe they misrepresent our work. We highlight a few of our main objections below. Regarding our original paper (1), Kinney and Atwal (2) state “MIC is said to satisfy not just the heuristic notion of equitability, but also the mathematical …
Using data from the Gonococcal Isolate Surveillance Project, we studied changes in ciprofloxacin resistance in Neisseria gonorrhoeae isolates in the United States during 2002–2007. Compared with prevalence in heterosexual men, prevalence of ciprofloxacin-resistant N. gonorrhoeae infections showed a more pronounced increase in men who have sex with men …