When Does Non-Negative Matrix Factorization Give a Correct Decomposition into Parts?
Theoretical results are shown to be predictive of the performance of published NMF code, by running the published algorithms on one of the synthetic image articulation databases.
Multiscale Representations for Manifold-Valued Data
Multiscale representations for data observed on equispaced grids and taking values in manifolds such as the sphere, the special orthogonal group, the positive definite matrices, and the Grassmann manifolds, using the Exp and Log maps of those manifolds are described.
Enhancing reproducibility for computational methods
A novel set of Reproducibility Enhancement Principles (REP) targeting disclosure challenges involving computation is presented, which build upon more general proposals from the Transparency and Openness Promotion guidelines and emerged from workshop discussions among funding agencies, publishers and journal editors, industry participants, and researchers representing a broad range of domains.
The Scientific Method in Practice: Reproducibility in the Computational Sciences
It is found that code, data, and ideas are each regarded differently in terms of how they are revealed and that guidance from scientific norms varies with pervasiveness of computation in the field.
Breakdown Point of Model Selection When the Number of Variables Exceeds the Number of Observations
  • D. Donoho, V. Stodden
  • Mathematics, Computer Science
  • The IEEE International Joint Conference on…
  • 30 October 2006
This work points out that when p > n, there is a breakdown point for standard model selection schemes, such that model selection only works well below a certain critical complexity level depending on n/p, and applies this notion to some model selection algorithms (Forward Stepwise, LASSO, LARS) in the case where p > n.
Computing Environments for Reproducibility: Capturing the "Whole Tale"
The Whole Tale project aims to address technical and institutional barriers by connecting computational, data-intensive research efforts with the larger research process--transforming the knowledge discovery and dissemination process into one where data products are united with research articles to create "living publications" or "tales".
Best Practices for Computational Science: Software Infrastructure and Environments for Reproducible and Extensible Research
A formalized set of best practice recommendations for computational scientists wishing to disseminate reproducible research, facilitate innovation by enabling data and code re-use, and enable broader communication of the output of digital scientific research is presented.
Reproducible Research in Computational Harmonic Analysis
The authors review their approach to reproducible computational research and how it has evolved over time, discussing the arguments for and against working reproducibly.
The Legal Framework for Reproducible Scientific Research: Licensing and Copyright
  • V. Stodden
  • Computer Science
  • Computing in Science & Engineering
  • 2009
The author proposes the reproducible research standard for scientific researchers to use for all components of their scholarship that should encourage reproducible scientific investigation through attribution, facilitate greater collaboration, and promote engagement of the larger community in scientific learning and discovery.
Realizing the potential of data science
Data science promises new insights, helping transform information into knowledge that can drive science and industry.