Hack weeks as a model for data science education and collaboration

  • Daniela Huppenkothen, Anthony A. Arendt, David W. Hogg, Karthik Ram, J. Vanderplas, Ariel S. Rokem
  • Published 31 October 2017
  • Computer Science
  • Proceedings of the National Academy of Sciences of the United States of America, pp. 8872-8877
Significance: As scientific disciplines grapple with more datasets of rapidly increasing complexity and size, new approaches are urgently required to introduce new statistical and computational tools into research communities and improve the cross-disciplinary exchange of ideas. In this paper, we introduce a type of scientific workshop, called a hack week, which allows for fast dissemination of new methodologies into scientific communities and fosters exchange and collaboration within and…

Practitioners Teaching Data Science in Industry and Academia: Expectations, Workflows, and Challenges
Twenty data scientists who teach in settings ranging from small-group workshops to large online courses found that they must empathize with a diverse array of student backgrounds and expectations and face challenges involving authenticity versus abstraction in software setup, finding and curating pedagogically-relevant datasets, and acclimating students to live with uncertainty in data analysis.
New Collaboration for New Education: Libraries in the Moore-Sloan Data Science Environments
In 2014, the Gordon and Betty Moore Foundation and the Alfred P. Sloan Foundation partnered to invest $37.8 million across three US universities to build what they called Data Science Environments in
Building employability capabilities in data science students: An interdisciplinary, industry‐focused approach
In the contemporary workplace, data scientists who are capable of interdisciplinary collaboration are in high demand. Universities need to provide data science students with a plethora of learning
Ad hoc efforts for advancing data science education
It is suggested that the perceived benefits of ad hoc efforts go beyond developing technical skills and may provide continued benefit in conjunction with formal curricula, which warrants further investigation.
Bridging sustainability science, earth science, and data science through interdisciplinary education
Given the rapid emergence of data science techniques in the sustainability sciences and the societal importance of many of these applications, there is an urgent need to prepare future scientists to
Organizing online hackathons for newcomers to a scientific community – Lessons learned from two events
A hackathon format is developed that was successfully applied during in-person events for two years and was then forced to move to a virtual format by the global pandemic of 2020.
Ten simple rules for helping newcomers become contributors to open projects
The 10 rules laid out below are based on studies of such communities and on the authors’ experience as members, leaders, and observers and focus on small and medium-sized projects, i.e., ones that have a handful to a few hundred participants but may not (yet) have any formal legal standing, such as incorporation as a nonprofit.
Data Science Support at the Academic Library
Academic libraries can support campus data science needs through professional development of current staff, recruitment of new personnel with expertise in data-intensive domains, and strategic partnerships with units outside of the library.
Paths towards greater consensus building in experimental biology.
It is argued that a greater adoption of open science practices, with a particular focus on FAIR (Findable, Accessible, Interoperable, Re-usable) data and code, represents a much-needed paradigm shift towards improved transparency, cross-disciplinary integration, and consensus building to maximize the contributions of experimental biologists in addressing the impacts of environmental change on living organisms.


Data Carpentry: Workshops to Increase Data Literacy for Researchers
Data Carpentry focuses on data literacy in particular, with the objective of teaching researchers the skills to retrieve, view, manipulate, analyze and store their own and others' data in an open and reproducible way in order to extract knowledge from data.
Assessing the value of team science: a study comparing center- and investigator-initiated grants.
Open Source and Open Data Should Be Standard Practices.
  • J. Gezelter
  • Computer Science
  • The Journal of Physical Chemistry Letters
  • 2015
The practical advantages of sharing code and data are important, but there are now strong scientific reasons for making open source and open data the accepted norm; the chief reason is the growing sense that science has reached a reproducibility crisis.
Understanding and improving the culture of hackathons: Think global hack local
Think Global Hack Local (TGHL) is a non-competitive, community-based hackathon that connects non-profit organizations with student developers and believes that hackathons can become an environment that is more inclusive and fun for all.
Educating Future Scientists
Significant cultural changes are urgently needed if the burgeoning scientific opportunities in biology are to be tackled by a well-prepared cadre of young scientists from all disciplines.
Sprints, Hackathons and Codefests as community gluons in computational biology
This paper lays out how the events are organised, presents an overview of their achievements, and describes how to improve the quality of these meetings.
Science hackathons for developing interdisciplinary research and collaborations
Science hackathons can help academics, particularly those in the early stage of their careers, to build collaborations and write research proposals.
Brainhack: a collaborative workshop for the open neuroscience community
By introducing Brainhack to the wider neuroscience community, the authors hope to provide a unique conference format that promotes the features of collaborative, open science.
Bringing students into research by hacking global health
This essay is an evaluative case study reporting on the preparation, execution, and evaluation of a Global Health Hackathon as a teaching method piloted as part of the ‘Introduction to Global Health’