Tracking Hackathon Code Creation and Reuse

  title={Tracking Hackathon Code Creation and Reuse},
  author={Ahmed Imam and Tapajit Dey},
  journal={2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)},
  • Ahmed Imam, Tapajit Dey
  • Published 18 March 2021
  • Computer Science
  • 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)
Background: Hackathons have become popular events for teams to collaborate on projects and develop software prototypes. Most existing research focuses on activities during an event with limited attention to the evolution of the code brought to or created during a hackathon. Aim: We aim to understand the evolution of hackathon-related code, specifically, how much hackathon teams rely on pre-existing code or how much new code they develop during a hackathon. Moreover, we aim to understand if and… 

Figures from this paper

One-off Events? An Empirical Study of Hackathon Code Creation and Reuse
The results of this study demonstrates hackathons are not always “one-off” events as the common knowledge dictates and it can serve as a starting point for further studies in this area.
Hackathons in computer science education: monitoring and evaluation of programming projects
This study aims to present the various ways to implementhackathons in the context of CS education, and presents a taxonomy of hackathons based on the experiences and observations from 2016 to the present.


World of Code: An Infrastructure for Mining the Universe of Open Source VCS Data
A very large and frequently updated collection of version control data for FLOSS projects named World of Code (WoC), which is capable of supporting trend evaluation, ecosystem measurement, and the determination of package usage, and is expected to spur investigation into global properties of OSS development leading to increased resiliency of the entire OSS ecosystem.
The Secret Life of Hackathon Code
These findings help to better understand code reuse as a phenomenon and the role of hackathons in this context and can serve as a starting point for further studies in this area.
World of Code: Enabling a Research Workflow for Mining and Analyzing the Universe of Open Source VCS data
World of Code is created to create a very large and frequently updated collection of version control data in the entire FLOSS ecosystems named World of Code (WoC), that can completely cross-reference authors, projects, commits, blobs, dependencies, and history of theFLOSS ecosystems and provide capabilities to efficiently correct, augment, query, and analyze that data.
What Do We Know About Hackathon Outcomes and How to Support Them? - A Systematic Literature Review
An overview of potential hackathon outcomes, design aspects and connections between them that have been addressed in prior work are provided thus providing an overview of the current state of the art of hackathon events.
How to organize a hackathon - A planning kit
A planning kit that is organized around 12 key decision that organizers need to make when preparing and running a hackathon, and the tradeoffs that drive decision-making is developed.
Replication package for
Representation of Developer Expertise in Open Source Software
The results suggest that the proposed embeddings in the Skill Space appear to satisfy the postulated topology and it is hoped that such representations may aid in the construction of signals that increase trust (and efficiency) of open source ecosystems at large and may aid investigations of other phenomena related to developer proficiency and learning.
A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits
This paper proposes a method that finds all author IDs belonging to a single developer in this entire dataset, and shares the list of all author ID that were found to have aliases, and uses a machine learning model to predict which of these potentially related IDs belong to the same developer.
An Exploratory Study of Bot Commits
It is observed that majority of bot commits involve single file modifications, and bots primarily work with data, configuration, and documentation files.
Detecting and Characterizing Bots that Commit Code
BIMAN is proposed, a systematic approach to detect bots using author names, commit messages, files modified by the commit, and projects associated with the commits to find an automated way of identifying bots and code committed by these bots.