Corpus ID: 236635032

Developing Open Source Educational Resources for Machine Learning and Data Science

  title={Developing Open Source Educational Resources for Machine Learning and Data Science},
  author={Ludwig Bothmann and Sven Strickroth and Giuseppe Casalicchio and David Rugamer and Marius Thomas Lindauer and Fabian Scheipl and Bernd Bischl Department of Statistics and Ludwig-Maximilians-Universitat Munchen and Germany and Institute of Computer Science and Institute of Information Process and Leibniz-University Hannover},
Education should not be a privilege but a common good. It should be openly accessible to everyone, with as few barriers as possible; even more so for key technologies such as Machine Learning (ML) and Data Science (DS). Open Educational Resources (OER) are a crucial factor for greater educational equity. In this paper, we describe the specific requirements for OER in ML and DS and argue that it is especially important for these fields to make source files publicly available, leading to Open… Expand


OpenML: networked science in machine learning
This paper introduces OpenML, a place for machine learning researchers to share and organize data in fine detail, so that they can work more effectively, be more visible, and collaborate with others to tackle harder problems. Expand
mlr3: A modern object-oriented machine learning framework in R
The R (R Core Team, 2019) package mlr3 is a complete reimplementation of the mlr (Bischl et al., 2016) package that leverages many years of experience and learned best practices to provide a state-of-the-art system that is powerful, flexible, extensible, and maintainable. Expand
Scikit-learn: Machine Learning in Python
Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringingExpand
An Introduction to Machine Learning
  • M. Kubát
  • Computer Science
  • Springer International Publishing
  • 2017
This document discusses the basic concepts, models and formulations behind the problems of machine learning, influence maximization, and network diffusion, and analyzes the selected algorithms that were developed to address the above problems. Expand
Machine learning
Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Expand
PyTorch: An Imperative Style, High-Performance Deep Learning Library
This paper details the principles that drove the implementation of PyTorch and how they are reflected in its architecture, and explains how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance. Expand
The Gamification of Learning and Instruction: Game-Based Methods and Strategies for Training and Education
  • John W. Rice
  • Computer Science
  • Int. J. Gaming Comput. Mediat. Simulations
  • 2012
A simulation devoted to covering the steps involved in typical firefighter training known as REVAS, one of many examples covered by Karl Kapp (2012) in his new book, The gamification of learning and instruction: game-based methods and strategies for training and education. Expand
The Open Source Definition
This chapter discusses the open source definition. Open source doesn't just mean access to the source code. The distribution terms of open-source software must comply with the following criteria:Expand
A Study on Student Performance, Engagement, and Experience With Kaggle InClass data Challenges
Kaggle is a data modeling competition service, where participants compete to build a model with lower predictive error than other participants. Several years ago they released a simplified service ...
Building Predictive Models in R Using the caret Package
The caret package, short for classification and regression training, contains numerous tools for developing predictive models using the rich set of models available in R. The package focuses onExpand