• Corpus ID: 239998343

Play to Grade: Testing Coding Games as Classifying Markov Decision Process

  title={Play to Grade: Testing Coding Games as Classifying Markov Decision Process},
  author={Allen Nie and Emma Brunskill and Chris Piech},
Contemporary coding education often presents students with the task of developing programs that have user interaction and complex dynamic systems, such as mouse based games. While pedagogically compelling, there are no contemporary autonomous methods for providing feedback. Notably, interactive programs are im-possible to grade by traditional unit tests. In this paper we formalize the challenge of providing feedback to interactive programs as a task of classifying Markov Decision Processes… 

Figures and Tables from this paper


Zero Shot Learning for Code Education: Rubric Sampling with Deep Learning Inference
A human-in-the-loop "rubric sampling" approach to tackle the "zero shot" feedback challenge is introduced and is able to provide autonomous feedback for the first students working on an introductory programming assignment with accuracy that substantially outperforms data-hungry algorithms and approaches human level fidelity.
Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning
Wuji is proposed, an on-the-fly game testing framework, which leverages evolutionary algorithms, DRL and multi-objective optimization to perform automatic game testing and demonstrates the effectiveness of Wuji in exploring space and detecting bugs.
Taking the Scenic Route: Automatic Exploration for Videogames
It is shown that off-the-shelf automatic exploration strategies can explore with an effectiveness comparable to human gameplay on the same timescale, and generic methods for quantifying exploration quality as a function of time are contributed.
Learning to Represent Student Knowledge on Programming Exercises Using Deep Learning
This work feeds the embedded program submission sequence into a recurrent neural network and train it on two tasks of predicting the student’s future performance, and learns nuanced representations of a student's knowledge, exposes patterns about a student�'s learning behavior, and reliably predicts future student performance.
Hint systems may negatively impact performance in educational games
The results suggest that traditional hint systems may not translate well into the educational game environment, highlighting the importance of studying student behavior to understand the impact of new interactive learning technologies.
Reliable Deep Grade Prediction with Uncertainty Estimation
Two types of Bayesian deep learning models for grade prediction under a course-specific framework are presented, based on the assumption that prior courses can provide students with knowledge for future courses so that grades of prior course can be used to predict grades in a future course.
Learning Program Embeddings to Propagate Feedback on Student Code
A neural network method is introduced to encode programs as a linear mapping from an embedded precondition space to an embedded postcondition space and an algorithm for feedback at scale is proposed using these linear maps as features.
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
This work formalizes human intervention for RL and shows how to reduce the human labor required by training a supervised learner to imitate the human's intervention decisions, and outlines extensions of the scheme that are necessary if the authors are to train model-free agents without a single catastrophe.
Automated Correction for Syntax Errors in Programming Assignments using Recurrent Neural Networks
A technique for providing feedback on syntax errors that uses Recurrent neural networks (RNNs) to model syntactically valid token sequences that can fix the error by either replacing or inserting the predicted token sequence at the error location.
Go-Explore: a New Approach for Hard-Exploration Problems
A new algorithm called Go-Explore, which exploits the following principles to remember previously visited states, solve simulated environments through any available means, and robustify via imitation learning, which results in a dramatic performance improvement on hard-exploration problems.