# Generating data sets for teaching the importance of regression analysis

```@article{Murray2021GeneratingDS,
title={Generating data sets for teaching the importance of regression analysis},
author={Lori L. Murray and John G. Wilson},
journal={Decision Sciences Journal of Innovative Education},
year={2021}
}```
• Published 31 March 2021
• Computer Science
• Decision Sciences Journal of Innovative Education
2 Citations
Inverse Sampling of Degenerate Datasets from a Linear Regression Line
The present study characterizes the famous Anscombe datasets and provides a general algorithm for creating multiple paired datasets of identical statistical properties.
Decision making in the classroom; when mathematics teaching and statistical reasoning meet each other
• Education
• 2021
Zero factorial, defined to be one, is often counterintuitive to students but nonetheless an interesting concept to convey in a classroom environment. The challenge is to delineate the concept in a

## References

SHOWING 1-5 OF 5 REFERENCES
CLONING DATA: GENERATING DATASETS WITH EXACTLY THE SAME MULTIPLE LINEAR REGRESSION FIT
• Computer Science
• 2009
A simple computational procedure for generating ‘matching’ or ‘cloning’ datasets so that they have exactly the same fitted multiple linear regression equation, suggesting that ‘same fit’ procedures may provide a general and useful alternative to model‐based procedures, and have a wide range of applications.
Adding a dimension to Anscombe's quartet: Open source, 3-D data visualization
• Computer Science
ASIST
• 2013
The development and research goal of this work is to develop an accessible 3-D data tool that allows for a high level of control by the user, and facilitate future studies on the effectiveness and best use practices associated with3-D visualization.
Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing
• Computer Science
CHI
• 2017
This paper presents a novel method for generating identical datasets which are identical over a number of statistical properties yet produce dissimilar graphs, and allows for control over the graphical appearance of resulting output.