Generating data sets for teaching the importance of regression analysis

@article{Murray2021GeneratingDS,
  title={Generating data sets for teaching the importance of regression analysis},
  author={Lori L. Murray and John G. Wilson},
  journal={Decision Sciences Journal of Innovative Education},
  year={2021}
}
Inverse Sampling of Degenerate Datasets from a Linear Regression Line
TLDR
The present study characterizes the famous Anscombe datasets and provides a general algorithm for creating multiple paired datasets of identical statistical properties.
Decision making in the classroom; when mathematics teaching and statistical reasoning meet each other
Zero factorial, defined to be one, is often counterintuitive to students but nonetheless an interesting concept to convey in a classroom environment. The challenge is to delineate the concept in a

References

CLONING DATA: GENERATING DATASETS WITH EXACTLY THE SAME MULTIPLE LINEAR REGRESSION FIT
TLDR
A simple computational procedure for generating ‘matching’ or ‘cloning’ datasets so that they have exactly the same fitted multiple linear regression equation, suggesting that ‘same fit’ procedures may provide a general and useful alternative to model‐based procedures, and have a wide range of applications.
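As an illustration of the "same fit" idea summarized above, here is a minimal sketch for simple linear regression. It is not the procedure from the cited paper, only one standard construction, under these assumptions: a single predictor, more than two data points, and new residuals made by projecting random noise so that it is orthogonal to both the constant vector and the predictor. The function names `ols_fit` and `clone_with_same_fit` and the toy data are hypothetical.

```python
import math
import random


def ols_fit(x, y):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def clone_with_same_fit(x, y, seed=0):
    """Return new responses whose fitted line on x matches that of (x, y).

    Hypothetical helper (assumes len(x) > 2): draws random noise, removes
    its components along the constant vector and the centred predictor,
    rescales it to the original residual sum of squares, and adds it to
    the original fitted values.
    """
    rng = random.Random(seed)
    n = len(x)
    a, b = ols_fit(x, y)
    mean_x = sum(x) / n
    xc = [xi - mean_x for xi in x]
    sxx = sum(v * v for v in xc)

    # Random noise, centred so it is orthogonal to the constant vector.
    z = [rng.gauss(0.0, 1.0) for _ in range(n)]
    mean_z = sum(z) / n
    zc = [zi - mean_z for zi in z]

    # Remove the component along the centred predictor.
    coef = sum(zi * xi for zi, xi in zip(zc, xc)) / sxx
    e = [zi - coef * xi for zi, xi in zip(zc, xc)]

    # Rescale so the residual sum of squares equals the original one.
    rss = sum((yi - a - b * xi) ** 2 for xi, yi in zip(x, y))
    scale = math.sqrt(rss / sum(ei * ei for ei in e))
    return [a + b * xi + scale * ei for xi, ei in zip(x, e)]


# Toy data, invented for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.2, 1.9, 3.2, 3.8, 5.1, 6.3]
y_new = clone_with_same_fit(x, y, seed=42)
print(ols_fit(x, y))
print(ols_fit(x, y_new))
```

Because the new residuals are orthogonal to the intercept and predictor columns, refitting on `y_new` returns the same intercept and slope up to floating-point rounding, while the individual points differ from the originals.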
Adding a dimension to Anscombe's quartet: Open source, 3-D data visualization
TLDR
The development and research goal of this work is to develop an accessible 3-D data tool that allows for a high level of control by the user, and facilitate future studies on the effectiveness and best use practices associated with 3-D visualization.
Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing
TLDR
This paper presents a novel method for generating datasets that are identical over a number of statistical properties yet produce dissimilar graphs, and allows for control over the graphical appearance of the resulting output.