Jupyter: Thinking and Storytelling With Code and Data

@article{Granger2021JupyterTA,
  title={Jupyter: Thinking and Storytelling With Code and Data},
  author={Brian E. Granger and Fernando P{\'e}rez},
  journal={Computing in Science \& Engineering},
  year={2021},
  volume={23},
  pages={7--14}
}
Project Jupyter is an open-source project for interactive computing widely used in data science, machine learning, and scientific computing. We argue that even though Jupyter helps users perform complex, technical work, Jupyter itself solves problems that are fundamentally human in nature. Namely, Jupyter helps humans to think and tell stories with code and data. We illustrate this by describing three dimensions of Jupyter: 1) interactive computing; 2) computational narratives; and 3) the idea… 

Subjective data models in bioinformatics: Do wet-lab and computational biologists comprehend data differently?

TLDR
Examination of how people differ in how they conceptualise the same data suggests that software engineers should design interfaces for task performance and emulate other related popular user interfaces, rather than targeting a person’s professional background.

Epistemic Programming - An insight-driven programming concept for Data Science

TLDR
The goal of this dissertation is to evaluate how far this programming concept changes students’ and teachers’ attitudes towards programming and how it enables students to actively gain insights about their own environment.

Computational reproducibility of Jupyter notebooks from biomedical publications

TLDR
Analyzing the computational reproducibility of 9625 Jupyter notebooks from 1117 GitHub repositories associated with 1419 publications indexed in the biomedical literature repository PubMed Central finds that 396 notebooks ran through without any errors, including 245 that produced results identical to those reported in the original.

A Framework to capture and reproduce the Absolute State of Jupyter Notebooks

TLDR
This paper uses Jupyter’s standard extension mechanisms to create an archivable system state for a running notebook, and shows that these additional mechanisms, which involve interacting with the underlying Linux kernel, do not introduce substantial execution-time overhead.

NoteG: A Computational Notebook to Facilitate Rapid Game Prototyping

TLDR
This paper proposes NoteG, a computational notebook for rapid game prototyping, and evaluates the tool with 18 novice game developers through a questionnaire-based user survey; participants found it easy to use and reported that it saves time.

VegaFusion: Automatic Server-Side Scaling for Interactive Vega Visualizations

TLDR
The Vega grammar has been broadly adopted by a growing ecosystem of browser-based visualization tools, but the reference Vega renderer does not scale well to large datasets because it requires the entire dataset to be loaded into browser memory.

Modernizing computing by structural biologists with Jupyter and Colab

B. Mooers, Proceedings of the 20th Python in Science Conference, 2021 (Biology)

TLDR
This work attempts to support the reproducibility of protein structure determination by using Jupyter notebooks to document the decisions made, the code, and selected output of this computational work.

Towards a Change Taxonomy for Machine Learning Systems

Machine Learning (ML) academic publications commonly provide open-source implementations on GitHub, allowing their audience to replicate, validate, or even extend machine learning algorithms, data

Ubermag: Toward More Effective Micromagnetic Workflows

TLDR
A human-centered research environment called Ubermag is designed and developed that can be extended to drive other micromagnetic packages from the same environment, and the complete simulation workflow, including definition, execution, and data analysis of simulation runs, can be performed within the same notebook environment.

References

Jupyter Notebooks - a publishing format for reproducible computational workflows

TLDR
Jupyter notebooks, a document format for publishing code, results, and explanations in a form that is both readable and executable, are presented.

Hack weeks as a model for data science education and collaboration

TLDR
The concept of a hack week is presented as an effective model offering opportunities for networking and community building, education in state-of-the-art data science methods, and immersion in collaborative project work; hack weeks are found to be successful at cultivating collaboration and facilitating the exchange of knowledge.

Communities of practice: learning as a social system

You are a claims processor working for a large insurance company. You are good at what you do, but although you know where your paycheck comes from, the corporation mainly remains an abstraction for

Man-Computer Symbiosis

TLDR
Preliminary analyses indicate that the symbiotic partnership will perform intellectual operations much more effectively than man alone can perform them.

Literate Programming

TLDR
This anthology of essays from the inventor of literate programming also contains excerpts from the programs for TeX and METAFONT, and from CWEB, a system for literate programming in C and related languages.

The Neurology of Narrative

Narrative is the inescapable frame of human existence. Thinkers as diverse as Aristotle, Barthes, and Bruner have recognized the centrality of narrative in human cognition, but have scanted its

Deterministic nonperiodic flow

Finite systems of deterministic ordinary nonlinear differential equations may be designed to represent forced dissipative hydrodynamic flow. Solutions of these equations can be identified with

Narrative Theory and Function: Why Evolution Matters

“The neurology of narrative,” SubStance, vol. 30, no. 1/2, pp. 72–84, 2001.

“Literate programming,” Comput. J., vol. 27, no. 2, pp. 97–111, Feb. 1984.