Coffea-casa: an analysis facility prototype

@article{Adamec2021CoffeacasaAA,
  title={Coffea-casa: an analysis facility prototype},
  author={Matous Adamec and Garhan Attebury and Kenneth Bloom and Brian Paul Bockelman and Carl Lundstedt and Oksana Shadura and John Thiltges},
  journal={ArXiv},
  year={2021},
  volume={abs/2103.01871}
}
Data analysis in HEP has often relied on batch systems and event loops; users are given a non-interactive interface to computing resources and consider data event-by-event. The “Coffea-casa” prototype analysis facility is an effort to provide users with alternate mechanisms to access computing resources and enable new programming paradigms. Instead of the command-line interface and asynchronous batch access, a notebook-based web interface and interactive computing is provided. Instead of… Expand

Figures from this paper

References

SHOWING 1-10 OF 56 REFERENCES
Kubernetes: Up and Running: Dive into the Future of Infrastructure
TLDR
This practical guide shows you how Kubernetes and container technology can help you achieve new levels of velocity, agility, reliability, and efficiency. Expand
INDIGO IAM for cms-Log in
  • [online] Available at: https://cms-auth.web.cern.ch [Accessed
  • 2021
Is GitOps the next big thing in DevOps? | Atlassian Git Tutorial. [online] Atlassian
  • Available at: https://www.atlassian.com/git/tutorials/gitops [Accessed
  • 2021
2020, July). A prototype U.S. CMS analysis facility. Presented at the PyHEP 2020 Workshop, Zenodo
  • 2020
Coffea - Columnar Object Framework For Effective Analysis
TLDR
This work will discuss the experience in implementing analysis of CMS data using the coffea framework, and a discussion of the user experience and future directions. Expand
Creating a content delivery network for general science on the internet backbone using XCaches
TLDR
In this project XRootD caches are placed on the internet backbone to create a content delivery network to increases CPU efficiency while decreasing network bandwidth use. Expand
ServiceX A Distributed, Caching, Columnar Data Delivery Service
TLDR
ServiceX is an experiment-agnostic service to enable on-demand data delivery specifically tailored for nearly-interactive vectorized analysis, motivated by the data engineering challenges posed by HL-LHC data volumes and the increasing popularity of python and Spark-based analysis workflows. Expand
SkyhookDM: Data Processing in Ceph with Programmable Storage
TLDR
Computational storage approaches address the problem of both data reduction nearest the source as well as offloading some processing to the storage layer, a common principle in big data systems. Expand
bbockelm/xrdcl-authz-plugin. [online] GitHub
  • Available at: https://github.com/bbockelm/xrdcl-authz-plugin [Accessed
  • 2020
  • JHEP
  • 2019
...
1
2
3
4
5
...