• Corpus ID: 119471455

HEP Software Foundation Community White Paper Working Group - Data Analysis and Interpretation

  title={HEP Software Foundation Community White Paper Working Group - Data Analysis and Interpretation},
  author={Lothar A.T. Bauerdick and Riccardo Maria Bianchi and Brian Paul Bockelman and Nuno Castro and Kyle Cranmer and Peter Elmer and Robert W. Gardner and Maria Girone and Oliver Gutsche and Benedikt Hegner and J. M. Hern'andez and Bo Jayatilaka and David Lange and Mark S. Neubauer and Daniel S. Katz and Lukasz Kreczko and James Letts and Shawn McKee and Christoph Paus and Kevin Pedro and James Pivarski and Martin Ritter and Eduardo Rodrigues and T. Sakuma and Elizabeth Sexton-Kennedy and Michael D. Sokoloff and Carl Vuosalo and Frank Wurthwein and Gordon T. Watts},
At the heart of experimental high energy physics (HEP) is the development of facilities and instrumentation that provide sensitivity to new phenomena. Our understanding of nature at its most fundamental level is advanced through the analysis and interpretation of data from sophisticated detectors in HEP experiments. The goal of data analysis systems is to realize the maximum possible scientific potential of the data within the constraints of computing and human resources in the least time. To… 

Figures from this paper

Developing a Declarative Analysis Language: LINQToROOT

  • G. Watts
  • Computer Science
    EPJ Web of Conferences
  • 2019
Recent work on the LINQToROOT project, based on the Language Integrated Query system built into the cross-platform C# language, has had two goals: improving analysis efficiency and better understanding the requirements of a declarative analysis language.

Dream Machines

  • C. Quigg
  • Physics
    Reviews of Accelerator Science and Technology
  • 2019
Particle accelerators and their detectors are the world’s most powerful microscopes. They enable us to inspect the constituents of matter at attometer scales, study matter under unusual conditions,


Background: KwaZulu Natal has been a battlefield of political violence over the past few years in a democratic South Africa where many other provinces were no longer resolving their conflict through



Status Report of the DPHEP Study Group: Towards a Global Effort for Sustainable Data Preservation in High Energy Physics

An analysis of the research case for data preservation and a detailed description of the various projects at experiment, laboratory and international levels are provided and a concrete proposal for an international organisation in charge of the data management and policies in high-energy physics is provided.

A Roadmap for HEP Software and Computing R&D for the 2020s

This white paper describes the R&D activities required to prepare for this software upgrade of the HL-LHC.

High-Luminosity Large Hadron Collider (HL-LHC) : Preliminary Design Report

The present document describes the technologies and components that will be used to realise the High Luminosity LHC and is intended to serve as the basis for the detailed engineering design of HL-LHC.

RECAST — extending the impact of existing analyses

Searches for new physics by experimental collaborations represent a significant investment in time and resources. Often these searches are sensitive to a broader class of models than they were

Data Structures for Statistical Computing in Python

P pandas is a new library which aims to facilitate working with data sets common to finance, statistics, and other related fields and to provide a set of fundamental building blocks for implementing statistical models.

Blaze: Building A Foundation for Array-Oriented Computing in Python

We present the motivation and architecture of Blaze, a library for cross-backend data-oriented computation. Blaze provides a standard interface to connect users familiar with NumPy and Pandas to

ROOT: an object-oriented data analysis framework

The ROOT system in an Object Oriented framework for large scale data analysis contains an efficient hierarchical 00 database, a C ++ interpreter, and advanced statistical analysis (multi-dimensional histogramming, fitting, minimization, cluster finding algorithms) and visualization tools.

TensorFlow: A system for large-scale machine learning

The TensorFlow dataflow model is described and the compelling performance that Tensor Flow achieves for several real-world applications is demonstrated.

Introducing Parsl: A Python Parallel Scripting Library

Parl (Parallel Scripting Library), a Python library for programming and executing data-oriented workflows in parallel, addresses problems of complex orchestration and management of applications and data as well as customiza-tion for specific execution environments.

Thrill: High-performance algorithmic distributed batch data processing with C++

The design and a first performance evaluation of Thrill are presented — a prototype of a general purpose big data processing framework with a convenient data-flow style programming interface based on C++ which enables performance advantages due to direct native code compilation, a more cache-friendly memory layout, and explicit memory management.