The Materials Data Facility: Data Services to Advance Materials Science Research

  title={The Materials Data Facility: Data Services to Advance Materials Science Research},
  author={Benjamin J. Blaiszik and Kyle Chard and Jim Pruyne and Rachana Ananthakrishnan and Steven Tuecke and I. Foster},
With increasingly strict data management requirements from funding agencies and institutions, expanding focus on the challenges of research replicability, and growing data sizes and heterogeneity, new data needs are emerging in the materials community. The materials data facility (MDF) operates two cloud-hosted services, data publication and data discovery, with features to promote open data sharing, self-service data publication and curation, and encourage data reuse, layered with powerful… 

An infrastructure with user-centered presentation data model for integrated management of materials data and services

This paper introduces an emerging architecture the Materials Genome Engineering Databases (MGED), which provides cloud-hosted services with features to simplify the process of collecting datasets from diverse data providers, unify data representation forms with user-centered presentation data model, and accelerate data discovery with advanced search capabilities.

Tracking materials science data lineage to manage millions of materials experiments and analyses

The Materials Experiment and Analysis Database (MEAD) is a database that contains raw data and metadata from millions of materials synthesis and characterization experiments, as well as the analysis and distillation of that data into property and performance metrics via software in an accompanying open source repository.

Serverless Workflows for Indexing Large Scientific Data

Xtract is described, a service capable of processing vast collections of scientific files and automatically extracting metadata from diverse file types and it is demonstrated that it can derive metadata from a 7 TB scientific data repository.

MatD^3^: A Database and Online Presentation Package for Research Data Supporting Materials Discovery, Design, and Dissemination

MatD3 is an open-source, dedicated database and web application framework designed to store, curate and disseminate experimental and theoretical materials data generated by individual research groups or research consortia.

Chapter 9 Materials Data Infrastructure and Materials Informatics

The materials science and engineering (MS&E) community identified its collective need for data infrastructure as early as the 1980s [1]. While interest around this topic has grown markedly in recent

NexusLIMS: A Laboratory Information Management System for Shared-Use Electron Microscopy Facilities

The NexusLIMS suite of tools requires minimal input and adjustments to user behavior, instead relying on existing organizational procedures and the collection of information from a multitude of sources to construct a complete picture and record of a research experiment.

Globus: Recent Enhancements and Future Plans

A powerful new authentication and authorization platform service, Globus Auth, addresses identity, credential, and delegation management needs encountered in research environments, and new REST APIs allow external and third-party services to leverage Globus data management, authentication, and authorization capabilities as a platform.

Evolution of a Materials Data Infrastructure

It is learned that the MDI is essential to eliminating the seams between experiment and computation by providing a means for them to connect effortlessly, and is becoming an enabler, allowing materials engineering to tie into a much broader model-based engineering enterprise for product design.

Materials graph ontology




Materials Data Science: Current Status and Future Outlook

The concept of process-structure-property (PSP) linkages is introduced and illustrated how the determination of PSPs is one of the main objectives of materials data science.

Commentary: The Materials Project: A materials genome approach to accelerating materials innovation

Accelerating the discovery of advanced materials is essential for human welfare and sustainable, clean energy. In this paper, we introduce the Materials Project (, a core

Efficient and Secure Transfer, Synchronization, and Sharing of Big Data

The authors describe the approaches taken by Globus to create standard data interfaces and common security models for performing these actions on large quantities of data, allowing users to access different types of cloud storage with the same ease with which they access local storage.

Strategy for Extensible, Evolving Terminology for the Materials Genome Initiative Efforts

A proposed rules-based approach is presented with initial examples from a growing corpus of materials terms in the NIST Materials Data Repository to establish a common, consistent, and evolving set of rules for creating or extending terminology as needed to describe materials data.

Digital Object Identifier (DOI) System

The Digital Object Identifier (DOI®) System is a managed system for persistent identification of content on digital networks. It can be used to identify physical, digital, or abstract entities. The

Materials Genome Initiative for Global Competitiveness

This Cabinet-level Council is the principal means within the executive branch to coordinate science and technology policy across the diverse entities that make up the federal research and development

Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD)

High-throughput density functional theory (HT DFT) is fast becoming a powerful tool for accelerating materials design and discovery by the amassing tens and even hundreds of thousands of DFT

Materials Genome Initiative

In June 2011, the White House set out an audacious goal for materials science: Cut in half the time it takes to get a newly designed material from the lab to the marketplace. Even the program’s