Dr.Aid: Supporting Data-governance Rule Compliance for Decentralized Collaboration in an Automated Way

Collaboration across institutional boundaries is widespread and increasing today. It depends on federations sharing data that often have governance rules or external regulations restricting their use. However, the handling of data governance rules (also known as data-use policies) remains manual, time-consuming and error-prone, limiting the rate at which collaborations can form and respond to challenges and opportunities, inhibiting citizen science and reducing data providers' trust in compliance. Using…


Towards a Computer-Interpretable Actionable Formal Model to Encode Data Governance Rules
  • Rui Zhao, M. Atkinson
  • Computer Science
  • 2019 15th International Conference on eScience (eScience)
  • 2019
It is argued that intelligent systems can improve the handling of data governance rules by recording provenance during processing, encoding the rules and reasoning over them, as a first step towards helping data providers and data users sustain productive relationships.
Comprehensible Control for Researchers and Developers Facing Data Challenges
This work reports on an architecture for establishing and sustaining the necessary optimised mappings, together with early evaluations of its feasibility with two application communities.
The FAIR Guiding Principles for scientific data management and stewardship
This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.
An Automated Negotiation Agent for Permission Management
This work introduces a novel agent-based approach to negotiate the permission to exchange private data between users and services and finds that the agent is able to effectively capture the preferences and negotiate on the user's behalf but, surprisingly, does not reduce user engagement with the system.
Abstract, link, publish, exploit: An end to end framework for workflow sharing
How do Data Science Workers Collaborate? Roles, Workflows, and Tools
It is found that data science teams are extremely collaborative and work with a variety of stakeholders and tools during the six common steps of a data science workflow (e.g., clean data and train model).
The DAta Protection REgulation COmpliance Model
A model of the GDPR is described that allows for semiautomatic processing of legal text and leverages state-of-the-art legal informatics approaches, which are useful for legal reasoning, software design, information retrieval, or compliance checking.
Technical and Policy Approaches to Balancing Patient Privacy and Data Sharing in Clinical and Translational Research
This work recounts a recent privacy-related concern, associated with the publication of aggregate statistics from pooled genome-wide association studies, that has had a significant impact on the data sharing policies of National Institutes of Health-sponsored databanks.
The MIMIC Code Repository: enabling reproducibility in critical care research
This work outlines the Medical Information Mart for Intensive Care (MIMIC) Code Repository, a centralized code base for generating reproducible studies on an openly available critical care dataset, which enables end-to-end reproducible analysis of electronic health records.
Thoth: Comprehensive Policy Compliance in Data Retrieval Systems
Thoth provides an efficient, kernel-level compliance layer for data use policies that tracks the flow of data through the system, and enforces policy regardless of bugs, misconfigurations, compromises in application code, or actions by unprivileged operators.