A Generic Data Harmonization Process for Cross-linked Research and Network Interaction. Construction and Application for the Lung Cancer Phenotype Database of the German Center for Lung Research.

@article{Firnkorn2015AGD,
  title={A Generic Data Harmonization Process for Cross-linked Research and Network Interaction. Construction and Application for the Lung Cancer Phenotype Database of the German Center for Lung Research.},
  author={Daniel Firnkorn and Matthias Ganzinger and Thomas R. Muley and M Thomas and Petra Knaup},
  journal={Methods of information in medicine},
  year={2015},
  volume={54 5},
  pages={
          455-60
        }
}
OBJECTIVE Joint data analysis is a key requirement in medical research networks. [] Key MethodMETHODS We developed a spreadsheet-based solution as tool to support the harmonization process for lung cancer data and a data integration procedure based on Talend Open Studio. RESULTS The harmonization process consists of eight steps describing a systematic approach for defining and reviewing source data elements and standardizing common data elements. The steps for defining common data elements and harmonizing…
Integrating Heterogeneous Biomedical Data for Cancer Research: the CARPEM infrastructure
TLDR
This article identifies a set of scientific and technical principles needed to build a translational research platform compatible with ethical requirements, data protection and data-integration problems, and describes the solution adopted by the CARPEM cancer research program.
A Semi-Automated Term Harmonization Pipeline Applied to Pulmonary Arterial Hypertension Clinical Trials.
TLDR
A semi-automated harmonization pipeline was developed and applied for use with domain-expert annotators to resolve ambiguous term mappings using exact and fuzzy matching and dramatically reduced the burden of manual annotation.
Information management for enabling systems medicine
TLDR
A three-layer information technology (IT) architecture for systems medicine and a cyclic data management approach including a knowledge base that is dynamically updated by extract, transform, and load (ETL) procedures is suggested.
A Review of AI and Data Science Support for Cancer Management
TLDR
The main objective is to analyze the literature to identify open research challenges that a novel decision support system for cancer patients and clinicians will need to address, point to potential solutions, and provide a list of established best-practices to adopt.
A Review of Data Science Methods and Systems Used for Monitoring and Coaching Cancer Patients
TLDR
Development of modern decision support system for cancer needs to utilize best practices like the use of validated electronic questionnaires for quality of life assessment, adoption of appropriate information modeling standards supplemented by terminologies/ontologies, adherence to FAIR data principles, external validation, stratification of patients in subgroups for better predictive modeling, and adoption of formal behavior change theories.
Good Medicine and Good Healthcare Demand Good Information (Systems).
TLDR
This issue of MIM deals with a comparison of benchmarking initiatives in German-speaking countries, use of communication standards in telemonitoring scenarios, the estimation of national cancer incidence rates and modifications of parametric tests.

References

SHOWING 1-10 OF 28 REFERENCES
Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience
TLDR
This study emphasizes the requirement for standardized representation of clinical research data using existing metadata and terminology resources and provides simple techniques and software for data element mapping using experiences from the eMERGE Network.
Architecture of the Open-source Clinical Research Chart from Informatics for Integrating Biology and the Bedside
TLDR
The Informatics for Integrating Biology and the Bedside (i2b2) is a set of software modules called "cells" that have a common messaging protocol that allow them to interact using web services and XML messages, and this architecture is found to be of high value.
Foundations of a Metadata Repository for Databases of Registers and Trials
TLDR
The Telematikplattform für Medizinische Forschungsnetze, an umbrella organization for medical research in Germany, aims at supporting and improving this process with a metadata repository, covering the variables and value lists used in databases of registers and trials.
Characteristics Desired in Clinical Data Warehouse for Biomedical Research
TLDR
A CDW for research should include an honest broker system and an Institutional Review Board approval interface to comply with governmental regulations and be a biomedical research platform for data repository use as well as data analysis.
Unlocking Data for Clinical Research - The German i2b2 Experience.
TLDR
i2b2 is a viable platform for data query tasks in use cases typical for networked medical research in Germany and the integration of privacy enhancing tools facilitates the use of i2B2 within established data protection concepts.
Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies
TLDR
The origins, purpose and scientific foundations of the DataSHaPER are described, and the two primary components, the ‘DataSchema’ and ‘Harmonization Platforms’, together support the preparation of effective data-collection protocols and provide a central reference to facilitate harmonization.
Toward Semantic Interoperability of Electronic Health Records
TLDR
This paper presents a proposal that smoothes out structural differences between heterogeneous EHR representations, allowing proper alignment of information, and includes a canonical ontology whose EHR-related terms focus on semantic aspects.
Data integration flows for business intelligence
TLDR
The requirements for data integration flows in this next generation of operational BI system are described, the limitations of current technologies, the research challenges in meeting these requirements, and a framework for addressing these challenges are described.
...
1
2
3
...