A Generic Data Harmonization Process for Cross-linked Research and Network Interaction. Construction and Application for the Lung Cancer Phenotype Database of the German Center for Lung Research.
@article{Firnkorn2015AGD,
title={A Generic Data Harmonization Process for Cross-linked Research and Network Interaction. Construction and Application for the Lung Cancer Phenotype Database of the German Center for Lung Research.},
author={Daniel Firnkorn and Matthias Ganzinger and Thomas R. Muley and M Thomas and Petra Knaup},
journal={Methods of information in medicine},
year={2015},
volume={54 5},
pages={
455-60
}
}OBJECTIVE
Joint data analysis is a key requirement in medical research networks. [] Key MethodMETHODS
We developed a spreadsheet-based solution as tool to support the harmonization process for lung cancer data and a data integration procedure based on Talend Open Studio.
RESULTS
The harmonization process consists of eight steps describing a systematic approach for defining and reviewing source data elements and standardizing common data elements. The steps for defining common data elements and harmonizing…
8 Citations
Integrating Heterogeneous Biomedical Data for Cancer Research: the CARPEM infrastructure
- Computer ScienceAppl. Clin. Inform.
- 2016
This article identifies a set of scientific and technical principles needed to build a translational research platform compatible with ethical requirements, data protection and data-integration problems, and describes the solution adopted by the CARPEM cancer research program.
A Semi-Automated Term Harmonization Pipeline Applied to Pulmonary Arterial Hypertension Clinical Trials.
- Computer ScienceMethods of information in medicine
- 2021
A semi-automated harmonization pipeline was developed and applied for use with domain-expert annotators to resolve ambiguous term mappings using exact and fuzzy matching and dramatically reduced the burden of manual annotation.
Information management for enabling systems medicine
- Computer Science, Medicine
- 2017
A three-layer information technology (IT) architecture for systems medicine and a cyclic data management approach including a knowledge base that is dynamically updated by extract, transform, and load (ETL) procedures is suggested.
A review of AI and Data Science support for cancer management
- MedicineArtif. Intell. Medicine
- 2021
A Review of AI and Data Science Support for Cancer Management
- Computer Science
- 2020
The main objective is to analyze the literature to identify open research challenges that a novel decision support system for cancer patients and clinicians will need to address, point to potential solutions, and provide a list of established best-practices to adopt.
A Review of Data Science Methods and Systems Used for Monitoring and Coaching Cancer Patients
- MedicinemedRxiv
- 2020
Development of modern decision support system for cancer needs to utilize best practices like the use of validated electronic questionnaires for quality of life assessment, adoption of appropriate information modeling standards supplemented by terminologies/ontologies, adherence to FAIR data principles, external validation, stratification of patients in subgroups for better predictive modeling, and adoption of formal behavior change theories.
Good Medicine and Good Healthcare Demand Good Information (Systems).
- MedicineMethods of information in medicine
- 2015
This issue of MIM deals with a comparison of benchmarking initiatives in German-speaking countries, use of communication standards in telemonitoring scenarios, the estimation of national cancer incidence rates and modifications of parametric tests.
References
SHOWING 1-10 OF 28 REFERENCES
Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience
- MedicineJ. Am. Medical Informatics Assoc.
- 2011
This study emphasizes the requirement for standardized representation of clinical research data using existing metadata and terminology resources and provides simple techniques and software for data element mapping using experiences from the eMERGE Network.
Architecture of the Open-source Clinical Research Chart from Informatics for Integrating Biology and the Bedside
- Computer ScienceAMIA
- 2007
The Informatics for Integrating Biology and the Bedside (i2b2) is a set of software modules called "cells" that have a common messaging protocol that allow them to interact using web services and XML messages, and this architecture is found to be of high value.
Foundations of a Metadata Repository for Databases of Registers and Trials
- Computer ScienceMIE
- 2009
The Telematikplattform für Medizinische Forschungsnetze, an umbrella organization for medical research in Germany, aims at supporting and improving this process with a metadata repository, covering the variables and value lists used in databases of registers and trials.
Characteristics Desired in Clinical Data Warehouse for Biomedical Research
- MedicineHealthcare informatics research
- 2014
A CDW for research should include an honest broker system and an Institutional Review Board approval interface to comply with governmental regulations and be a biomedical research platform for data repository use as well as data analysis.
Unlocking Data for Clinical Research - The German i2b2 Experience.
- Computer Science, MedicineApplied clinical informatics
- 2011
i2b2 is a viable platform for data query tasks in use cases typical for networked medical research in Germany and the integration of privacy enhancing tools facilitates the use of i2B2 within established data protection concepts.
Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies
- MedicineInternational journal of epidemiology
- 2010
The origins, purpose and scientific foundations of the DataSHaPER are described, and the two primary components, the ‘DataSchema’ and ‘Harmonization Platforms’, together support the preparation of effective data-collection protocols and provide a central reference to facilitate harmonization.
Development of a clinical data warehouse from an intensive care clinical information system
- MedicineComput. Methods Programs Biomed.
- 2012
Toward Semantic Interoperability of Electronic Health Records
- Computer ScienceIEEE Transactions on Information Technology in Biomedicine
- 2012
This paper presents a proposal that smoothes out structural differences between heterogeneous EHR representations, allowing proper alignment of information, and includes a canonical ontology whose EHR-related terms focus on semantic aspects.
Towards a comprehensive electronic patient record to support an innovative individual care concept for premature infants using the openEHR approach
- MedicineInt. J. Medical Informatics
- 2009
Data integration flows for business intelligence
- Computer ScienceEDBT '09
- 2009
The requirements for data integration flows in this next generation of operational BI system are described, the limitations of current technologies, the research challenges in meeting these requirements, and a framework for addressing these challenges are described.

