UMLS to DBPedia link discovery through circular resolution

  title={UMLS to DBPedia link discovery through circular resolution},
  author={John Cuzzola and Ebrahim Bagheri and Jelena Jovanovi{\'c}},
  journal={Journal of the American Medical Informatics Association},
Objective The goal of this work is to map Unified Medical Language System (UMLS) concepts to DBpedia resources using widely accepted ontology relations from the Simple Knowledge Organization System (skos:exactMatch, skos:closeMatch) and from the Resource Description Framework Schema (rdfs:seeAlso), as a result of which a complete mapping from UMLS (UMLS 2016AA) to DBpedia (DBpedia 2015-10) is made publicly available that includes 221 690 skos:exactMatch, 26 276 skos:closeMatch, and 6 784 322… 

Figures and Tables from this paper

Biomedical Interpretable Entity Representations

This paper creates a new entity type system and training set from a large corpus of biomedical texts by mapping entities to concepts in a medical ontology, and from this mapping they derive Biomedical Interpretable Entity Representations (BIERs), in which dimensions correspond to fine-grained entity types, and values are predicted probabilities that a given entity is of the corresponding type.

Access to care: analysis of the geographical distribution of healthcare using Linked Open Data

This work focuses on generating a comprehensive semantic dataset of medical facilities worldwide containing extensive information about such facilities’ geo-location, and evaluates each data source along various dimensions, such as completeness, correctness, and interlinking with other sources, all critical aspects of current knowledge representation technologies.

The Unified Medical Language System at 30 Years and How It Is Used and Published: Systematic Review and Content Analysis (Preprint)

  • X. Jing
  • Computer Science, Medicine
  • 2020
The results, although largely related to academia, demonstrate that UMLS achieves its intended uses successfully, in addition to achieving uses broadly beyond its original intentions.

A Document Ranking Approach Based on Weighted-Gene/Protein in Large Biomedical Documents Using MapReduce Framework

A novel MapReduce based natural language processing framework is designed and implemented on large biomedical databases using weighted gene or protein measures and document ranking score and Experimental results show that the proposed model has high contextual ranking accuracy, less search space and time consumption compared to the traditional biomedical document ranking models.



DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia

An overview of the DBpedia community project is given, including its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications, including DBpedia one of the central interlinking hubs in the Linked Open Data (LOD) cloud.

Building Linked Open Data towards integration of biomedical scientific literature with DBpedia

A Linked Open Data set that links LFs to DBpedia titles by applying key collision methods to their literals, which are simple approximate string-matching methods, which help Allie users locate the correct LFs.

Silk - A Link Discovery Framework for the Web of Data

The Silk - Link Discovery Framework is presented, a tool for finding relationships between entities within different data sources and features a declarative language for specifying which types of RDF links should be discovered between data sources as well as which conditions entities must fulfill in order to be interlinked.

Is Wikipedia a Latent Gene Ontology?

  • N. DessìM. Atzori
  • Computer Science
    2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)
  • 2017
The paper demonstrates the effectiveness of Wikipedia in recognizing functional groups of genes, the quality and the wealth of its knowledge about genes as well the accuracy of TagMe.

Research and applications: A multi-part matching strategy for mapping LOINC with laboratory terminologies

The probability of term combinations proved to be a valuable strategy for increasing the quality of match results, providing recommendations for proposed LOINC conepts, and decreasing the run time for match processing.

RysannMD: A biomedical semantic annotator balancing speed and accuracy

An evaluation of medical knowledge contained in Wikipedia and its use in the LOINC database

This project focused on 1705 laboratory analytes (the first part in the LOINC laboratory name) and found that of the 1705 parts queried, 1314 matching articles were found in Wikipedia.

Research Paper: Consumer Health Concepts That Do Not Map to the UMLS: Where Do They Fit?

This study identifies a novel approach for identifying and characterize consumer health terms not found in the Unified Medical Language System (UMLS) Metathesaurus (2007 AB) and describes the procedure for creating new concepts in the process of building a consumer health vocabulary.

LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data

This paper presents and evaluates LIMES, a novel time-efficient approach for link discovery in metric spaces that utilizes the mathematical characteristics of metric spaces during the mapping process to filter out a large number of those instance pairs that do not suffice the mapping conditions.

Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions

This paper proposes a topic recognition approach based on biomedical and open-domain knowledge bases that outperformed the results obtained by individual knowledge bases by up to 16.5% F1 and achieved state-of-the-art performance.