A Fast Deep Learning Model for Textual Relevance in Biomedical Information Retrieval

  title={A Fast Deep Learning Model for Textual Relevance in Biomedical Information Retrieval},
  author={Sunil Mohan and Nicolas Fiorini and Sun Kim and Zhiyong Lu},
  journal={Proceedings of the 2018 World Wide Web Conference},
Publications in the life sciences are characterized by a large technical vocabulary, with many lexical and semantic variations for expressing the same concept. Towards addressing the problem of relevance in biomedical literature search, we introduce a deep learning model for the relevance of a document's text to a keyword style query. Limited by a relatively small amount of training data, the model uses pre-trained word embeddings. With these, the model first computes a variable-length Delta… 

Figures and Tables from this paper

Developing a More Accurate Biomedical Literature Retrieval Method using Deep Learning and Citations in PubMed Central Full-text Articles

A deep learning-based method for sentence queries, called DeepSenSe, is developed using citation data available in full-text articles obtained from PubMed Central (PMC), making it possible to train quality predictive models using modern deep learning techniques.

GRAPHENE: A Precise Biomedical Literature Retrieval Engine with Graph Augmented Deep Learning and External Knowledge Empowerment

GRAPHENE is a deep learning based framework for precise BLR that is connected to a BiLSTM so both local and global topics can be explored and results on applying the system to TREC Precision Medicine track data are provided to demonstrate its effectiveness.

Building deep learning models for evidence classification from the open access biomedical literature

This work developed document classification (triage) methods for molecular interaction papers by using deep learning mechanisms of attention to aggregate classification-based decisions over selected paragraphs in the document using a combined convolutional neural network, bi-directional long short-term memory architecture augmented by attention to produce a single decision for triage.

Interactive Attention Networks for Semantic Text Matching

A novel deep neural semantic text matching model that adopts an interactive attention network to achieve information exchange between the source text and the target text, and dynamically explores the matching matrix and learns new representations of source and target texts.

Querying Across Genres for Medical Claims in News

The results show that cross-genre medical IR is a viable task, but incorporating domain-specific knowledge is crucial and the classical IR approach using BM25 with more recent transformer-based models is compared.

Effective Natural Language Processing and Interpretable Machine Learning for Structuring CT Liver-Tumor Reports

A two-stage report structuring method by integrating effective Natural Language Processing (NLP) and interpretable machine learning is proposed and the results show that, the proposed method is more promising than the fashion neural networks such as Bert in terms of effectiveness and efficiency.

Personalized neural language models for real-world query auto completion

This work improves previous methods based on neural language modeling, with the goal of building an end-to-end QAC system, and focuses on using real-world data by integrating user information for personalized suggestions when possible, making a step towards neural query auto-completion in production search engines.

DeepVar: An End-to-End Deep Learning Approach for Genomic Variant Recognition in Biomedical Literature

This work presents the first successful end-to-end deep learning approach to bridge the gap between generic NER algorithms and low-resource applications through genomic variants recognition, and can result in promising performance without any hand-crafted features or post-processing rules.



Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs

A Deep Learning approach to modeling the relevance of a document’s text to a query, applied to biomedical literature, which produces a more robust predictor than computing similarities between semantic vector representations of the query and document, and also results in significant improvements over traditional IR text factors.

A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval

A new latent semantic model that incorporates a convolutional-pooling structure over word sequences to learn low-dimensional, semantic vector representations for search queries and Web documents is proposed.

Learning deep structured semantic models for web search using clickthrough data

A series of new latent semantic models with a deep structure that project queries and documents into a common low-dimensional space where the relevance of a document given a query is readily computed as the distance between them are developed.

Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

This paper presents a convolutional neural network architecture for reranking pairs of short texts, where the optimal representation of text pairs and a similarity function to relate them in a supervised way from the available training data are learned.

Modeling Interestingness with Deep Neural Networks

The results on large-scale, real-world datasets show that the semantics of documents are important for modeling interestingness and that the DSSM leads to significant quality improvement on both tasks, outperforming not only the classic document models that do not use semantics but also state-of-the-art topic models.

Neural Models for Information Retrieval

This tutorial introduces basic concepts and intuitions behind neural IR models, and places them in the context of traditional retrieval models, by introducing fundamental concepts of IR and different neural and non-neural approaches to learning vector representations of text.

Improving Document Ranking with Dual Word Embeddings

This paper investigates the popular neural word embedding method Word2vec as a source of evidence in document ranking and proposes the proposed Dual Embedding Space Model (DESM), which provides evidence that a document is about a query term.

A Deep Relevance Matching Model for Ad-hoc Retrieval

A novel deep relevance matching model (DRMM) for ad-hoc retrieval that employs a joint deep architecture at the query term level for relevance matching and can significantly outperform some well-known retrieval models as well as state-of-the-art deep matching models.

A Deep Architecture for Matching Short Texts

This paper proposes a new deep architecture to more effectively model the complicated matching relations between two objects from heterogeneous domains and applies this model to matching tasks in natural language, e.g., finding sensible responses for a tweet, or relevant answers to a given question.

Learning to Match using Local and Distributed Representations of Text for Web Search

This work proposes a novel document ranking model composed of two separate deep neural networks, one that matches the query and the document using a local representation, and another that Matching with distributed representations complements matching with traditional local representations.