Document Retrieval for Biomedical Question Answering with Neural Sentence Matching

  • Jiho Noh, Ramakanth Kavuluru
  • Published 1 December 2018
  • Computer Science
  • 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)
Document retrieval (DR) forms an important component in end-to-end question-answering (QA) systems where particular answers are sought for well-formed questions. [...] At the core of our approach is a question-answer sentence matching neural network that learns a measure of relevance of a sentence to an input question in the form of a matching score.
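As a rough illustration of how sentence-level matching scores could feed document retrieval, the sketch below reranks documents by their best-scoring sentence. It is not the authors' model: `matching_score` is a hypothetical stand-in that uses token overlap (Jaccard similarity) where the paper uses a trained neural network, and taking the maximum over sentences is an assumed aggregation rule, not one stated in the abstract.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def matching_score(question, sentence):
    """Stand-in relevance score (assumption): Jaccard overlap of token sets.

    In the paper this role is played by a learned neural sentence matcher.
    """
    q, s = tokenize(question), tokenize(sentence)
    if not q or not s:
        return 0.0
    return len(q & s) / len(q | s)


def rerank(question, documents):
    """Rank documents by the score of their best-matching sentence.

    `documents` maps a document id to a list of its sentences.
    Max-aggregation is an assumption made for this sketch.
    """
    scored = []
    for doc_id, sentences in documents.items():
        best = max((matching_score(question, s) for s in sentences), default=0.0)
        scored.append((doc_id, best))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    docs = {
        "d1": ["Aspirin inhibits platelet aggregation.", "It is widely used."],
        "d2": ["The weather is sunny today."],
    }
    for doc_id, score in rerank("What does aspirin inhibit?", docs):
        print(doc_id, round(score, 3))
```

Swapping `matching_score` for a trained model would leave `rerank` unchanged, which is the main reason to keep the scorer behind a single function.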

A Review on Medical Textual Question Answering Systems Based on Deep Learning Approaches

This review surveys medical textual question-answering (MQA) systems based on deep learning approaches, thoroughly explores recent MQA architectures, and provides an in-depth analysis of the deep learning approaches used in different MQA system tasks.

Using FHIR to Construct a Corpus of Clinical Questions Annotated with Logical Forms and Answers

This paper describes a novel technique for annotating logical forms and answers for clinical questions by utilizing Fast Healthcare Interoperability Resources (FHIR), and aims to automate this step using the normalized codes present in a FHIR resource.

Literature Retrieval for Precision Medicine with Neural Matching and Faceted Summarization

A document reranking approach that combines neural query-document matching and text summarization toward such retrieval scenarios and achieves state-of-the-art performance using NIST’s TREC-PM track datasets.



An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition

Overall, BioASQ helped obtain a unified view of how techniques from text classification, semantic indexing, document and passage retrieval, question answering, and text summarization can be combined to allow biomedical experts to obtain concise, user-understandable answers to questions reflecting their real information needs.

Reading Wikipedia to Answer Open-Domain Questions

This approach combines a search component based on bigram hashing and TF-IDF matching with a multi-layer recurrent neural network model trained to detect answers in Wikipedia paragraphs, indicating that both modules are highly competitive with respect to existing counterparts.
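The search component summarized above can be sketched in a few lines, assuming a toy in-memory corpus. Everything named here is illustrative: `NUM_BUCKETS`, the use of `zlib.crc32` as the hash function, and the smoothed IDF formula are assumptions chosen to keep the example self-contained, not details taken from the paper.

```python
import math
import re
import zlib
from collections import Counter

NUM_BUCKETS = 2 ** 16  # hypothetical size of the hashed feature space


def features(text):
    """Hash unigrams and bigrams into buckets and count occurrences."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    grams = tokens + [f"{a} {b}" for a, b in zip(tokens, tokens[1:])]
    return Counter(zlib.crc32(g.encode("utf-8")) % NUM_BUCKETS for g in grams)


def build_index(docs):
    """Return per-document term frequencies and a smoothed IDF table."""
    doc_tf = [features(d) for d in docs]
    df = Counter()
    for tf in doc_tf:
        df.update(tf.keys())  # each bucket counted once per document
    n = len(docs)
    idf = {b: math.log((n + 1) / (c + 1)) + 1.0 for b, c in df.items()}
    return doc_tf, idf


def score(query, tf, idf):
    """Dot product of the TF-IDF vectors of the query and one document."""
    q = features(query)
    return sum(q[b] * tf[b] * idf.get(b, 0.0) ** 2 for b in q if b in tf)


def search(query, index):
    """Return document positions sorted from most to least relevant."""
    doc_tf, idf = index
    return sorted(
        range(len(doc_tf)),
        key=lambda i: score(query, doc_tf[i], idf),
        reverse=True,
    )


if __name__ == "__main__":
    corpus = [
        "Paris is the capital of France.",
        "Berlin is the capital of Germany.",
        "Bananas are yellow.",
    ]
    print(search("capital of France", build_index(corpus)))
```

Hashing bigrams into a fixed number of buckets keeps memory bounded at the cost of occasional collisions; with a larger corpus the bucket count would need to grow accordingly.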

Neural Question Answering at BioASQ 5B

This paper focuses on factoid and list question answering, using an extractive QA model, that is, it restricts the system to output substrings of the provided text snippets, and uses FastQA, a state-of-the-art neural QA system.

A Multi-strategy Query Processing Approach for Biomedical Question Answering: USTB_PRIR at BioASQ 2017 Task 5B

This paper describes the participation of the USTB_PRIR team in the 2017 BioASQ 5B question answering challenge, including the document retrieval, snippet retrieval, and concept retrieval tasks. We introduce

Reasoning With Neural Tensor Networks for Knowledge Base Completion

An expressive neural tensor network suitable for reasoning over relationships between two entities given a subset of the knowledge base is introduced and performance can be improved when entities are represented as an average of their constituting word vectors.

Multihop Attention Networks for Question Answer Matching

This paper proposes Multihop Attention Networks (MAN) which use multiple vectors which focus on different parts of the question for its overall semantic representation and apply multiple steps of attention to learn representations for the candidate answers.

Tasks, topics and relevance judging for the TREC Genomics Track: five years of experience evaluating biomedical text information retrieval systems

With the help of a team of expert biologist judges, the TREC Genomics track has generated four large sets of “gold standard” test collections, comprised of over a hundred unique topics, two kinds of

Bidirectional Attention Flow for Machine Comprehension

The BiDAF network is introduced, a multi-stage hierarchical process that represents the context at different levels of granularity and uses a bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization.

State-of-the-art in biomedical literature retrieval for clinical cases: a survey of the TREC 2014 CDS track

An overview of the task, a survey of the information retrieval methods employed by the participants, an analysis of the results, and a discussion on the future directions for this challenging yet important task are provided.

Distributed Representations of Sentences and Documents

Paragraph Vector is an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and its construction gives the algorithm the potential to overcome the weaknesses of bag-of-words models.