Rediscovering Don Swanson:The Past, Present and Future of Literature-based Discovery

  title={Rediscovering Don Swanson:The Past, Present and Future of Literature-based Discovery},
  author={Neil R. Smalheiser},
  journal={Journal of Data and Information Science},
  pages={43 - 64}
  • N. Smalheiser
  • Published 2017
  • Computer Science
  • Journal of Data and Information Science
Abstract Purpose The late Don R. Swanson was well appreciated during his lifetime as Dean of the Graduate Library School at University of Chicago, as winner of the American Society for Information Science Award of Merit for 2000, and as author of many seminal articles. In this informal essay, I will give my personal perspective on Don’s contributions to science, and outline some current and future directions in literature-based discovery that are rooted in concepts that he developed. Design… 

Figures from this paper

Remembering Don Swanson: Link to Bibliometric-enhanced Information Retrieval

Don R. Swanson suggested that even in the 1950's it was clear that conceptual problems of greater subtlety were involved in Information retrieval, but there has been little interest in these problems in spite of the rapid development and growth of online services since that time.

A Systematic Review on Literature-based Discovery

This systematic review provides an in-depth analysis of the computational techniques used in the LBD process using a novel, up-to-date, and detailed classification and discusses the prevailing research deficiencies in the discipline by highlighting the challenges and opportunities of future LBD research.

Scientometric analysis and knowledge mapping of literature-based discovery (1986–2020)

This paper presents the first inclusive scientometric overview of LBD research, utilizing a comprehensive scientometric approach incorporating CiteSpace to systematically analyze the literature on LBD from the last four decades.

Literature-Based Discovery beyond the ABC paradigm: a contrastive approach

A novel contrastive approach which leverages differences between “views” of the biomedical literature in order to target relations between concepts of interest and demonstrates the relevance of this approach through quantitative evaluation on the 8 target discoveries.

Validation and Topic-driven Ranking for Biomedical Hypothesis Generation Systems

A novel method to validate HG systems is presented, which both scales to large validation sets and does not require expert input, and a number of new metrics are introduced to automatically identify plausible generated hypotheses.

A systematic review on literature-based discovery workflow

This systematic review provides a comprehensive overview of the LBD workflow by answering nine research questions related to the major components of theLBD workflow (i.e., input, process, output, and evaluation).

Automatically explaining literature based discoveries

To identify candidate LBD related concepts and to determine if the natures of the relationship may be automatically explained using supervised machine learning classification, a training set of data is identified that includes linking B terms and data representations suitable for applying classification techniques to explain the relationships are explored.

Accelerating innovations through literature-based discovery: Qualitative analysis of an early usability evaluation

The study provides initial qualitative insights into four key usability issues faced by users in performing LBD-style hypothesis generation activities and suggests that addressing these challenges holds the key to increasing LBD systems future adoptions by researchers and scientists in practical settings.



Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy

Innovative biomedical librarians and information specialists who want to expand their roles as expert searchers need to know about profound changes in biology and parallel trends in text mining. In

Emerging approaches in literature-based discovery: techniques and performance review

This paper reviews the progress of literature-based discovery research, focusing on understanding their technical features and evaluating their performance, and observes the births of new trends and unprecedented paradigm shifts among the recently emerging literature- based discovery approach.

Literature-based discovery: Beyond the ABCs

Perhaps the most urgent need is to develop a series of objective literature-based interestingness measures, which can customize the output of LBD systems for different types of scientific investigations.

A new evaluation methodology for literature-based discovery systems

Enriching plausible new hypothesis generation in PubMed

A method is proposed that expands and elaborates the existing hypothesis and provides plausible candidates of the concretized hypothesis, which are based on extracted heterogeneous entities and detailed relation information, along with a reliable ranking criterion.

Knowledge discovery by automated identification and ranking of implicit relationships

A computational method to identify relationships within scientific reports, such that large sets of relationships between unrelated items could be sought out and statistically ranked for their potential relevance as a set, demonstrates the method is well suited to discover novel relationships based upon existing shared relationships.

A quantitative model for linking two disparate sets of articles in MEDLINE

The model greatly simplifies and streamlines the process of carrying out a two- node search, and may be applicable to a number of other literature-based discovery applications, including the so-called one-node search and related gene-centric strategies that incorporate implicit links to predict how genes may be related to each other and to human diseases.

Biomedical Text Mining for Research Rigor and Integrity: Tasks, Challenges, Directions

Four key areas in which text mining techniques can make a significant contribution are identified: plagiarism/fraud detection, ensuring adherence to reporting guidelines, managing information overload, and accurate citation/enhanced bibliometrics.

Intervening in the Life Cycles of Scientific Knowledge Patrick Wilson, The Value of Currency

The dominant information problems of the future inevitably will derive from the fragmentation of knowledge, a problem shift that may lead to new discovery-oreinted views of information searching and use.