Validating GAN-BioBERT: A Methodology for Assessing Reporting Trends in Clinical Trials

  title={Validating GAN-BioBERT: A Methodology for Assessing Reporting Trends in Clinical Trials},
  author={Joshua J. Myszewski and Emily Klossowski and Patrick Meyer and Kristin M Bevil and L Klesius and Kristopher M. Schroeder},
  journal={Frontiers in Digital Health},
Background The aim of this study was to validate a three-class sentiment classification model for clinical trial abstracts combining adversarial learning and the BioBERT language processing model as a tool to assess trends in biomedical literature in a clearly reproducible manner. We then assessed the model's performance for this application and compared it to previous models used for this task. Methods Using 108 expert-annotated clinical trial abstracts and 2,000 unlabeled abstracts this study… 
1 Citations

Figures and Tables from this paper

Evaluation of language analysis to summarize the literature: a comparison to traditional meta-analysis in primary hip and knee surgery

A moderately positive outlook supporting the utilization of regional anesthesia for hip and knee arthroplasty was found in the 115 articles that were included for analysis, which was well aligned with the conclusions reached by a previous meta-analysis of the same articles.



A visualization of evolving clinical sentiment using vector representations of clinical notes

This paper visualize the evolution of clinical language and sentiment with respect to several common population-level categories including: time in the hospital, age, mortality, gender and race and found intriguing differences in the sentiment of clinical notes over time, outcome, and demographic features.

The Role of Balanced Training and Testing Data Sets for Binary Classifiers in Bioinformatics

This work shows that using balanced training data results in the highest balanced accuracy (the average of True Positive Rate and True Negative Rate), Matthews correlation coefficient, and area under ROC curves, no matter what the proportions of the two phenotypes are in the testing data.

Citation Sentiment Analysis in Clinical Trial Papers

This study aims to determine the sentiment polarity that the citation context carries towards the cited paper, and indicates that it is feasible to use machine learning methods for citation sentiment analysis in biomedical publications.

Toward automatic evaluation of medical abstracts: The current value of sentiment analysis and machine learning for classification of the importance of PubMed abstracts of randomized trials for stroke.

  • I. FischerH. Steiger
  • Medicine
    Journal of stroke and cerebrovascular diseases : the official journal of National Stroke Association
  • 2020

BioBERT: a pre-trained biomedical language representation model for biomedical text mining

This article introduces BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), which is a domain-specific language representation model pre-trained on large-scale biomedical corpora that largely outperforms BERT and previous state-of-the-art models in a variety of biomedical text mining tasks when pre- trained on biomedical Corpora.

Construct validity of six sentiment analysis methods in the text of encounter notes of patients with critical illness

While sentiment between all methods was positively correlated, their agreement was weak, and sentiment analysis holds promise for clinical applications, but will require a novel domain-specific method applicable to clinical text.

A survey on semi-supervised learning

This survey aims to provide researchers and practitioners new to the field as well as more advanced readers with a solid understanding of the main approaches and algorithms developed over the past two decades, with an emphasis on the most prominent and currently relevant work.

Does direction of results of abstracts submitted to scientific conferences on drug addiction predict full publication?

Clinical trials were the minority of abstracts presented at the CPDD; evidence of possible publication bias in the field of addiction is found, with negative or null results having half the likelihood of being published than positive ones.

Extracting the Population, Intervention, Comparison and Sentiment from Randomized Controlled Trials

An identification approach for the Population, the Intervention and the Comparison in Randomized Controlled Trials (RCTs) is proposed and the sentiment of RCTs is classified to determine whether an Intervention is more effective than its Comparison.