Combining HCI, Natural Language Processing, and Knowledge Discovery - Potential of IBM Content Analytics as an Assistive Technology in the Biomedical Field

  title={Combining HCI, Natural Language Processing, and Knowledge Discovery - Potential of IBM Content Analytics as an Assistive Technology in the Biomedical Field},
  author={Andreas Holzinger and Christof Stocker and Bernhard Ofner and Gottfried Prohaska and Alberto Brabenetz and Rainer Hofmann-Wellenhof},
Medical professionals are confronted with a flood of big data most of it containing unstructured information. Such unstructured information is the subset of information, where the information itself describes parts of what constitutes as significant within it, or in other words - structure and information are not completely separable. The best example for such unstructured information is text. For many years, text mining has been an essential area of medical informatics. Although text can… 

Biomedical Text Mining: State-of-the-Art, Open Problems and Future Challenges

This paper provides a short, concise overview of some selected text mining methods, focusing on statistical methods, i.e. Latent Semantic Analysis, Probabilistic Latent seminar analysis, Hierarchical Latent Dirichlet Allocation, Principal Component Analysis, and Support Vector Machines, along with some examples from the biomedical domain.

Trends in Interactive Knowledge Discovery for Personalized Medicine: Cognitive Science meets Machine Learning

This paper provides a rough overview of the HCI-KDD approach and focuses on three future trends: graph-based mining, topological data mining and entropy-based data mining.

Special Issue on the Curative Power of Medical Data

This Special Issue contains four surveys, which include a wide range of topics, from the analysis of biomedical articles writing style, to automatically generating tests from medical references, constructing a Gold standard biomedical corpus or the visualization of biomedical data.

NLP-based platform as a service: a brief review

This paper presents a survey of NLP in cloud computing with a key focus on the comparison of cloud-based NLP services, challenges of N LP and big data while emphasizing the necessity of viable cloud- based N LP services.

Enhancing Patient Safety through Human-Computer Information Retrieval on the Example of German-Speaking Surgical Reports

The thought process of a domain expert is investigated in order to derive his information of interest and a facet-based way to navigate this kind of information in the form of extracted phrases is described.

Visual Data Mining: Effective Exploration of the Biological Universe

The experience with biology researchers has required us to address their needs and requirements in the design and development of a scalable and interactive network visualization and analysis platform, NAViGaTOR, now in its third major release.

NLPReViz: an interactive tool for natural language processing on clinical text

A prototype tool for interactive review and revision of natural language processing models of binary concepts extracted from clinical notes, which evaluated in a user study involving 9 physicians, who used the tool to build and revise models for 2 colonoscopy quality variables.

Lecture 5 Semi-structured, Weakly Structured, and Unstructured Data

At the end of this fourth lecture, you: would have acquired background knowledge on some issues in standardization and structurization of data; would have a general understanding of

The Curative Power of Medical Data

Linking the contents of these documents to each other, as well as to specialized ontologies, could enable access to and discovery of structured clinical information and foster a major leap in natural language processing and health research.

A hybrid solution for extracting structured medical information from unstructured data in medical records via a double-reading/entry system

Using the DRESS, the feasibility of extracting clinical data from unstructured MRs to create semi-structured and patient-centered electronic health record database is demonstrated and this hybrid system represents probably a workable solution to solve the big medical data challenge.



Semantic Information in Medical Information Systems: Utilization of Text Mining Techniques to Analyze Medical Diagnoses

This paper proposes a calculation of significant co-occurrences of diseases and defined regions of the human body, in order to identify possible risks for health and designs and develops an application for analyzing expert comments on magnetic resonance images (MRI) diagnoses by applying a text mining method.

Text analytics for life science using the Unstructured Information Management Architecture

The value of text analysis in biomedical research, the development of the BioTeKS system, and applications which demonstrate its functions are described.

Quality-Based Knowledge Discovery from Medical Text on the Web

A selection of quality-oriented Web-based tools for analyzing biomedical literature are presented, and specifically discuss PolySearch, FACTA and Kleio and Pointwise Mutual Information (PMI), which is a measure to discover the strength of a relationship.

Text analysis and knowledge mining system

By applying the prototype system named TAKMI (Text Analysis and Knowledge Mining) to textual databases in PC help centers, the system can automatically detect product failures; determine issues that have led to rapid increases in the number of calls and their underlying reasons; and analyze help center productivity and changes in customers' behavior involving a particular product, without reading any of the text.

Disease-Disease Relationships for Rheumatic Diseases: Web-Based Biomedical Textmining an Knowledge Discovery to Assist Medical Decision Making

A web based text-mining tool is used to find disease names and their co-occurrence frequencies in MEDLINE articles for each disease and an evaluation on knowledge discovery of disease-disease relationships for rheumatic diseases is presented.

A Comparison of Different Retrieval Strategies Working on Medical Free Texts

This paper describes the evaluation of four different types of information retrieval strategies: keyword search, search performed by a medical domain expert, a semantic based information retrieval tool, and a purely statistical information retrieval method.

Naming notes: transitions from free text to structured entry.

A framework is set forth that connects over-arching questions concerning medical informatics systems development with the practical, cultural and conceptual issues involved in transitions from handwritten and other free text documentation to structured entry of medical records to build patient profiles.

Quality Issues in the Management of Web Information

The volume constitutes a compendium of both heterogeneous approaches and sample applications focusing specific aspects of the quality assessment for Web-based information for researchers, PhD students and practitioners carrying out their research activity in the field of Web information retrieval and filtering, Web information mining, information quality representation and management.

Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure

An automated information extraction system can be used to accurately extract EF for quality measurement and an EF value of <40% can be accurately identified in VA echocardiogram reports.

Power of expression in the electronic patient record: structured data or narrative text?