Semedico: A Comprehensive Semantic Search Engine for the Life Sciences

  title={Semedico: A Comprehensive Semantic Search Engine for the Life Sciences},
  author={Erik Faessler and Udo Hahn},
SEMEDICO is a semantic search engine designed to support literature search in the life sciences by integrating the semantics of the domain at all stages of the search process—from query formulation via query processing up to the presentation of results. SEMEDICO excels with an ad-hoc search approach which directly reflects relevance in terms of information density of entities and relations among them (events) and, a truly unique feature, ranks interaction events by certainty information… 

Figures from this paper

RusNLP: Semantic Search Engine for Russian NLP Conference Papers

RusNLP, a web service implementing semantic search engine and recommendation system over proceedings of three major Russian NLP conferences (Dialogue, AIST and AINL) and contains about 400 academic papers in English.

Honey Bee Versus Apis Mellifera: A Semantic Search for Biological Data

A semantic search for biological data based on metadata files is introduced and it is demonstrated how ontological concepts are integrated into the search and how it improves the search result.

Biomedical knowledge base construction from text and its applications in knowledge-based systems

A largely automated and scalable pattern-based knowledge extraction method covering a spectrum of different text genres and distilling a wide variety of facts from different biomedical areas is devised, and the fact-pattern duality paradigm of previous methods is generalized.

Annotation Data Management with JeDIS

The Jena Document Information System (JeDIS) is introduced and the focus lies on its capability to partition annotation graphs into modules, which allow easy manipulation of their annotations and the creation of alternative annotations of individual documents.

A Data-driven Approach for Core Biodiversity Ontology Development

This paper develops a semi-automatic data-driven approach that uses clear links between domain experts and knowledge engineers and uses the fusion/merge strategy by reusing existing ontologies and is guided by data from several data resources in the biodiversity domain.

Towards Implementation of an Information Dissemination Tool for Health Publications: Case of a Developing Country

A web based, low-cost and user-friendly health information dissemination tool based on machine learning algorithms that analyses full-text publications sequentially and cluster related documents for ease of access is proposed.

BiodivOnto: Towards a Core Ontology for Biodiversity

The design of a core ontology for biodiversity is presented aiming to establish a link between the foundational and domain-specific ontologies and it is guided by data from several resources in the biodiversity domain.

Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?

It is shown that existing metadata currently poorly reflect information needs and therefore are the biggest obstacle in retrieving relevant data in biodiversity research, a field that produces a large amount of heterogeneous data.

A Superpower Tree Mapping and Tracing (STMT) Algorithm for OCA Sub-types Classification and Associated Risk Factors Identification

It is revealed from the results of the classification algorithm that the proposed method can be used as a supplementary tool for the experts to diagnose the subtypes of oculocutaneous albinism and simplify the analysis of various physiological signals of patients.

Controversial Trials First: Identifying Disagreement Between Clinical Guidelines and New Evidence.

A software system for the automatic identification of disagreement between clinical guidelines and published research is described, which improves precision over state-of-the-art literature research strategies while maintaining near-total recall.



GeneView: a comprehensive semantic search engine for PubMed

GeneView is a semantic search engine for biomedical knowledge built upon a comprehensively annotated version of PubMed abstracts and openly available PubMed Central full texts that enables a number of features extending classical search engines.

Corpus annotation for mining biomedical events from literature

A new type of semantic annotation, event annotation, is completed, which is an addition to the existing annotations in the GENIA corpus, and is expected to become a valuable resource for NLP (Natural Language Processing)-based TM in the bio-medical domain.

Quertle: The Conceptual Relationships Alternative Search Engine for PubMed.

This search engine describes itself as a “relationship-driven biomedical search” tool that is “designed by biomedical professionals for biomedical professionals” and has an advanced ontology of biological, medical, and chemical terms.

Ferret: a sentence-based literature scanning system

Ferret, a prototype retrieval system, designed to retrieve and rank sentences (and their documents) conveying gene-centric relationships of interest to a scientist, is presented.

GoPubMed: exploring PubMed with the Gene Ontology

GoPubMed, a web server which allows users to explore PubMed search results with the Gene Ontology (GO), a hierarchically structured vocabulary for molecular biology, gives an overview of the literature abstracts by categorizing abstracts according to the GO and thus allowing users to quickly navigate through the Abstracts by category.

A fast rule-based approach for biomedical event extraction

This system consists of two phases: in the learning phase, a dictionary and patterns are generated automatically from annotated events and in the extraction phase, the dictionary and obtained patterns are applied to extract events from input text.

Discovering and visualizing indirect associations between biomedical concepts

FACTA+ is the first real-time web application that offers the functionality of finding concepts involving biomolecular events and visualizing indirect associations of concepts with both their categories and importance.

A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text

This paper shows that the problem of identifying abbreviations' definitions can be solved with a much simpler algorithm than that proposed by other research efforts, and achieves 96% precision and 82% recall on a standard test collection, which is at least as good as existing approaches.

PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more

PolySearch2 maintains an extensive thesaurus of biological terms and exploits the latest search engine technology to rapidly retrieve relevant articles and databases records to facilitate user interpretation.

‘HypothesisFinder:’ A Strategy for the Detection of Speculative Statements in Scientific Text

A pattern matching approach for the detection of speculative statements in scientific text that uses a dictionary of speculative patterns to classify sentences as hypothetical and it is shown that this approach captures a wide spectrum of scientific speculations on Alzheimer's disease.