Learn More
The automatic extraction of chemical information from text requires the recognition of chemical entity mentions as one of its key steps. When developing supervised named entity recognition (NER) systems, the availability of a large, manually annotated text corpus is desirable. Furthermore, large corpora permit the robust evaluation and comparison of(More)
MOTIVATION Chemical named entity recognition is used to automatically identify mentions to chemical compounds in text and is the basis for more elaborate information extraction. However, only a small number of applications are freely available to identify such mentions. Particularly challenging and useful is the identification of International Union of Pure(More)
Reconstruction of genes and/or protein networks from automated analysis of the literature is one of the current targets of text mining in biomedical research. Some user-friendly tools already perform this analysis on precompiled databases of abstracts of scientific papers. Other tools allow expert users to elaborate and analyze the full content of a corpus(More)
The CHEMDNER task is a Named Entity Recognition (NER) challenge that aims at labeling different types of chemical names in biomedical text. We approach this challenge by proposing a hybrid approach that combines linear Conditional Random Fields (CRF) together with regular expression taggers and dictionary usage, followed by a post-processing step to tag(More)
BACKGROUND Small chemical molecules regulate biological processes at the molecular level. Those molecules are often involved in causing or treating pathological states. Automatically identifying such molecules in biomedical text is difficult due to both, the diverse morphology of chemical names and the alternative types of nomenclature that are(More)
When designing a cloud infrastructure, it is critical to ensure beforehand that the system will be able to offer the desired level of QoS (Quality of Service). Our attention is focused here on efficient QoS accessing to a biological database in cloud computing systems. Our group developed two software applications that address important biological problems,(More)
Biblio-MetReS is a single-thread data mining application that facilitates the reconstruction of molecular networks based on automated text mining analysis of published scientific literature. This application is very CPU-intensive, requiring High Performace Computing (HPC). Due to the amount of execution tasks, it can be quite slow. Those tasks are(More)
UNLABELLED One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental(More)
Our group developed two biological applications, Biblio-MetReS and Homol-MetReS, accessing the same database of organisms with annotated genes. Biblio-MetReS is a data-mining application that facilitates the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional(More)
  • 1