Data- and expert-driven rule induction and filtering framework for functional interpretation and description of gene sets

  • Aleksandra Gruca, Marek Sikora
  • Published 26 June 2017
  • Computer Science
  • Journal of Biomedical Semantics
Background: High-throughput methods in molecular biology have provided researchers with an abundance of experimental data that need to be interpreted in order to understand the experimental results. Manual methods of functional gene/protein group interpretation are expensive and time-consuming; therefore, there is a need to develop new efficient data mining methods and bioinformatics tools that could support the expert in the process of functional analysis of experimental results. Results: In this study, we…
4 Citations

Functional Interpretation of Gene Sets: Semantic-Based Clustering of Gene Ontology Terms on the BioTest Platform

This study analyzes transcription profiles of the human cell line K562, shows that clustering groups functionally related GO terms and thereby yields a more concise and comprehensive description, and applies a cluster-specific data aggregation tool.

Review of Rule Quality Measurement: Metrics and Rule Evaluation Models

This review partially surveys ideas for measuring the quality of rules as a knowledge representation from varied viewpoints, and how evaluation models are constructed to assess rules obtained either from human experts or from rule induction algorithms.

Machine learning for bioinformatics and neuroimaging

It is shown how ML techniques such as clustering, classification, embedding techniques and network‐based approaches can be successfully employed to tackle various problems such as gene expression clusters, patient classification, brain networks analysis, and identification of biomarkers.

Efficiency Comparison of Modern Computer Languages: Sorting Benchmark

The paper surveys the execution features of ready-to-use sorting procedures in various modern computer languages/compilers and reveals some differences between particular implementations in efficiency of sorting in terms of CPU load and execution time.



RuleGO: a logical rules-based tool for description of gene groups by means of Gene Ontology

RuleGO is a web-based application that allows the user to describe gene groups on the basis of logical rules whose premises include Gene Ontology (GO) terms and reflect the co-appearance of GO terms describing the genes supported by the rules.

Learning Rule-based Models of Biological Process from Gene Expression Time Profiles Using Gene Ontology

A systematic supervised learning approach to predicting biological process from time series of gene expression data and biological knowledge that can automatically associate genes with novel hypotheses of biological process is reported.

Annotation-Modules: a tool for finding significant combinations of multisource annotations for gene lists

Annotation-Modules is developed, which offers an improvement over the current tools in two critical aspects: first, the underlying annotation database implements features from many different fields like gene regulation and expression, sequence properties, evolution and conservation, genomic localization and functional categories, resulting in about 60 different annotation features.

Predicting gene ontology biological process from temporal gene expression patterns.

The aim of the present study was to generate hypotheses on the involvement of uncharacterized genes in biological processes. To this end, supervised learning was used to analyze microarray-derived

Ontological analysis of gene expression data: current tools, limitations, and open problems

A detailed comparison of the capabilities of 14 ontological analysis tools is presented using the following criteria: scope of the analysis, visualization capabilities, statistical model used, correction for multiple comparisons, reference microarrays available, installation issues and sources of annotation data.

GeneCodis3: a non-redundant and modular enrichment analysis tool for functional genomics

This version of GeneCodis has been made to remove noisy and redundant output from the enrichment results with the inclusion of a recently reported algorithm that summarizes significantly enriched terms and generates functionally coherent modules of genes and terms.

Improvement of FP-Growth Algorithm for Mining Description-Oriented Rules

A new modification of the rule induction method for describing gene groups using Gene Ontology, based on the FP-growth algorithm, is proposed, taking advantage of the hierarchical structure of the GO graph.

Ciruvis: a web-based tool for rule networks and interaction detection using rule-based classifiers

Rule networks enable a fast method for model visualization, provide an exploratory heuristic for interaction detection, and may be used to aid and improve rule-based classification.

Fuzzy association rules for biological data analysis: A case study on yeast

A novel fuzzy methodology based on a fuzzy association rule mining method for biological knowledge extraction is proposed over a yeast genome dataset containing heterogeneous information regarding structural and functional genome features.