• Corpus ID: 2760832

Information Theoretic Interestingness Measures for Cross-Ontology Data Mining in the Mouse Anatomy Ontology and the Gene Ontology

  title={Information Theoretic Interestingness Measures for Cross-Ontology Data Mining in the Mouse Anatomy Ontology and the Gene Ontology},
  author={Prashanti Manda and Fiona M. McCarthy and Bindu Nanduri and Susan M. Bridges},
Community annotation of biological entities with concepts from multiple bio-ontologies has created large and growing repositories of ontology-based annotation data with embedded implicit relationships among orthogonal ontologies. Development of efficient data mining methods and metrics to mine and assess the quality of the mined relationships has not kept pace with the growth of annotation data. In this study, we present a data mining method that uses ontology-guided generalization to discover… 

Tables from this paper

Post Analysis Based AKD Using Domain Ontology for Disease Prediction
This paper presents a hybrid model where post-analysis based domain ontology concept has been adopted to discover actionable knowledge from frequent patterns and outperforms as compared to a previous AIRM algorithm in order to match knowledge discovery process.


Cross-Ontology Multi-level Association Rule Mining in the Gene Ontology
A new method for association rule mining to discover implicit co-occurrence relationships across the GO sub-ontologies at multiple levels of abstraction is described and Biologically interesting rules discovered by the method reveal unknown and surprising knowledge about co-Occurring GO terms.
Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
This work introduces a method that combines concept similarity metrics, formulated using the intrinsic structure of a given ontology, with traditional interestingness measures to compute semanticinterestingness measures in the process of association rule mining.
Additional Gene Ontology structure for improved biological reasoning
A new structure is proposed, the Second Gene Ontology Layer, capturing biological relations not directly reflected in the present ontology structure, to enable GO to cover more biological knowledge, enable a more consistent use of GO and provide new opportunities for biological reasoning.
Mining Rare Associations between Biological Ontologies
A data mining approach which relates biological ontologies by mining cross and intra-ontology pairwise generalized association rules, and proposes a new class of interestingness measures designed for hierarchically organized rules.
Integrated analysis of gene expression by association rules discovery
This paper shows that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data.
GO PaD: the Gene Ontology Partition Database
This paper addresses quantitative analysis of biological functions based on nodes at the same level using an informational theoretic approach encoded in the GO Partition Database that guarantees to maximize information for gene enrichment analysis.
Mining GO Annotations for Improving Annotation Consistency
This work analyzes the full GO molecular function annotation of UniProtKB proteins and presents and evaluates a data mining algorithm, based on the association rule learning methodology, for identifying implicit relationships between molecular function terms.
Non-Lexical Approaches to Identifying Associative Relations in the Gene Ontology
The associations identified in this study could serve as the starting point for adding associative relations across hierarchies to GO, but would require manual curation.
Ontology in association rules
A study on the effects, in terms of precision and recall, of using a data preparation technique, called SemPrune, which is built on domain ontology, intended for pre- and post-processing phases of data mining.