Comprehensive data-driven analysis of the impact of chemoinformatic structure on the genome-wide biological response profiles of cancer cells to 1159 drugs

@article{Khan2011ComprehensiveDA,
  title={Comprehensive data-driven analysis of the impact of chemoinformatic structure on the genome-wide biological response profiles of cancer cells to 1159 drugs},
  author={Suleiman A. Khan and Ali Faisal and John Mpindi and Juuso A. Parkkinen and Tuomo Kalliokoski and Antti Poso and Olli Kallioniemi and Krister Wennerberg and Samuel Kaski},
  journal={BMC Bioinformatics},
  year={2011},
  volume={13},
  pages={112 - 112}
}
BackgroundDetailed and systematic understanding of the biological effects of millions of available compounds on living cells is a significant challenge. As most compounds impact multiple targets and pathways, traditional methods for analyzing structure-function relationships are not comprehensive enough. Therefore more advanced integrative models are needed for predicting biological effects elicited by specific chemical features. As a step towards creating such computational links we developed… 
Identification of structural features in chemicals associated with cancer drug response: a systematic data-driven analysis
TLDR
This method quantitatively outperforms the limited earlier methods on CMap and identifies both the previously reported associations and several interesting novel findings, by taking into account multiple cell lines and advanced 3D structural descriptors.
Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis.
TLDR
This paper spotlights the integration of gene expression data and target prediction scores, providing insight into mechanism of action (MoA) in compounds and shows one or more underlying MoA for compounds that were well-substantiated with literature.
Gene-Set Local Hierarchical Clustering (GSLHC)—A Gene Set-Based Approach for Characterizing Bioactive Compounds in Terms of Biological Functional Groups
TLDR
A GSA-based version of CMap, Gene- set Connectivity Map (GSCMap), in which all the genomic profiles in CMap are converted, using gene-sets from the Molecular Signatures Database, to functional profiles, and the platform Gene-Set Local Hierarchical Clustering (GSLHC) is constructed for discovering insights on coordinated actions of biological functions and facilitating classification of heterogeneous subtypes on drug-driven responses.
A transcriptomics data-driven gene space accurately predicts liver cytopathology and drug-induced liver injury
TLDR
A ‘big data compacting and data fusion’—concept to capture diverse adverse outcomes on cellular and organismal levels is utilized to capture unanticipated harmful effects of chemicals and drug molecules.
An integrative analysis of small molecule transcriptional responses in the human malaria parasite Plasmodium falciparum
TLDR
This work provides a new framework for linking transcriptional responses to drug MOA in P. falciparum and can be generalized for the same purpose in other organisms.
Matrix and Tensor Factorization Methods for Toxicogenomic Modeling and Prediction
TLDR
These approaches are described by using exemplary application of a data set comprising over 2.5 × 108 data points and 1300 compounds, with the aim of explaining dose-dependent cytotoxic effects by identifying hidden factors/ patterns captured in transcriptomics data with links to structural fingerprints of the compounds.
Probabilistic drug connectivity mapping
TLDR
The probabilistic modeling-based connectivity mapping method is superior to alternatives in finding functionally and chemically similar drugs from the Connectivity Map data set, and an extension of the method is capable of retrieving combinations of drugs that match different relevant parts of the query drug response profile.
Bayesian multi-view models for data-driven drug response analysis
TLDR
The results illustrate that modeling the precise multi-view and multi-way formulation of the data is valuable for discovering interpretable latent components as well as for the prediction of unseen responses of drugs.
Probabilistic components of molecular interactions and drug responses ; Solutason lääkevasteiden ja molekyylivuorovaikutusten tilastollisia komponentteja
TLDR
The first comprehensive view of molecular toxicogenomic responses is constructed with high performance in drug toxicity prediction, and probabilistic component models are applied to detect coherent drug response patterns from gene expression data.
...
...

References

SHOWING 1-10 OF 43 REFERENCES
Investigating the correlations among the chemical structures, bioactivity profiles and molecular targets of small molecules
TLDR
This analysis shows that bioactivity profile comparison can provide insights into the mode of actions at the molecular level, thus will facilitate the knowledge-based discovery of novel compounds with desired pharmacological properties.
Pharmacogenomic analysis: correlating molecular substructure classes with microarray gene expression data
TLDR
A set of conceptual tools and fluent computational methods for projecting directly from gene expression patterns to drug substructures and vice versa are introduced, providing a systematic way to mine databases for the design of further structure–activity studies, particularly to aid in target and lead identification.
Discovery of drug mode of action and drug repositioning from transcriptional responses
TLDR
This work developed an automatic and robust approach that exploits similarity in gene expression profiles following drug treatment, across multiple cell lines and dosages, to predict similarities in drug effect and MoA, and correctly predicted the MoA for nine anticancer compounds and was able to discover an unreported effect for a well-known drug.
Integration of clinical chemistry, expression, and metabolite data leads to better toxicological class separation.
TLDR
Improved predictive ability of the different classes is seen, suggesting that this approach is a suitable method for data integration and visualization of biological data and for correlation of biological parameters between the different data types, which could lead to an improvement in biological interpretation.
Drug-Induced Regulation of Target Expression
TLDR
It is confirmed systematically that in particular G-protein coupled receptors, when serving as known targets, are regulated upon drug treatment and newly identified drug-induced differential regulation of Lanosterol 14-alpha demethylase, Endoplasmin, DNA topoisomerase 2-alpha and Calmodulin 1.
Network pharmacology: the next paradigm in drug discovery.
TLDR
A new appreciation of the role of polypharmacology has significant implications for tackling the two major sources of attrition in drug development--efficacy and toxicity.
The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease
TLDR
The first installment of a reference collection of gene-expression profiles from cultured human cells treated with bioactive small molecules is created, and it is demonstrated that this “Connectivity Map” resource can be used to find connections among small molecules sharing a mechanism of action, chemicals and physiological processes, and diseases and drugs.
An Algorithmic Framework for Predicting Side Effects of Drugs
TLDR
A novel approach to predict the side effects of a given drug, taking into consideration information on other drugs and their side effects, is reported on, able to infer side effects that highly match existing knowledge.
Predicting new molecular targets for known drugs
TLDR
Compared 3,665 US Food and Drug Administration (FDA)-approved and investigational drugs against hundreds of targets, defining each target by its ligands, chemical similarities between drugs and ligand sets predicted thousands of unanticipated associations.
Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.
TLDR
The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
...
...