Impressive progress in genome sequencing, protein expression and high-throughput crystallography and NMR has radically transformed the opportunities to use protein three-dimensional structures to accelerate drug discovery, but the quantity and complexity of the data have ensured a central place for informatics. Structural biology and bioinformatics have… (More)
Neurofibrillary tangles, one of the hallmarks of Alzheimer disease (AD), are composed of paired helical filaments of abnormally hyperphosphorylated tau. The accumulation of these proteinaceous aggregates in AD correlates with synaptic loss and severity of dementia. Identifying the kinases involved in the pathological phosphorylation of tau may identify… (More)
MOTIVATION The ChEMBLSpace graphical explorer enables the identification of compounds from the ChEMBL database, which exhibit a desirable polypharmacology profile. This profile can be predefined or created iteratively, and the tool can be extended to other data sources.
The increase of publicly available bioactivity data has led to the extensive development and usage of in silico bioactivity prediction algorithms. A particularly popular approach for such analyses is the multiclass Naïve Bayes, whose output is commonly processed by applying empirically-derived likelihood score thresholds. In this work, we describe a… (More)
Understanding the mode of action of small molecules is critical for drug research, both with respect to efficacy and anticipated side effects. Given that many compounds act on multiple targets simultaneously, it appears that linking single targets to outcomes is no longer sufficient. Hence, in this work we explore machine learning methods for rationalising… (More)
We report on the sequencing of 10,545 human genomes at 30×-40× coverage with an emphasis on quality metrics and novel variant and sequence discovery. We find that 84% of an individual human genome can be sequenced confidently. This high-confidence region includes 91.5% of exon sequence and 95.2% of known pathogenic variant positions. We present the… (More)
A continuing problem in protein-ligand docking is the correct relative ranking of active molecules versus inactives. Using the ChemScore scoring function as implemented in the GOLD docking software, we have investigated the effect of scaling hydrogen bond, metal-ligand, and lipophilic interactions based on the buriedness of the interaction. Buriedness was… (More)