L. Mark Hall

Learn More
Four modeling techniques, using topological descriptors to represent molecular structure, were employed to produce models of human serum protein binding (% bound) on a data set of 1008 experimental values, carefully screened from publicly available sources. To our knowledge, this data is the largest set on human serum protein binding reported for QSAR(More)
Several QSPR models were developed for predicting intrinsic aqueous solubility, S(o). A data set of 5,964 neutral compounds was sub-divided into two classes, aromatic and non-aromatic compounds. Three models were created with different methods on both data sets: two regression models (multiple linear regression and partial least squares) and an artificial(More)
The goal of many metabolomic studies is to identify the molecular structure of endogenous molecules that are differentially expressed among sampled or treatment groups. The identified compounds can then be used to gain an understanding of disease mechanisms. Unfortunately, despite recent advances in a variety of analytical techniques, small molecule (<1000(More)
The binding of beta-lactams to human serum proteins was modeled with topological descriptors of molecular structure. Experimental data was the concentration of protein-bound drug expressed as a percent of the total plasma concentration (percent fraction bound, PFB) for 87 penicillins and for 115 beta-lactams. The electrotopological state indices (E-State)(More)
In this paper, we present MolFind, a highly multithreaded pipeline type software package for use as an aid in identifying chemical structures in complex biofluids and mixtures. MolFind is specifically designed for high-performance liquid chromatography/mass spectrometry (HPLC/MS) data inputs typical of metabolomics studies where structure identification is(More)
The binding affinity to human serum albumin for 94 drugs was modeled with topological descriptors of molecular structure, using as experimental data the HPLC chromatographic retention index [logk(HSA)] on immobilized albumin. The electrotopological state (E-State) along with the molecular connectivity chi indices provided the basis for a satisfactory model:(More)
MS and HPLC are commonly used for compound characterization and obtaining structural information; in the field of metabonomics, these two analytical techniques are often combined to characterize unknown endogenous or exogenous metabolites present in complex biological samples. Since the structures of a majority of these metabolites are not actually(More)
A back-propagation artificial neural network (ANN) was used to create a 10-fold leave-10%-out cross-validated ensemble model of high performance liquid chromatography retention index (HPLC-RI) for a data set of 498 diverse druglike compounds. A 10-fold multiple linear regression (MLR) ensemble model of the same data was developed for comparison. Molecular(More)
A novel approach is developed for modeling situations in which the modeled property is an algebraically transformed version of the original experimental data. In many cases such a transformation results in a data set with a significantly smaller data range. Here we explore the effects of range-of-data on modeling statistics. We illustrate a twostep method(More)