Learn More
Metabolomics studies generate increasingly complex data tables, which are hard to summarize and visualize without appropriate tools. The use of chemometrics tools, e.g., principal component analysis (PCA), partial least-squares to latent structures (PLS), and orthogonal PLS (OPLS), is therefore of great importance as these include efficient, validated, and(More)
We provide an overview of how the underlying philosophy of chemometrics is integrated throughout metabonomic studies. Four steps are demonstrated: (1) definition of the aim, (2) selection of objects, (3) sample preparation and characterization, and (4) evaluation of the collected data. This includes the tools applied for linear modeling, for example,(More)
During generation of microarray data, various forms of systematic biases are frequently introduced which limits accuracy and precision of the results. In order to properly estimate biological effects, these biases must be identified and discarded. We introduce a normalization strategy for multi-channel microarray data based on orthogonal projections to(More)
In general, applications of metabonomics using biofluid NMR spectroscopic analysis for probing abnormal biochemical profiles in disease or due to toxicity have all relied on the use of chemometric techniques for sample classification. However, the well-known variability of some chemical shifts in 1H NMR spectra of biofluids due to environmental differences(More)
Gene expression data from microarrays are being applied to predict preclinical and clinical endpoints, but the reliability of these predictions has not been established. In the MAQC-II project, 36 independent teams analyzed six microarray data sets to generate predictive models for classifying a sample with respect to one of 13 endpoints indicative of lung(More)
In metabolomics, the objective is to identify differences in metabolite profiles between samples. A widely used tool in metabolomics investigations is gas chromatography-mass spectrometry (GC/MS). More than 400 compounds can be detected in a single analysis, if overlapping GC/MS peaks are deconvoluted. However, the deconvolution process is time-consuming(More)
The goal of this group is to define the reporting requirements associated with the statistical analysis (including univariate, multivariate, informatics, machine learning etc.) of metabolite data with respect to other measured/collected experimental data (often called meta-data). These definitions will embrace as many aspects of a complete metabolomics(More)
Analysis of the entire set of low molecular weight compounds (LMC), the metabolome, could provide deeper insights into mechanisms of disease and novel markers for diagnosis. In the investigation, we developed an extraction and derivatization protocol, using experimental design theory (design of experiment), for analyzing the human blood plasma metabolome by(More)
We describe here the implementation of the statistical total correlation spectroscopy (STOCSY) analysis method for aiding the identification of potential biomarker molecules in metabonomic studies based on NMR spectroscopic data. STOCSY takes advantage of the multicollinearity of the intensity variables in a set of spectra (in this case 1H NMR spectra) to(More)
This article describes the applicability of multivariate projection techniques, such as principal-component analysis (PCA) and partial least-squares (PLS) projections to latent structures, to the large-volume high-density data structures obtained within genomics, proteomics, and metabonomics. PCA and PLS, and their extensions, derive their usefulness from(More)