Factor analysis applied to regional geochemical data: problems and possibilities

  title={Factor analysis applied to regional geochemical data: problems and possibilities},
  author={C. Reimann and P. Filzmoser and R. Garrett},
  journal={Applied Geochemistry},
Abstract Cluster analysis can be used to group samples and to develop ideas about the multivariate geochemistry of the data set at hand. Due to the complex nature of regional geochemical data (neither normal nor log-normal, strongly skewed, often multi-modal data distributions, data closure), cluster analysis results often strongly depend on the preparation of the data (e.g. choice of the transformation) and on the clustering algorithm selected. Different variants of cluster analysis can lead… Expand
Wahrscheinlichkeitstheorie Cluster analysis applied to regional geochemical data : Problems and possibilities
A large regional geochemical data set of O-horizon samples from a 188,000 km2 area in the European Arctic, analysed for 38 chemical elements, pH, electrical conductivity (both in a water extraction)Expand
Cluster Analysis with Application to Geochemical Data
The goal of this presentation is to give a deeper insight into the large data sets of the C-horizon and the O-horizon soil samples with the help of cluster analysis and fuzzy cluster analysis. TheseExpand
Application of cluster analysis to geochemical compositional data for identifying ore-related geochemical anomalies
Cluster analysis is a well-known technique that is used to analyze various types of data. In this study, cluster analysis is applied to geochemical data that describe 1444 stream sediment samplesExpand
Regional geochemical pattern recognition with multivariate correspondence cluster analysis in the Ravar area, Iran
Abstract Multiple correspondence analysis and cluster analysis (MCACA) is developed in this paper, and then used to extract the main advantages of multiple correspondence analysis and hierarchicalExpand
A New Approach to Identify Recharge Areas in the Lower Virgin River Basin and Surrounding Basins by Multivariate Statistics
Many studies group hydrochemical datasets into hydrochemical facies using graphical and multivariate statistical methods. Compared to graphical methods, cluster analysis and principal componentExpand
A comparison study on detection of key geochemical variables and factors through three different types of factor analysis
Abstract Large numbers of variables have been measured to explain different phenomena. Factor analysis has widely been used in order to reduce the dimension of datasets. Additionally, the techniqueExpand
A modified procedure for mixture-model clustering of regional geochemical data
Abstract A modified procedure is proposed for mixture-model clustering of regional-scale geochemical data. The key modification is the robust principal component transformation of the isometricExpand
Multivariate analysis of stream sediment data from Nanling metallogenic belt, South China
Multivariate statistical methods can be applied to analyse a complete set of multidimensional geochemical data and to identify latent relationships among these data. In this paper, we usedExpand
Multivariate statistical analysis and partitioning of sedimentary geochemical data sets: General principles and specific MATLAB scripts
[1] Multivariate statistical treatments of large data sets in sedimentary geochemical and other fields are rapidly becoming more popular as analytical and computational capabilities expand. BecauseExpand
Sedimentary chemofacies characterization by means of multivariate analysis
Abstract Multivariate statistical analysis is applied to geochemical data from three sections forming part of the stratigraphic record of the Cerro Pelado Formation (Oligocene–Miocene), in theExpand


Exploratory data analysis: recent advances for the interpretation of geochemical data
Major advantages of EDA are the straightforward application of its techniques and the easily interpretable results, which can be enhanced considerably, resulting in a more objective outlier definition as well as a better resolution of the regional background. Expand
Correspondence cluster analysis and its application in exploration geochemistry
Abstract Correspondence cluster analysis, developed in this paper, regroups the main advantages of correspondence analysis and cluster analysis. The output of the method, correspondence clusterExpand
Comparison of interpretations of geochemical soil data by some multivariate statistical methods, Key Anacon, N.B., Canada
The nature of the information - and its usefulness - that can be obtained from some of the more common multivariate statistical techniques is illustrated by their application to H and B horizon soilsExpand
The management and analysis of regional geochemical data
Abstract Experiences in the U.S. National Uranium Resource Evaluation Program and Canadian Uranium Reconnaissance Program related to the use of geochemical exploration methods are drawn upon toExpand
Critical review of some multivariate procedures in the analysis of geochemical data
Simulation experiments have been conducted to examine the potential usefulness of R-mode and Q-mode factor methods in the analysis and interpretation of geochemical data. The R-mode factor analysisExpand
Normal and lognormal data distribution in geochemistry: death of a myth. Consequences for the statistical treatment of geochemical and environmental data
Abstract All variables of several large data sets from regional geochemical and environmental surveys were tested for a normal or lognormal data distribution. As a general rule, almost all variablesExpand
Interpreting exploration geochemical data from Outokumpu, Finland: a MVE-robust factor analysis
Abstract A factor analysis (FA) rendered robust using the minimum volume ellipsoid (MVE) estimator was carried out on geochemical till data consisting of 622 observations on 21 variables (Al, Ba, Ca,Expand
The chi-square plot: a tool for multivariate outlier recognition
Abstract In large multi-element regional surveys statistically derived threshold levels of the form that define, for example, the top 2% of the data for each element as worthy of furtherExpand
Principal component analysis for clustering gene expression data
The empirical study showed that clustering with the PCs instead of the original variables does not necessarily improve, and often degrades, cluster quality, and would not recommend PCA before clustering except in special circumstances. Expand
Principal components analysis using the hypothetical closed array
The application of R-mode principal components analysis to a set of closed chemical data is described using previously published chemical analyses of rocks from Gough Island. Different measures ofExpand