An approach to the analysis of SDSS spectroscopic outliers based on Self-Organizing Maps

  title={An approach to the analysis of SDSS spectroscopic outliers based on Self-Organizing Maps},
  author={Diego Fustes and Minia Manteiga and Carlos Dafonte and Bernardino Arcay and Ana Ulla and K. W. Smith and R. Borrachero and Rosanna Sordo},
  journal={arXiv: Instrumentation and Methods for Astrophysics},
Aims. A new method is applied to the segmentation, and further analysis of the outliers resulting from the classification of astronomical objects in large databases is discussed. The method is being used in the framework of the Gaia satellite DPAC (Data Processing and Analysis Consortium) activities to prepare automated software tools that will be used to derive basic astrophysical information that is to be included in Gaia final archive. Methods. Our algorithm has been tested by means of… Expand
GUASOM: Gaia Utility for Analysis and Knowledge Discovery based on Self Organizing Maps
We present a method for knowledge analysis in large astronomical spectrophotometric archives. The method is based on a type of unsupervised learning Artificial Neural Networks named Self-organizingExpand
A distributed learning algorithm for Self-Organizing Maps intended for outlier analysis in the GAIA - ESA mission
This work presents a method that is based on Self-Organizing Maps (SOM) and parallelized by means of the Hadoop framework so as to improve its performance and compares the execution times of both the sequential and the distributed versions of the algorithm. Expand
GUASOM: an adaptive visualization tool for unsupervised clustering in spectrophotometric astronomical surveys
An adaptive visualization tool for unsupervised classification of astronomical objects in a Big Data context such as the one found in the increasingly popular large spectrophotometric sky surveys, based on Kohonen’s self-organizing maps. Expand
SOMBI: Bayesian identification of parameter relations in unstructured cosmological data
This work describes the implementation and application of a correlation determination method based on Self Organizing Maps and Bayesian Inference (SOMBI). SOMBI aims to automatically identifyExpand
A Blended Artificial Intelligence Approach for Spectral Classification of Stars in Massive Astronomical Surveys
This paper analyzes and compares the sensitivity and suitability of several artificial intelligence techniques applied to the Morgan–Keenan system for the classification of stars and proposes a final hybrid system that integrates the best intelligent techniques, automatically collects the most important spectral features, and determines the spectral type and luminosity level of the stars according to the MK standard system. Expand
Analysis and Knowledge Discovery by Means of Self-Organizing Maps for Gaia Data Releases
A useful technique for source clustering is shown, focusing on the development of an advanced visualization tool based on this technique, which will apply data mining techniques such as Self-Organizing Maps. Expand
Automatic Object Detection and Categorisation in Deep Astronomical Imaging Surveys Using Unsupervised Machine Learning
An unsupervised machine learning technique that automatically segments and labels galaxies in astronomical imaging surveys using only pixel data is presented, and can be used to identify rarer objects and present lensed galaxy candidates from the CANDELS imaging. Expand
An automatic taxonomy of galaxy morphology using unsupervised machine learning
We present an unsupervised machine learning technique that automatically segments and labels galaxies in astronomical imaging surveys using only pixel data. Distinct from previous unsupervisedExpand
Teaching a machine to see: unsupervised image segmentation and categorisation using growing neural gas and hierarchical clustering
It is suggested that the algorithm has potential as a tool in the automatic analysis and data mining of next-generation imaging and spectral surveys, and could also find application beyond astronomy. Expand
Distributed Unsupervised Clustering for Outlier Analysis in the Biggest Milky Way Survey: ESA Gaia Mission
An algorithm based on optimized Self-Organized Maps (SOM) is proposed and implemented for taking advantage of distributed computing platforms, such as the MapReduce paradigm for Apache Hadoop and Apache Spark. Expand


HSC: A multi-resolution clustering strategy in Self-Organizing Maps applied to astronomical observations
This work presents a strategy for the classification of astronomical objects based on spectrophotometric data and the use of unsupervised neural networks and statistical classification algorithms based on a hierarchical structure of neural networks composed of various tree-structured SOM networks. Expand
SOM ensemble for unsupervised outlier analysis. Application to outlier identification in the Gaia astronomical survey
A novel method to combine SOMs trained with independent features that are calculated from spectrophotometry can help to improve the models used for the supervised classification of astronomical sources and allows for data exploration and knowledge discovery in huge astronomical databases such as the upcoming Gaia mission. Expand
Unsupervised self-organized mapping: a versatile empirical tool for object selection, classification and redshift estimation in large surveys
It is found that the SOM can deliver accuracies that are competitive with many of the established template fitting and empirical methods, and could be a powerful tool in the era of extremely large – ‘petabyte’ – data bases where efficient data mining is a paramount concern. Expand
A High Resolution Atlas of Composite SDSS Galaxy Spectra
In this work we present an atlas of composite spectra of galaxies based on the data of the Sloan Digital Sky Survey Data Release 7 (SDSS DR7). Galaxies are classified by colour, nuclear activity andExpand
Galaxy Morphology without Classification: Self-organizing Maps
We examine a general framework for visualizing data sets of high (greater than 2) dimensionality and demonstrate the framework by taking the morphology of galaxies at moderate redshifts as anExpand
A semi-empirical library of galaxy spectra for Gaia classification based on SDSS data and PÉGASE models
Aims:This paper is the third in a series implementing a classification system for Gaia observations of unresolved galaxies. The system makes use of template galaxy spectra in order to determineExpand
The Discrete Source Classifier in Gaia-Apsis
The Discrete Source Classifier in the Gaia Astrophysical parameters processing chain has the task of sorting the 109 detected Gaia sources into broad astrophysical classes. These classifications willExpand
The Gaia astrophysical parameters inference system (Apsis) - Pre-launch description
The Gaia satellite will survey the entire celestial sphere down to 20th magnitude, obtaining astrometry, photometry, and low resolution spectrophotometry on one billion astronomical sources, plusExpand
Error characterization of the Gaia astrometric solution - II. Validating the covariance expansion model
Context. To use the data in the future Gaia catalogue it is important to have accurate estimates of the statistical uncertainties and correlations of the errors in the astrometric data given in theExpand
A grid of MARCS model atmospheres for late-type stars. I. Methods and general properties
Context. In analyses of stellar spectra and colours, and for the analysis of integrated light from galaxies, a homogeneous grid of model atmospheres of late-type stars and corresponding flux spectraExpand