Random forest automated supervised classification of Hipparcos periodic variable stars

  title={Random forest automated supervised classification of Hipparcos periodic variable stars},
  author={Pierre Dubath and Lorenzo Rimoldini and Maria Suveges and J. Blomme and M. L'opez and Luis M. Sarro and Joris De Ridder and Johan Peter Cuypers and Leanne P. Guy and I. Lecoeur and Krzysztof Nienartowicz and A. Jan and M. Beck and Nami Mowlavi and Peter De Cat and Thomas Lebzelter and Laurent Eyer},
  journal={Monthly Notices of the Royal Astronomical Society},
We present an evaluation of the performance of an automated classification of the Hipparcos periodic variable stars into 26 types. The sub-sample with the most reliable variability types available in the literature is used to train supervised algorithms to characterize the type dependencies on a number of attributes. The most useful attributes evaluated with the random forest methodology include, in decreasing order of importance, the period, the amplitude, the V − I colour index, the absolute… 
It is described a methodology to classify periodic variable stars identified using photometric time-series measurements constructed from the Wide-field Infrared Survey Explorer (WISE) full-mission single-exposure Source Databases and shows that Fourier decomposition techniques can be extended into the mid-IR to assist with their classification.
Automated classification of Hipparcos unsolved variables
An automated classification of stars exhibiting periodic, non-periodic and irregular light variations, using the Hipparcos catalogue of unsolved variables to complement the training set of periodic variables of Dubath et al.
Optimizing Automated Classification of Periodic Variable Stars in New Synoptic Surveys
Results from simulated and real--world light curves show that noisification can significantly improve classifier performance and be used for upcoming surveys such as Gaia and LSST and describe some of the promises and challenges of applyingNoisification to these surveys.
A package for the automated classification of periodic variable stars
A machine learning package for the classification of periodic variable stars finds that recall and precision do not vary significantly if there are more than 80 data points and the duration is more than a few weeks, and investigates how the performance varies with the number ofData points and duration of observations.
A supervised learning method to automatically classify variable X-ray sources in the Second XMM-Newton Serendipitous Source Catalog and identifies 12 anomalous sources, of which 2XMM J180658.7–500250 appears to be the most unusual source in the sample.
Classification of Variable Stars Light Curves Using Long Short Term Memory Network
This work presents a new automated method, 1D CNN-LSTM, for classifying variable stars using a hybrid neural network of one-dimensional CNN and LSTM network which employs the raw time-series data from the variable stars.
Autoclassification of the Variable 3XMM Sources Using the Random Forest Machine Learning Algorithm
This paper presents the catalog of variable sources in the Third XMM-Newton Serendipitous Source catalog (3XMM) autoclassified using the Random Forest machine learning algorithm, and shows that the Random forests algorithm is highly effective at automatically classifying 3XMM sources.
Gaia eclipsing binary and multiple systems. Supervised classification and self-organizing maps
This study investigates the potential of a few machine-learning methods for the automated analysis of eclipsing binaries in the data of large surveys, and investigates the performance of principal component analysis (PCA), linear discriminant analysis (LDA), random forest classification and self-organizing maps.
A machine learned classifier for RR Lyrae in the VVV survey
This work describes a supervised machine-learned classifier constructed for assigning a score to a K s -band VVV light curve that indicates its likelihood of being ab -type RR Lyrae, and obtains a classifier based on the AdaBoost algorithm that achieves a harmonic mean between false positives and false negatives of ≈7% for typical VvV light-curve sets.


A study of supervised classification of Hipparcos variable stars using PCA and Support Vector Machines
We report on the automated classification of Hipparcos variable stars by a supervised classification algorithm known as Support Vector Machines. The dataset comprised about 3200 stars, each
Automated classification of variable stars for All‐Sky Automated Survey 1–2 data
With the advent of surveys generating multi-epoch photometry and the discovery of large numbers of variable stars, the classification of these stars has to be automatic. We have developed such a
Automated supervised classification of variable stars - I. Methodology
An overview of the stellar variability classes that are presently known, in terms of some relevant stellar parameters, to use the class descriptions obtained as the basis for an automated ``supervised classification'' of large databases.
Comparative clustering analysis of variable stars in the Hipparcos, OGLE Large Magellanic Cloud, and CoRoT exoplanet databases
Context. Discovery of new variability classes in large surveys using multivariate statistics techniques such as clustering, relies heavily on the correct understanding of the distribution of known
Automated supervised classification of variable stars in the CoRoT programme. Method and application
Context: Aims: In this work, we describe the pipeline for the fast supervised classification of light curves observed by the CoRoT exoplanet CCDs. We present the classification results obtained for
Automated Classification of Variable Stars in the Asteroseismology Program of the Kepler Space Mission
We present the first results of the application of supervised classification methods to the Kepler Q1 long-cadence light curves of a subsample of 2288 stars measured in the asteroseismology program
Light-curve classification in massive variability surveys - II. Transients towards the Large Magellanic Cloud
Automatic classification of variability is now possible with tools such as neural networks. Here, we present two neural networks for the identification of microlensing events: the first discriminates
Lightcurve Classification in Massive Variability Surveys
This paper pioneers the use of neural networks to provide a fast and automatic way to classify lightcurves in massive photometric datasets. As an example, we provide a working neural network that can
Light-curve classification in massive variability surveys — I. Microlensing
This paper exploits neural networks to provide a fast and automatic way to classify light curves in massive photometric data sets. As an example, we provide a working neural network that can
Random Forests
Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.