Feature-Based Time-Series Analysis in R using the theft Package

  title={Feature-Based Time-Series Analysis in R using the theft Package},
  author={Trent Henderson and Ben D. Fulcher},
Time series are measured and analyzed across the sciences. One method of quantifying the structure of time series is by calculating a set of summary statistics or ‘features’, and then representing a time series in terms of its properties as a feature vector. The resulting feature space is interpretable and informative, and enables conventional statistical learning approaches, including clustering, regression, and classification, to be applied to time-series datasets. Many open-source software… 

Figures and Tables from this paper



An Empirical Evaluation of Time-Series Feature Sets

The largest feature set, hctsa, is found to be the most comprehensive, and that tsfresh is the most distinctive, due to its incorporation of large numbers of Fourier coefficients that are summarized at higher levels in the other sets.

Highly Comparative Feature-Based Time-Series Classification

  • B. FulcherN. Jones
  • Computer Science
    IEEE Transactions on Knowledge and Data Engineering
  • 2014
A highly comparative, feature-based approach to time series classification is introduced that uses an extensive database of algorithms to extract thousands of interpretable features from time series, allowing the method to perform well on very large data sets containing long time series or time series of different lengths.

catch22: CAnonical Time-series CHaracteristics

This work provides an efficient implementation of catch22, accessible from many programming environments, that facilitates feature-based time- series analysis for scientific, industrial, financial and medical applications using a common language of interpretable time-series properties.

Highly comparative time-series analysis: the empirical structure of time series and their methods

Reduced representations of both time series, in terms of their properties measured by diverse scientific methods, and of time-series analysis methods, interms of their behaviour on empirical time series are introduced and used to organize these interdisciplinary resources.

Feature-based time-series analysis

The range of feature-based representations for time series that have been developed to aid interpretable insights into time-series structure are summarized and particular emphasis is given to emerging research that facilitates wide comparison of feature the properties of a time- series dataset that make it suited to a particular feature- based representation or analysis algorithm.

Time series extrinsic regression

The results show that the state-of-the-art TSC algorithm Rocket, when adapted for regression, achieves the highest overall accuracy compared to adaptations of other TSC algorithms and state of theart machine learning (ML) algorithms such as XGBoost, Random Forest and Support Vector Regression.

A self-organizing, living library of time-series data

The web platform, CompEngine, a self-organizing, living library of time-series data that lowers the barrier to forming meaningful interdisciplinary connections between time series, and incentivizes data sharing by automatically connecting experimental and theoretical scientists across disciplines based on the empirical structure of the data they measure.