Extracting discriminative shapelets from heterogeneous sensor data

Abstract

We study the problem of identifying discriminative features in Big Data arising from heterogeneous sensors. We highlight the heterogeneity of sensor data from engineering applications and the challenges of automatically extracting only the most interesting features from large datasets. We formulate this problem as classification of multivariate time series and design shapelet-based algorithms for the task. We propose a novel approach, called Shapelet Forests (SF), which combines shapelet extraction with feature selection. We compare our proposed method with other approaches for mining shapelets from multivariate time series using data from real-world engineering applications. Quantitative analysis of the experiments shows that SF performs better than the baseline approaches and achieves high classification accuracy. In addition, the method identifies noisy sensors in multivariate data and discounts their use for classification.
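For readers unfamiliar with shapelets, the sketch below illustrates the standard building block that shapelet-based classifiers rely on: the distance from a time series to a shapelet is the minimum Euclidean distance between the shapelet and any equal-length window of the series. This is not the Shapelet Forests algorithm from the paper, only a minimal illustration of the general idea. The function names `shapelet_distance` and `shapelet_features`, and the dictionary layout mapping a sensor name to its series, are assumptions made for this example.

```python
import numpy as np


def shapelet_distance(series, shapelet):
    """Minimum Euclidean distance between `shapelet` and any
    same-length contiguous window of `series` (both 1-D)."""
    series = np.asarray(series, dtype=float)
    shapelet = np.asarray(shapelet, dtype=float)
    if len(shapelet) > len(series):
        raise ValueError("shapelet must not be longer than the series")
    # All contiguous windows of the series, shape (n_windows, len(shapelet)).
    windows = np.lib.stride_tricks.sliding_window_view(series, len(shapelet))
    # Euclidean distance of each window to the shapelet; keep the best match.
    return float(np.sqrt(((windows - shapelet) ** 2).sum(axis=1)).min())


def shapelet_features(sample, shapelets):
    """Turn one multivariate sample into a feature vector.

    sample:    dict mapping a (hypothetical) sensor name to its 1-D series
    shapelets: list of (sensor_name, shapelet) pairs
    Each feature is the distance of one sensor's series to one shapelet.
    """
    return np.array([shapelet_distance(sample[name], s) for name, s in shapelets])
```

A feature vector built this way can be passed to any standard classifier or feature-selection step. Under this representation, a sensor whose shapelet distances do not help separate the classes would be a natural candidate for down-weighting, which is one way to picture the noisy-sensor behaviour the abstract describes.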

DOI: 10.1109/BigData.2014.7004344

Cite this paper

@article{Patri2014ExtractingDS,
  title={Extracting discriminative shapelets from heterogeneous sensor data},
  author={Om Prasad Patri and Abhishek B. Sharma and Haifeng Chen and Guofei Jiang and Anand V. Panangadan and Viktor K. Prasanna},
  journal={2014 IEEE International Conference on Big Data (Big Data)},
  year={2014},
  pages={1095-1104}
}