• Corpus ID: 49868873

Novel Feature-Based Clustering of Micro-Panel Data (CluMP)

  title={Novel Feature-Based Clustering of Micro-Panel Data (CluMP)},
  author={Luk{\'a}{\vs} Sob{\'i}{\vs}ek and M{\'a}ria Stachov{\'a} and J{\'a}nos Fojtik},
Micro-panel data are collected and analysed in many research and industry areas. Cluster analysis of micro-panel data is an unsupervised learning exploratory method identifying subgroup clusters in a data set which include homogeneous objects in terms of the development dynamics of monitored variables. The supply of clustering methods tailored to micro-panel data is limited. The present paper focuses on a feature-based clustering method, introducing a novel two-step characteristic-based… 


Model-based clustering of time series in group-specific functional subspaces
This work develops a general procedure for clustering functional data which adapts the clustering method high dimensional data clustering (HDDC), originally proposed in the multivariate context, based on a functional latent mixture model which fits the functional data in group-specific functional subspaces.
A Comparison of Hierarchical Methods for Clustering Functional Data
A simulation study compares the performance of four major hierarchical methods for clustering functional data and yields concrete suggestions to future researchers to determine the best method for clustered their functional data.
Crime And Economic Performance. A Cluster Analysis Of Panel Data On Italy'S Nuts 3 Regions
Crime is a complex phenomenon which needs to be investigated at appropriate disaggregate temporal and territorial levels of analysis. The speci c issue addressed in this paper concerns the
Clustering of time series data - a survey
  • T. Liao
  • Computer Science
    Pattern Recognit.
  • 2005
This paper surveys and summarizes previous works that investigated the clustering of time series data in various application domains, including general-purpose clustering algorithms commonly used in time series clustering studies.
One of the important steps in financial distress analyses is to correctly and reasonably mark a company whether is, or it is not in financial distress risk. There are many definitions used in the
A Bounded Index for Cluster Validity
A new bounded index for cluster validity, called the score function (SF), is introduced, based on standard cluster properties, and is shown to work well on multi-dimensional data sets and is able to accommodate unique and sub-cluster cases.
Model-based clustering for longitudinal data
A model-based clustering method is proposed for clustering individuals on the basis of measurements taken over time and the identification of hormone trajectories that are likely to lead to adverse pregnancy outcomes in a group of pregnant women is illustrated.
Kml: A package to cluster longitudinal data
KmL is an R package providing an implementation of k-means designed to work specifically on longitudinal data, providing several different techniques for dealing with missing values in trajectories and providing an easy way to export graphical representations of the mean trajectories resulting from the clustering.
Dimension-Reduced Clustering of Functional Data via Subspace Separation
The proposed method aims to minimize a distance between functional objects and their projections with the imposition of clustering penalties to find an optimal cluster structure of functions as well as an optimal subspace for clustering simultaneously.
A recursively partitioned mixture model for clustering time-course gene expression data.
The proposed time-course RPMM (TC-RPMM), subjects are clustered on the basis of their temporal profiles of gene expression using a mixture of mixed effects models framework, which captures changes in gene expression over time and models the autocorrelation between repeated gene expression measurements for the same subject.