Abstract
Recently, two ideas have been explored that lead to more accurate algorithms for time-series classification (TSC). First, it has been shown that the simplest way to gain improvement on TSC problems is to transform into an alternative data space where discriminatory features are more easily detected. Second, it was demonstrated that with a single data representation, improved accuracy can be achieved through simple ensemble schemes. We combine these two principles to test the hypothesis that forming a collective of ensembles of classifiers on different data transformations improves the accuracy of time-series classification. The collective contains classifiers constructed in the time, frequency, change, and shapelet transformation domains. For the time domain we use a set of elastic distance measures. For the other domains we use a range of standard classifiers. Through extensive experimentation on 72 datasets, including all of the 46 UCR datasets, we demonstrate that the simple collective formed by including all classifiers in one ensemble is significantly more accurate than any of its components and any other previously published TSC algorithm. We investigate alternative hierarchical collective structures and demonstrate the utility of the approach on a new problem involving classifying {\em Caenorhabditis elegans} mutant types.
Original language | English |
---|---|
Pages (from-to) | 2522-2535 |
Number of pages | 14 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 27 |
Issue number | 9 |
Early online date | 26 Mar 2015 |
DOIs | |
Publication status | Published - 1 Sept 2015 |
Keywords
- Time Series Classification
- ensembles
- shapelets
Profiles
-
Jason Lines
- School of Computing Sciences - Associate Professor in Computing Sciences
- Data Science and AI - Member
Person: Research Group Member, Academic, Teaching & Research