Projects per year
Abstract
Several alternative distance measures for comparing time series have recently been proposed and evaluated on time series classification (TSC) problems. These include variants of dynamic time warping (DTW), such as weighted and derivative DTW, and edit distance-based measures, including longest common subsequence, edit distance with real penalty, time warp with edit, and move–split–merge. These measures have the common characteristic that they operate in the time domain and compensate for potential localised misalignment through some elastic adjustment. Our aim is to experimentally test two hypotheses related to these distance measures. Firstly, we test whether there is any significant difference in accuracy for TSC problems between nearest neighbour classifiers using these distance measures. Secondly, we test whether combining these elastic distance measures through simple ensemble schemes gives significantly better accuracy. We test these hypotheses by carrying out one of the largest experimental studies ever conducted into time series classification. Our first key finding is that there is no significant difference between the elastic distance measures in terms of classification accuracy on our data sets. Our second finding, and the major contribution of this work, is to define an ensemble classifier that significantly outperforms the individual classifiers. We also demonstrate that the ensemble is more accurate than approaches not based in the time domain. Nearly all TSC papers in the data mining literature cite DTW (with warping window set through cross validation) as the benchmark for comparison. We believe that our ensemble is the first ever classifier to significantly outperform DTW and as such raises the bar for future work in this area.
Original language | English |
---|---|
Pages (from-to) | 565-592 |
Number of pages | 28 |
Journal | Data Mining and Knowledge Discovery |
Volume | 29 |
Issue number | 3 |
Early online date | 30 Jun 2014 |
DOIs | |
Publication status | Published - May 2015 |
Keywords
- Time Series Classification
- elastic distance measure
- ensemble
Profiles
-
-
Jason Lines
- School of Computing Sciences - Associate Professor in Computing Sciences
- Data Science and AI - Member
- Smart Emerging Technologies - Member
Person: Research Group Member, Academic, Teaching & Research
Projects
- 1 Finished
-
Time series data mining of electricity usage patterns (EPSRC Industrial CASE) (Student - Jason Lines)
1/10/10 → 30/09/15
Project: Training