TY - GEN
T1 - Efficient Temporal Kernels Between Feature Sets for Time Series Classification
AU - Tavenard, Romain
AU - Malinowski, Simon
AU - Chapel, Laetitia
AU - Bailly, Adeline
AU - Sanchez, Heider
AU - Bustos, Benjamin
N1 - Publisher Copyright:
© 2017, Springer International Publishing AG.
PY - 2017
Y1 - 2017
N2 - In the time-series classification context, the majority of the most accurate core methods are based on the Bag-of-Words framework, in which sets of local features are first extracted from time series. A dictionary of words is then learned and each time series is finally represented by a histogram of word occurrences. This representation induces a loss of information due to the quantization of features into words as all the time series are represented using the same fixed dictionary. In order to overcome this issue, we introduce in this paper a kernel operating directly on sets of features. Then, we extend it to a time-compliant kernel that allows one to take into account the temporal information. We apply this kernel in the time series classification context. Proposed kernel has a quadratic complexity with the size of input feature sets, which is problematic when dealing with long time series. However, we show that kernel approximation techniques can be used to define a good trade-off between accuracy and complexity. We experimentally demonstrate that the proposed kernel can significantly improve the performance of time series classification algorithms based on Bag-of-Words. Code related to this chapter is available at: https://github.com/rtavenar/SQFD-TimeSeries Data related to this chapter are available at: http://www.timeseriesclassification.com
AB - In the time-series classification context, the majority of the most accurate core methods are based on the Bag-of-Words framework, in which sets of local features are first extracted from time series. A dictionary of words is then learned and each time series is finally represented by a histogram of word occurrences. This representation induces a loss of information due to the quantization of features into words as all the time series are represented using the same fixed dictionary. In order to overcome this issue, we introduce in this paper a kernel operating directly on sets of features. Then, we extend it to a time-compliant kernel that allows one to take into account the temporal information. We apply this kernel in the time series classification context. Proposed kernel has a quadratic complexity with the size of input feature sets, which is problematic when dealing with long time series. However, we show that kernel approximation techniques can be used to define a good trade-off between accuracy and complexity. We experimentally demonstrate that the proposed kernel can significantly improve the performance of time series classification algorithms based on Bag-of-Words. Code related to this chapter is available at: https://github.com/rtavenar/SQFD-TimeSeries Data related to this chapter are available at: http://www.timeseriesclassification.com
UR - http://www.scopus.com/inward/record.url?scp=85040252378&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-71246-8_32
DO - 10.1007/978-3-319-71246-8_32
M3 - Conference contribution
AN - SCOPUS:85040252378
SN - 9783319712451
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 528
EP - 543
BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2017, Proceedings
A2 - Ceci, Michelangelo
A2 - Hollmen, Jaakko
A2 - Todorovski, Ljupco
A2 - Vens, Celine
A2 - Dzeroski, Saso
PB - Springer Verlag
T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2017
Y2 - 18 September 2017 through 22 September 2017
ER -