Effect of Data Representation Method for Effective Mining of Time Series Data

机译：数据表示方法对时间序列数据有效挖掘的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent days, generation, collection and storage of large volume of data is increasing due to enhanced sensor and memory technology. Analysis and mining of these big data in various real life problems from health care to disaster prevention is also becoming possible with the ever increasing computational power and maginificant growth of powerful machine learning algorithms. The dynamics of natural or man made disasters can be captured by time series data and their efficient analysis can lead to the development of effective systems to minimize the loss or damage. Efficient techniques and algorithms for time series analysis is a challenging problem. Traditional machine learning algorithms for analysis of static data cannot be directly applied for analyzing dynamic time series data. For analysis of time series, representation of data and the comparison method are very important. Various methods for representation and similarity measurement of time series data have been proposed. In this paper, a comparative study has been done by simulation experiments with 85 bench mark data sets as a first step to study the effect of representation of data for mining task. The main focus of this paper is to study the effect of classification with raw data using different similarity measures with traditional classifiers and convolutional neural network and transformed data in to recurrence plot, classified by convolutional neural network. From the simulation results, it is found that classification accuracy with convolutional neural networks in many data sets are improved with time series represented by recurrence plot.

机译：近年来，由于增强的传感器和存储技术，海量数据的生成，收集和存储正在增加。随着计算能力的不断提高和强大的机器学习算法的飞速发展，从医疗保健到防灾等各种现实问题中的大数据分析和挖掘也变得可能。时间序列数据可以捕获自然灾害或人为灾害的动态，对其进行有效的分析可以开发出有效的系统，以最大程度地减少损失或破坏。用于时间序列分析的有效技术和算法是一个具有挑战性的问题。传统的用于静态数据分析的机器学习算法不能直接应用于动态时间序列数据的分析。对于时间序列分析，数据表示和比较方法非常重要。已经提出了用于时间序列数据的表示和相似性测量的各种方法。本文通过对85个基准数据集的模拟实验进行了比较研究，作为研究数据表示对采矿任务的影响的第一步。本文的主要重点是使用与传统分类器和卷积神经网络不同的相似性度量研究原始数据的分类效果，并将数据转换为递归图，并通过卷积神经网络进行分类。从仿真结果可以发现，用递归图表示的时间序列可以提高卷积神经网络在许多数据集中的分类精度。

著录项

来源
《IEEE International Conference on Big Data and Smart Computing》|2019年|1-6|共6页
会议地点
作者
Kotaro Nakano; Basabi Chakraborty;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Time series analysis; Time measurement; Task analysis; Feature extraction; Data mining; Convolutional neural networks; Machine learning algorithms;

机译：时间序列分析;时间测量;任务分析;特征提取;数据挖掘;卷积神经网络;机器学习算法;

相似文献

外文文献
中文文献
专利

1. AN EFFECTIVE DATA MINING TECHNIQUE FOR RECONSTRUCTING GENE REGULATORY NETWORKS FROM TIME SERIES EXPRESSION DATA [J] . PATRICK C. H. MA, KEITH C. C. CHAN Journal of Bioinformatics and Computational Biology . 2007,第3期

机译：利用时间序列表达数据重建基因调控网络的有效数据挖掘技术
2. Three-dimensional piecewise cloud representation for time series data mining [J] . Si Gangquan, Zheng Kai, Zhou Zhou, Neurocomputing . 2018,第NOVa17期

机译：用于时间序列数据挖掘的三维分段云表示
3. Piecewise two-dimensional normal cloud representation for time-series data mining [J] . Deng Weihui, Wang Guoyin, Xu Ji Information Sciences: An International Journal . 2016,第Null期

机译：用于时间序列数据挖掘的分段二维法线云表示
4. Effect of Data Representation Method for Effective Mining of Time Series Data [C] . Kotaro Nakano, Basabi Chakraborty International Conference on Big Data and Smart Computing . 2019

机译：数据表示方法对时间序列数据有效挖掘的影响
5. Mining tools for high-dimensional time series data using spectral methods [D] . Mukhopadhyay, Jaydip 2007

机译：使用频谱方法的高维时间序列数据挖掘工具
6. Addressing Big Data Time Series: Mining Trillions of Time Series Subsequences Under Dynamic Time Warping [O] . THANAWIN RAKTHANMANON, BILSON CAMPANA, ABDULLAH MUEEN, -1

机译：解决大数据时间序列：动态时间规整下挖掘数千个时间序列子序列
7. An effective data mining technique for reconstructing gene regulatory networks from time series expression data [O] . Ma PCH, Chan KCC 2007

机译：从时间序列表达数据重建基因调控网络的有效数据挖掘技术

Effect of Data Representation Method for Effective Mining of Time Series Data

摘要

著录项

相似文献

相关主题

期刊订阅