首页> 外文会议>IEEE International Congress on Big Data >Hydrological Time Series Anomaly Mining Based on Symbolization and Distance Measure
【24h】

Hydrological Time Series Anomaly Mining Based on Symbolization and Distance Measure

机译:基于符号和距离测度的水文时间序列异常挖掘

获取原文

摘要

Large amount of hydrological data set is a kind of big data, which has much hidden and potentially useful knowledge. It is necessary to extract these knowledge from hydrological data set, which can provide more valuable hydrological information and be useful for future hydrological forecasting. Data mining based on time series is widely used currently. There are some techniques based on time series to extract anomaly. However, most of these techniques cannot suit big unstable data such as hydrological big data set. Some important problems are high fitting error after dimension reduction and low accuracy of mining results. In this work we propose a new idea to solve the problem of hydrological anomaly mining based on time series. The idea combines time series symbolization with distance measure. It proposes Feature Points Symbolic Aggregate Approximation (FP SAX) to improve the selection of feature points, and then measures the distance of strings by Symbol Distance based Dynamic Time Warping (SD DTW). Finally, the distance which we have got are sorted. A set of dedicated experiments are performed to validate our approach. The experimental data set is based on the water level data set obtained from Xiaomeikou gauge station in the Taihu Lake from 1956 to 2005. The results of experiments show that our approach has lower fitting error and higher accuracy.
机译:大量的水文数据集是一种大数据,它具有很多隐藏的和潜在有用的知识。有必要从水文数据集中提取这些知识,这可以提供更有价值的水文信息,并且对将来的水文预报很有用。基于时间序列的数据挖掘目前被广泛使用。有一些基于时间序列的技术可以提取异常。但是,这些技术中的大多数不能适应不稳定的大数据,例如水文大数据集。一些重要的问题是尺寸减小后的拟合误差高以及采矿结果的准确性低。在这项工作中,我们提出了一个新的思想来解决基于时间序列的水文异常开采问题。这个想法将时间序列符号化与距离度量结合在一起。它提出了特征点符号聚合近似(FP SAX)来改进特征点的选择,然后通过基于符号距离的动态时间规整(SD DTW)测量字符串的距离。最后,对我们得到的距离进行排序。进行了一组专用实验以验证我们的方法。实验数据集基于1956年至2005年从太湖小梅口水位站获得的水位数据集。实验结果表明,该方法具有较低的拟合误差和较高的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号