首页> 外文学位 >Ensemble Stream Model for Data-Cleaning in Sensor Networks.
【24h】

Ensemble Stream Model for Data-Cleaning in Sensor Networks.

机译:传感器网络中用于数据清洗的集成流模型。

获取原文
获取原文并翻译 | 示例

摘要

Ensemble Stream Modeling and Data-cleaning are sensor information processing systems have different training and testing methods by which their goals are cross-validated. This research examines a mechanism, which seeks to extract novel patterns by generating ensembles from data. The main goal of label-less stream processing is to process the sensed events to eliminate the noises that are uncorrelated, and choose the most likely model without over fitting thus obtaining higher model confidence. Higher quality streams can be realized by combining many short streams into an ensemble which has the desired quality. The framework for the investigation is an existing data mining tool.;First, to accommodate feature extraction such as a bush or natural forest-fire event we make an assumption of the burnt area (BA*), sensed ground truth as our target variable obtained from logs. Even though this is an obvious model choice the results are disappointing. The reasons for this are two: One, the histogram of fire activity is highly skewed. Two, the measured sensor parameters are highly correlated. Since using non descriptive features does not yield good results, we resort to temporal features. By doing so we carefully eliminate the averaging effects; the resulting histogram is more satisfactory and conceptual knowledge is learned from sensor streams.;Second is the process of feature induction by cross-validating attributes with single or multi-target variables to minimize training error. We use F-measure score, which combines precision and accuracy to determine the false alarm rate of fire events. The multi-target data-cleaning trees use information purity of the target leaf-nodes to learn higher order features. A sensitive variance measure such as ƒ-test is performed during each node's split to select the best attribute. Ensemble stream model approach proved to improve when using complicated features with a simpler tree classifier.;The ensemble framework for data-cleaning and the enhancements to quantify quality of fitness (30% spatial, 10% temporal, and 90% mobility reduction) of sensor led to the formation of streams for sensor-enabled applications. Which further motivates the novelty of stream quality labeling and its importance in solving vast amounts of real-time mobile streams generated today.
机译:集成流建模和数据清理是传感器信息处理系统,具有不同的训练和测试方法,通过这些方法可以交叉验证其目标。这项研究探讨了一种机制,该机制试图通过从数据生成合奏来提取新颖的模式。无标签流处理的主要目标是处理感测到的事件,以消除不相关的噪声,并选择最可能的模型而不会过度拟合,从而获得更高的模型置信度。通过将许多短流组合成具有所需质量的集合,可以实现更高质量的流。该调查的框架是现有的数据挖掘工具。首先,为了适应诸如灌木丛或自然森林大火等事件的特征提取,我们假设烧毁面积(BA *),将地面真实感作为我们获得的目标变量从日志。即使这是一个明显的模型选择,结果也令人失望。造成这种情况的原因有两个:一是火灾活动的直方图高度偏斜。第二,测得的传感器参数高度相关。由于使用非描述性特征不会产生良好的结果,因此我们诉诸时间特征。通过这样做,我们仔细消除了平均效应;第二个是通过交叉验证具有单个或多个目标变量的属性以最小化训练误差的特征归纳过程。我们使用F-measure分数,结合精确度和准确性来确定火灾事件的误报率。多目标数据清除树使用目标叶节点的信息纯度来学习高阶特征。在每个节点的分割过程中都会执行诸如ƒ-test之类的敏感方差度量以选择最佳属性。事实证明,集成流模型方法在使用带有简单树分类器的复杂功能时会得到改善。;数据清理的集成框架以及量化传感器适应度的增强功能(30%的空间,10%的时间和90%的移动性降低)导致形成了支持传感器的应用流。这进一步激发了流质量标签的新颖性及其在解决当今产生的大量实时移动流中的重要性。

著录项

  • 作者

    Iyer, Vasanth.;

  • 作者单位

    Florida International University.;

  • 授予单位 Florida International University.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 273 p.
  • 总页数 273
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号