首页> 外文学位 >Property Testing and Probability Distributions: New Techniques, New Models, and New Goals
【24h】

Property Testing and Probability Distributions: New Techniques, New Models, and New Goals

机译:属性测试和概率分布:新技术,新模型和新目标

获取原文
获取原文并翻译 | 示例

摘要

In order to study the real world, scientists (and computer scientists) develop simplified models that attempt to capture the essential features of the observed system. Understanding the power and limitations of these models, when they apply or fail to fully capture the situation at hand, is therefore of uttermost importance.;In this thesis, we investigate the role of some of these models in property testing of probability distributions (distribution testing), as well as in related areas. We introduce natural extensions of the standard model (which only allows access to independent draws from the underlying distribution), in order to circumvent some of its limitations or draw new insights about the problems they aim at capturing. Our results are organized in three main directions: (i) We provide systematic approaches to tackle distribution testing questions. Specifically, we provide two general algorithmic frameworks that apply to a wide range of properties, and yield efficient and near-optimal results for many of them. We complement these by introducing two methodologies to prove information-theoretic lower bounds in distribution testing, which enable us to derive hardness results in a clean and unified way. (ii) We introduce and investigate two new models of access to the unknown distributions, which both generalize the standard sampling model in different ways and allow testing algorithms to achieve significantly better efficiency. Our study of the power and limitations of algorithms in these models shows how these could lead to faster algorithms in practical situations, and yields a better understanding of the underlying bottlenecks in the standard sampling setting. (iii) We then leave the field of distribution testing to explore areas adjacent to property testing. We define a new algorithmic primitive of sampling correction , which in some sense lies in between distribution learning and testing and aims to capture settings where data originates from imperfect or noisy sources. Our work sets out to model these situations in a rigorous and abstracted way, in order to enable the development of systematic methods to address these issues.
机译:为了研究现实世界,科学家(和计算机科学家)开发了简化的模型,试图捕获被观察系统的基本特征。因此,了解这些模型的功能和局限性,当它们应用或无法完全掌握当前情况时,具有极其重要的意义。;本论文中,我们研究了其中一些模型在概率分布的属性测试(分布)中的作用。测试),以及相关领域。我们引入了标准模型的自然扩展(它仅允许访问基础分布的独立图纸),以规避其某些局限性或对他们旨在解决的问题得出新的见解。我们的结果从三个主要方向进行组织:(i)我们提供系统的方法来解决分布测试问题。具体来说,我们提供了两个适用于广泛属性的通用算法框架,并为其中的许多属性提供了有效且接近最佳的结果。我们通过引入两种方法来证明分布测试中的信息理论下界,来补充这些知识,这使我们能够以一种干净统一的方式得出硬度结果。 (ii)我们引入并研究了两种访问未知分布的新模型,它们以不同的方式推广了标准采样模型,并允许测试算法显着提高效率。我们对这些模型中算法的功能和局限性的研究表明,这些算法如何在实际情况下可以导致更快的算法,并更好地理解标准采样设置中的潜在瓶颈。 (iii)然后,我们离开配电测试领域,探索与资产测试相邻的区域。我们定义了一种新的采样校正算法原语,在某种意义上,它位于分布学习和测试之间,旨在捕获数据来自不完善或嘈杂来源的设置。我们的工作着手以严格和抽象的方式对这些情况进行建模,以便能够开发出系统的方法来解决这些问题。

著录项

  • 作者

    Canonne, Clément L.;

  • 作者单位

    Columbia University.;

  • 授予单位 Columbia University.;
  • 学科 Computer science.;Statistics.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 318 p.
  • 总页数 318
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号