Why Adaptively Collected Data Have Negative Bias and How to Correct for It

Xinkun Nie; Xiaoying Tian; Jonathan Taylor; James Zou


Abstract

From scientific experiments to online A/B testing, previously observed data often affect how future experiments are performed, which in turn affects which data will be collected. Such adaptivity introduces complex correlations between the data and the collection procedure. In this paper, we prove that when the data collection procedure satisfies natural conditions, the sample means of the data have systematic negative biases. As an example, consider an adaptive clinical trial in which additional data points are more likely to be collected for treatments that show initial promise. Our surprising result implies that the average observed treatment effects would underestimate the true effects of each treatment. We quantitatively analyze the magnitude and behavior of this negative bias in a variety of settings. We also propose a novel debiasing algorithm based on selective inference techniques. In experiments, our method can effectively reduce bias and estimation error.
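
The negative bias described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code or their debiasing algorithm; it assumes a hypothetical two-arm setting with unit-variance Gaussian outcomes and a purely greedy collection rule (always sample the arm with the higher current sample mean), and the names run_greedy and estimate_bias are made up for illustration. It averages each arm's final sample mean over many repetitions and subtracts the true mean.

```python
import random


def run_greedy(true_means, horizon, rng):
    """Run one greedy experiment and return each arm's final sample mean."""
    k = len(true_means)
    # One initial observation per arm (assumed warm-up, not from the paper).
    sums = [rng.gauss(mu, 1.0) for mu in true_means]
    counts = [1] * k
    for _ in range(horizon - k):
        # Adaptive step: sample the arm whose current sample mean is highest.
        arm = max(range(k), key=lambda a: sums[a] / counts[a])
        sums[arm] += rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
    return [s / c for s, c in zip(sums, counts)]


def estimate_bias(true_means=(0.0, 0.0), horizon=20, n_reps=4000, seed=0):
    """Monte Carlo estimate of E[sample mean] - true mean for each arm."""
    rng = random.Random(seed)
    totals = [0.0] * len(true_means)
    for _ in range(n_reps):
        for arm, mean in enumerate(run_greedy(true_means, horizon, rng)):
            totals[arm] += mean
    return [t / n_reps - mu for t, mu in zip(totals, true_means)]


if __name__ == "__main__":
    # Each entry is the estimated bias of one arm's sample mean.
    print(estimate_bias())
```

Under these assumptions, an arm that starts with an unluckily low observation is rarely sampled again, so its low sample mean is never corrected, while an arm that starts high keeps being sampled and regresses toward its true mean. Averaged over repetitions, this asymmetry pulls the expected sample mean below the true value for both arms.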


Bibliographic record

Source: JMLR: Workshop and Conference Proceedings | 2018, issue 2011 | 9 pages
Authors: Xinkun Nie; Xiaoying Tian; Jonathan Taylor; James Zou
Original format: PDF

