首页> 外文学位 >Analysis of high-throughput biological data: Some statistical problems in RNA-seq and mouse genotyping.
【24h】

Analysis of high-throughput biological data: Some statistical problems in RNA-seq and mouse genotyping.

机译:高通量生物学数据分析:RNA-seq和小鼠基因分型中的一些统计问题。

获取原文
获取原文并翻译 | 示例

摘要

The many areas of research of high-throughput computational biology provide endless opportunities for methodological contributions by statisticians. In this thesis, we present results in two main areas, one just emerging and one well-established.;In Part I of this thesis, we present new results related to the analysis of high-throughput sequencing data. The last year or so has seen the emergence of many new technologies aimed at enabling the massively parallel sequencing of many molecules of DNA simultaneously. This technological leap forward has enabled scientists to conduct exciting experiments that were impossible with previous technologies, and statisticians are being flooded with new data to analyze. We focus on two analytical problems related to new short-read sequencing technologies, each aimed at a different aspect of the goal of quantifying gene expression using sequencing. First, we present a new method aimed at determining which gene a particular sequence fragment originated from, in order to obtain better unbiased estimates of gene expression. Second, we develop a new empirical Bayes test statistic aimed at measuring differential gene expression between two samples which have been sequenced. Both problems combine fundamental statistical concepts with cutting-edge biology research.;Part II of this thesis focuses on genetic analysis of the mouse model organism, a more established area of both biological and statistical inquiry. We present an analysis of the performance of a high-throughput microarray in measuring genotype information in a pooled set of mice, for the purposes of detecting a disease-carrying mutation locus. This problem combines relatively new technological advances with classical theories of linkage analysis.
机译:高通量计算生物学的许多研究领域为统计学家的方法学贡献提供了无限的机会。在本文中,我们从两个主要领域介绍了结果,一个是新兴领域,一个是公认的领域。本论文的第一部分,我们提出了与高通量测序数据分析相关的新结果。大约一年左右的时间里,出现了许多旨在同时对许多DNA分子进行大规模并行测序的新技术。这一技术上的飞跃使科学家能够进行激动人心的实验,而这是以前的技术无法实现的,统计学家正被大量新数据进行分析。我们专注于与新的短读测序技术有关的两个分析问题,每个问题都针对使用测序量化基因表达的目标的不同方面。首先,我们提出了一种旨在确定特定序列片段源自哪个基因的新方法,以便获得更好的基因表达无偏估计。其次,我们开发了一种新的经验贝叶斯检验统计量,旨在测量已测序的两个样品之间的差异基因表达。这两个问题都将基本的统计概念与最新的生物学研究相结合。本论文的第二部分着重于对小鼠模型生物的遗传分析,这是生物学和统计研究领域中更为成熟的领域。我们提出了一种高通量微阵列在测量一组小鼠的基因型信息中的性能的分析,目的是检测携带疾病的突变基因座。该问题将相对较新的技术进步与经典的链接分析理论相结合。

著录项

  • 作者

    Taub, Margaret Anne.;

  • 作者单位

    University of California, Berkeley.;

  • 授予单位 University of California, Berkeley.;
  • 学科 Statistics.;Biology Bioinformatics.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 114 p.
  • 总页数 114
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号