首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >An embedded method for gene identification in heterogenous data involving unwanted heterogeneity
【24h】

An embedded method for gene identification in heterogenous data involving unwanted heterogeneity

机译:一种嵌入的基因鉴定在涉及不需要的异质性的异质数据中的基因鉴定方法

获取原文

摘要

The various ways of data collection for modern applications such as bioinformatics result in heterogeneous data, which presents challenges for traditional variable selection methods that assume data is independent and identically distributed. Existing statistical models accounting for unwanted variation can be applied for gene identification in heterogeneous genetic data, which however suffer from variable redundancy and also lack of predictability. To cope with that, we propose an embedded variable selection method for gene identification from a sparse learning perspective which is capable of accounting for unwanted heterogeneity blurring the true gene effects. Its performance is investigated by studying two different unsupervised and supervised gene identification problems in which the benchmark data samples are heterogeneous and collected with group structures. The results have demonstrated the superiority of our method over state-of-the art methods by effectively accounting for the unwanted heterogeneity in both cases.
机译:诸如生物信息学的现代应用的数据收集的各种方式导致异构数据,这提出了传统可变选择方法的挑战,该方法假设数据是独立的和相同分布的。占对不起变异的现有统计模型可以应用于异质遗传数据中的基因鉴定,然而遭受可变冗余并且缺乏可预测性。为了应对,我们提出了一种从稀疏学习透视的基因识别提出了一种嵌入的可变选择方法,该方法能够考虑对真正的基因效应模糊的不希望的异质性。通过研究两种不同的无监督和监督基因鉴定问题来研究其性能,其中基准数据样本是异质的并用群体结构收集。结果证明了我们通过有效地核对两种情况下的不希望的异质性的方法的方法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号