首页> 外文会议>Statistical Challenges in Modern Astronomy Conference >Challenges for Cluster Analysis in a Virtual Observatory
【24h】

Challenges for Cluster Analysis in a Virtual Observatory

机译:虚拟天文台中聚类分析的挑战

获取原文

摘要

There has been an unprecedented and continuing growth in the volume, quality, and complexity of astronomical data sets over the past few years, mainly through large digital sky surveys. Virtual Observatory (VO) concept represents a scientific and technological framework needed to cope with this data flood. We review some of the applied statistics and computing challenges posed by the analysis of large and complex data sets expected in the VO-based research. The challenges are driven both by the size and the complexity of the data sets (billions of data vectors in parameter spaces of tens or hundreds of dimensions), by the heterogeneity of the data and measurement errors, the selection effects and censored data, and by the intrinsic clustering properties (functional form, topology) of the data distribution in the parameter space of observed attributes. Examples of scientific questions one may wish to address include: objective determination of the numbers of object classes present in the data, and the membership probabilities for each source; searches for unusual, rare, or even new types of objects and phenomena; discovery of physically interesting multivariate correlations which may be present in some of the clusters; etc. This paper is followed by a commentary by statistician Dianne Cook.
机译:在过去几年中,天文数据集的数量,质量和复杂性具有前所未有的,持续的增长,主要是通过大型数字天空调查。虚拟天文台(VO)概念代表了应对这种数据洪水所需的科技框架。我们回顾了在基于VO的研究中预期的大型和复杂数据集分析所带来的一些应用统计和计算挑战。通过数据集的规模和复杂性(数十万个或数百个尺寸的参数空间中的数十亿个数据向量),通过数据和测量误差,选择效果和审查数据,数据集观察属性参数空间中数据分布的内在聚类属性(功能形式,拓扑)。人们希望解决的科学问题的例子包括:客观确定数据中存在的对象类的数量,以及每个来源的隶属概率;搜索异常,罕见,甚至新类型的物体和现象;发现物理有趣的多变量相关性,其可能存在于一些簇中;这篇论文随后是统计学家Dianne Cook的评论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号