【2h】

A self-organizing principle for learning nonlinear manifolds

机译:学习非线性流形的一种自组织原理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Modern science confronts us with massive amounts of data: expression profiles of thousands of human genes, multimedia documents, subjective judgments on consumer products or political candidates, trade indices, global climate patterns, etc. These data are often highly structured, but that structure is hidden in a complex set of relationships or high-dimensional abstractions. Here we present a self-organizing algorithm for embedding a set of related observations into a low-dimensional space that preserves the intrinsic dimensionality and metric structure of the data. The embedding is carried out by using an iterative pairwise refinement strategy that attempts to preserve local geometry while maintaining a minimum separation between distant objects. In effect, the method views the proximities between remote objects as lower bounds of their true geodesic distances and uses them as a means to impose global structure. Unlike previous approaches, our method can reveal the underlying geometry of the manifold without intensive nearest-neighbor or shortest-path computations and can reproduce the true geodesic distances of the data points in the low-dimensional embedding without requiring that these distances be estimated from the data sample. More importantly, the method is found to scale linearly with the number of points and can be applied to very large data sets that are intractable by conventional embedding procedures.
机译:现代科学面临着大量数据:数千种人类基因的表达谱,多媒体文件,对消费产品或政治候选人的主观判断,贸易指数,全球气候格局等。这些数据通常是高度结构化的,但这种结构是隐藏在一组复杂的关系或高维抽象中。在这里,我们提出了一种自组织算法,用于将一组相关观测值嵌入到低维空间中,该空间保留了数据的固有维数和度量结构。嵌入是通过使用迭代的成对精炼策略来执行的,该策略试图保留局部几何图形,同时保持远距离对象之间的最小间隔。实际上,该方法将远程对象之间的邻近视为其真实测地距离的下限,并将其用作施加全局结构的手段。与以前的方法不同,我们的方法无需进行密集的最近邻计算或最短路径计算即可显示流形的基本几何形状,并且可以在低维嵌入中重现数据点的真实测地距离,而无需根据数据样本。更重要的是,发现该方法与点的数量成线性比例,并且可以应用于非常大的数据集,而这些数据集是常规嵌入程序难以处理的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号