Conference: Mexican International Conference on Artificial Intelligence

Distributions of Functional and Content Words Differ Radically

Abstract

We consider the statistical properties of prepositions, the most numerous and important functional words in European languages. Typically, they syntactically link verbs and nouns to nouns. We show that their rank distributions in Russian differ radically from those of content words, being much more compact. The Zipf law commonly used for content words fails for prepositions, so approximations that are flatter at the first ranks and steeper at higher ranks are needed. For this purpose, the Mandelbrot family and an expo-logarithmic family of distributions are tested, and the difference between the two least-squares approximations proves insignificant. The first dozen ranks are shown to cover more than 80% of all preposition occurrences in a database of Russian collocations of the Verb-Preposition-Noun and Noun-Preposition-Noun types, leaving hardly any room for the remaining two hundred available Russian prepositions.
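The kind of comparison the abstract describes can be illustrated with a minimal sketch. It assumes the usual Zipf-Mandelbrot form f(r) = C / (r + b)^a, which reduces to the Zipf law at b = 0, and fits it by least squares in log space with a simple grid search over b. The function names, the grid, and the synthetic frequencies are all hypothetical and chosen for illustration only; the abstract does not specify the authors' fitting procedure, and no data from the paper is used here.

```python
import math

def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x; also returns the squared error."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return intercept, slope, sse

def fit_zipf_mandelbrot(freqs, b_grid):
    """Fit f(r) = C / (r + b)**a in log space.

    For each candidate shift b, log f is linear in log(r + b), so C and a
    follow from a straight-line fit; the b with the smallest error wins.
    """
    log_f = [math.log(f) for f in freqs]
    best = None
    for b in b_grid:
        log_r = [math.log(r + b) for r in range(1, len(freqs) + 1)]
        intercept, slope, sse = fit_line(log_r, log_f)
        if best is None or sse < best["sse"]:
            best = {"C": math.exp(intercept), "a": -slope, "b": b, "sse": sse}
    return best

def coverage(freqs, top_k):
    """Share of all occurrences covered by the top_k most frequent ranks."""
    return sum(freqs[:top_k]) / sum(freqs)

# Synthetic rank-frequency data (NOT from the paper): an exact
# Zipf-Mandelbrot curve with made-up parameters over 200 ranks.
true_C, true_a, true_b = 1000.0, 2.5, 4.0
freqs = [true_C / (r + true_b) ** true_a for r in range(1, 201)]

b_grid = [i * 0.5 for i in range(21)]          # candidate shifts 0.0 .. 10.0
zm = fit_zipf_mandelbrot(freqs, b_grid)        # Mandelbrot family
zipf = fit_zipf_mandelbrot(freqs, [0.0])       # b = 0 is the plain Zipf law

print("Zipf-Mandelbrot fit:", zm)
print("Zipf fit:", zipf)
print("Share covered by first 12 ranks:", coverage(freqs, 12))
```

On data of this shape the shifted fit has a much smaller residual than the pure Zipf fit, which mirrors the abstract's point that a curve flatter at the first ranks is needed. Fitting in log space weights low-frequency ranks more heavily than a fit on raw counts would, so a different error criterion could rank the candidate families differently.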
