首页> 外文会议>Chinese lexical semantics workshop >Latent Semantic Distance Between Chinese Basic Words and Non-basic Words
【24h】

Latent Semantic Distance Between Chinese Basic Words and Non-basic Words

机译:汉语基本单词与非基本单词之间的潜在语义距离

获取原文

摘要

What determines the "basicness" of words still remains a challenging question in creating basic lexicons and basic wordlists. Since frequency and dispersion seem to be the most dominant criteria, it is questioned that whether contextual factors also help to define the concept of "basicness." From the perspective of the distributional model, meanings are represented through the interaction between words and their contexts. Hence, this research aims to examine an existing wordlist and tentatively take it as the standard of "basicness," trying to seek the differences between "basic words" and "non-basic words" based on their occurrences in different texts. Two experiments were conducted to answer the research questions. The first calculated the "latent semantic distances" between basic words and non-basic words. The second calculated and examined the "near neighbors" of basic word and non-basic words. It has been discovered that basic words tend to occur in more similar texts than non-basic words do; in addition, the near neighbors of basic words tend to be more "basic", too. This research contributes to providing a more "contextual" perspective in exploring "basicness."
机译:在创建基本词典和基本单词表时,决定单词“基本性”的问题仍然是一个具有挑战性的问题。由于频率和分散似乎是最主要的标准,因此有人质疑上下文因素是否也有助于定义“基本性”的概念。从分布模型的角度来看,意义是通过单词及其上下文之间的交互来表示的。因此,本研究旨在检查现有的单词表,并尝试将其作为“基本性”的标准,试图根据它们在不同文本中的出现来寻找“基本单词”和“非基本单词”之间的差异。进行了两个实验以回答研究问题。首先计算基本单词和非基本单词之间的“潜在语义距离”。第二种计算并检查了基本单词和非基本单词的“近邻”。已经发现,与非基本单词相比,基本单词倾向于出现在更多相似的文本中。另外,基本单词的近邻也趋向于更“基本”。这项研究有助于在探索“基本性”方面提供更“上下文相关”的观点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号