Conference: IEEE International Conference on Fuzzy Systems

A Computational Linguistic Approach for the Identification of Translator Stylometry using Arabic-English Text



Abstract

Translator stylometry is a small but growing area of research in computational linguistics. Although the wider field of authorship attribution using computational linguistic techniques has produced a large body of research, the translator stylometry problem is more challenging and the literature on it remains sparse. Some authors have even claimed that the problem has no solution, a claim we challenge in this paper. We present a novel set of translator stylometric features that can be used as signatures to detect and identify translators. The features are based on network motifs: small local subgraphs that have been used successfully to characterize global network dynamics. The text is transformed into a network in which words become nodes and adjacency between words in a sentence is represented by links. Motifs of size 3 are then extracted from this network, and their distribution is used as a signature for the corresponding translator. We investigate the impact of sample size, normalization method, and dataset imbalance on classification accuracy. Among the classifiers we evaluate, the Fuzzy Lattice Reasoning Classifier (FLR) achieved the best performance, with classification accuracy reaching the 70% mark.
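The feature-extraction idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say whether links are directed, how text is tokenized, or which normalization is used. The sketch assumes directed links from each word to the next, whitespace tokenization, and relative-frequency normalization. All function names (`build_word_graph`, `connected_triples`, `motif_code`, `motif_signature`) are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations, permutations


def build_word_graph(sentences):
    """Directed edges from each word to the word that follows it (assumption)."""
    edges = set()
    for sentence in sentences:
        words = sentence.lower().split()  # naive whitespace tokenizer (assumption)
        for a, b in zip(words, words[1:]):
            if a != b:  # skip self-loops caused by immediately repeated words
                edges.add((a, b))
    return edges


def connected_triples(edges):
    """All node triples that are connected when edge direction is ignored."""
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)
    triples = set()
    for centre, adj in neighbours.items():
        # Any two neighbours of a common centre form a connected triple with it.
        for u, w in combinations(sorted(adj), 2):
            triples.add(frozenset((centre, u, w)))
    return triples


def motif_code(triple, edges):
    """Canonical label of the directed subgraph induced by three nodes.

    The label is the smallest edge-presence tuple over all six node orderings,
    so isomorphic subgraphs receive the same label.
    """
    codes = []
    for a, b, c in permutations(triple):
        pairs = ((a, b), (a, c), (b, a), (b, c), (c, a), (c, b))
        codes.append(tuple(int(p in edges) for p in pairs))
    return min(codes)


def motif_signature(sentences):
    """Relative frequency of each size-3 motif type in the word network."""
    edges = build_word_graph(sentences)
    counts = Counter(motif_code(t, edges) for t in connected_triples(edges))
    total = sum(counts.values())
    return {code: n / total for code, n in counts.items()} if total else {}


if __name__ == "__main__":
    sample = ["the cat sat on the mat", "the dog sat on the rug"]
    for code, share in sorted(motif_signature(sample).items()):
        print(code, round(share, 3))
```

The resulting dictionary is one possible "signature" vector for a text sample; signatures from texts by different translators could then be passed to any classifier. Enumerating all connected triples is adequate for short samples but grows quickly around high-degree words, so a larger corpus would need a dedicated motif-counting tool.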
