Unsupervised Fact Checking by Counter-Weighted Positive and Negative Evidential Paths in A Knowledge Graph

机译：通过对知识图中的反加权正和负证据路径检查无监督的事实

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Misinformation spreads across media, community, and knowledge graphs in the Web by not only human agents but also information extraction algorithms that extract factual statements from unstructured textual data to populate the existing knowledge graphs. Traditional fact checking by experts or crowds is increasingly difficult to keep pace with the volume of newly created misinformation in the Web. Therefore, it is important and necessary to enhance the computational ability to determine whether a given factual statement is truthful or not. We view this problem as a truth scoring task in a knowledge graph. We present a novel rule-based approach that finds positive and negative evidential paths in a knowledge graph for a given factual statement, and calculates a truth score for the given statement by unsupervised ensemble of the found positive and negative evidential paths. For example, we can determine the factual statement "United States is the birth place of Barack Obama" as truthful if there is the positive evidential path (Barack Obama, birthplace, Hawaii) Λ (Hawaii, country, United States) in a knowledge graph. For another example, we can determine the factual statement "Canada is the nationality of Barack Obama" as untruthful if there is the negative evidential path (Barack Obama, nationality, United States) Λ (United States, ≠, Canada) in a knowledge graph. For evaluating on a real-world situation, we constructed an evaluation dataset by labeling truth or untruth label on factual statements that were extracted from Wikipedia texts by using the state-of-the-art BERT-based information extraction system. Our evaluation results show that our approach outperforms the state-of-the-art unsupervised approaches significantly by up to 0.12 AUC-ROC and even outperforms the supervised approach by up to 0.05 AUC-ROC not only in our dataset but also in the two different standard datasets.

机译：错误信息不仅是人类代理的媒体，社区和知识图表，还可以通过人类代理传播，而且还可以从非结构化文本数据中提取事实语句的信息提取算法填充现有知识图。通过专家或人群检查的传统事实越来越难以与网络中的新创造的误导的数量保持同步。因此，增强了确定给定的事实陈述是否真实的计算能力是重要的，必要的。我们将此问题视为知识图中的真实评分任务。我们提出了一种基于规则的基于规则的方法，为特定的事实声明找到了知识图中的积极和负证据路径，并通过发现正面和负证据路径的无监督集合来计算给定陈述的真实性评分。例如，我们可以确定“美国是巴拉克奥巴马的出生地”，如有真实的证据（巴拉克奥巴马，出生地，夏威夷）λ（夏威夷，国家，美国）在知识图表中。另一个例子，我们可以确定“加拿大是巴拉克奥巴马的国籍”，如果有证知识图中的负数证据道路（巴拉克奥巴马，加拿大），那么“加拿大是巴拉克奥巴马的国籍”是不诚实的。为了评估真实世界的情况，我们通过使用最先进的BERT基信息提取系统来标记从维基百科文本提取的事实陈述的真实语句构建了评估数据集。我们的评价结果表明，我们的方法优于最先进的无监督方法，明显高达0.12 AUC-ROC，甚至优于监督方法，不仅在我们的数据集中最多0.05 AUC-ROC，而且在两种不同中标准数据集。

著录项

来源
《International Conference on Computational Linguistics》|2020年|1677-1686|共10页
会议地点
作者
Jiseong Kim; Key-Sun Choi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Discriminative predicate path mining for fact checking in knowledge graphs [J] . Shi Baoxu, Weninger Tim Knowledge-Based Systems . 2016,第jula期

机译：区分性谓词路径挖掘，用于知识图中的事实检查
2. LABELED SHORTEST PATHS IN DIGRAPHS WITH NEGATIVE AND POSITIVE EDGE WEIGHTS [J] . Phillip G. Bradford, David A. Thomas RAIRO Theoretical Informatics and Applications . 2009,第3期

机译：负和正边权重的图形中最短的路径
3. Fact Checking in Knowledge Graphs with Ontological Subgraph Patterns [J] . Peng Lin, Qi Song, Yinghui Wu Data Science and Engineering . 2018,第4期

机译：具有本体子图模式的知识图中的事实检查
4. On path cover problems with positive and negative constraints for graph-based multitarget tracking [C] . Lingji Chen, Ravi Ravichandran International Conference on Information Fusion . 2016

机译：基于图的多目标跟踪具有正负约束的路径覆盖问题
5. A search for physics beyond the standard model through the three-body rare and forbidden charm decays positive D meson, positive strange D meson decaying to positive kaon positive muon negative muon, negative kaon positive muon positive muon, positive pion positive muon negative muon, negative pion positive muon positive muon, positive muon positive muon negative muon [D] . Engh, Daniel James 2002

机译：通过三体稀有和禁止的魅力来寻找超出标准模型的物理场，将衰减正D介子，正奇怪D介子衰减为正kaon正muon负muon，负kaon正muon正muon，正pion正muon负muon，负介子阳性介子阳性介子阳性介子阳性介子阴性介子
6. Consistency checks to improve measurement with the Positive and Negative Syndrome Scale (PANSS) [O] . Jonathan Rabinowitz, Nina R Schooler, Ariana Anderson, -1

机译：进行一致性检查以改善正负综合症量表（PANSS）的测量
7. Figure 1: (A) Example of a text-based forma mentis network. A TFMN can be represented either as an edge-coloured graph or as a multiplex network. Positive (negative) words are highlighted in cyan (red). Neutral words are in black. Syntactic links between positive (negative) words are highlighted in cyan (red) too. Syntactic links between positive and negative concepts are in purple. All semantic links of meaning overlap are highlighted in green. (B) Infographics about how a TFMN is assembled. Individuals organise their knowledge and emotional perception of the real world in their mental lexicon (comic clouds). [O] . -1

机译：图1：（a）基于文本的Forma Mentis网络示例。 TFMN可以用作边缘彩色图形或作为多路复用网络表示。在青色（红色）突出显示正（负）单词。中立词是黑色的。在青色（红色）突出显示正（否定）单词之间的句法链接。正面和消极概念之间的句法链接在紫色。含义重叠的所有语义链接都以绿色突出显示。（b）关于TFMN如何组装的信息图表。个人在他们的精神词典（漫画云）中对现实世界组织了他们的知识和情感感知。

Unsupervised Fact Checking by Counter-Weighted Positive and Negative Evidential Paths in A Knowledge Graph

摘要

著录项

相似文献

相关主题

期刊订阅