首页>
外国专利>
METHOD AND SYSTEM FOR EXTRACTING AND CHARACTERIZING RELATIONSHIPS BETWEEN ENTITIES MENTIONED IN DOCUMENTS
METHOD AND SYSTEM FOR EXTRACTING AND CHARACTERIZING RELATIONSHIPS BETWEEN ENTITIES MENTIONED IN DOCUMENTS
展开▼
机译:提取和表征文档中提到的实体之间的关系的方法和系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods and devices for use in gathering and analyzing data from a corpus of documents. A corpus of documents is initially scanned for words that qualify as entities according to user defined criteria. Multiple counters track the number of documents which mention specific entities. A database of entities mentioned in the documents is maintained and an entry for each entity in the corpus is placed in the entity database. The results are then presented to a user in a spiral form with the most important entity at the center of the spiral. The importance of an entity may be determined by either how many entities it is connected to or how many documents mention that entity. A connection exists between two entities if they are both mentioned in at least one document and the more documents mention two specific entities at the same time, the stronger the connection between those two specific entities. The result presentation to the user is capable of also visually representing connections between entities by connecting connected entities with lines. The strength of a connection can also be represented with the width of the line connecting two entities.
展开▼