首页>
外国专利>
TEXT ANALYSIS USING LINGUISTIC AND NON-LINGUISTIC LISTS PROPERTIES
TEXT ANALYSIS USING LINGUISTIC AND NON-LINGUISTIC LISTS PROPERTIES
展开▼
机译:使用语言和非语言列表属性进行文本分析
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and method are described for extracting information from text, which can be done without prior knowledge that the text includes a list. The method applies analysis rules (S102) to a sentence extending on lines of text (S104) to identify a set of candidate list items in the sentence (S108). Each candidate list item is assigned a set of features including one or more non-linguistic features and a language feature (S108). The linguistic feature defines a syntactic function of an item of the candidate list item that is likely to be in dependency relationship with an item of a candidate list presenter identified in the same sentence (S108). When two or more candidate list items are found with compatible feature sets (S114, S120), a list is generated (S118) that binds them as list items of a common list presenter. Dependency relationships are retrieved between the list presenter and the list items (S122) and information based on the extracted dependency relationships is outputted (S124).
展开▼