A machine learning system for extracting structured records from web pages and other text sources
Abstract
A method is described for extracting a structured record (190) from a document (100). The structured record includes information related to a predetermined subject matter (120), and this information is organized into categories within the structured record. The method comprises the steps of identifying a span of text (130) in the document (100) according to criteria associated with the predetermined subject matter, and processing (150) the span of text to extract from the document (100) at least one text element associated with at least one of the categories of the structured record (190).
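The two steps in the abstract (identify a span, then process it into categorized text elements) can be illustrated with a minimal sketch. The abstract does not say which criteria or models are used, so everything concrete here is an assumption made for illustration: the subject matter (job postings), the cue words, the category names, and the use of keyword matching and regular expressions in place of any learned model.

```python
import re
from typing import Dict, List

# Hypothetical subject matter: job postings. These cue words stand in for
# the "criteria associated with the predetermined subject matter".
SUBJECT_CUES = ("vacancy", "hiring")

# Hypothetical categories of the structured record, each with a simple
# pattern. A real system would likely use trained extractors instead.
CATEGORY_PATTERNS = {
    "title": re.compile(r"Position:\s*(.+)"),
    "location": re.compile(r"Location:\s*(.+)"),
    "salary": re.compile(r"Salary:\s*(.+)"),
}

SAMPLE_DOCUMENT = (
    "About us\nWe build software.\n\n"
    "Vacancy\nPosition: Data Engineer\nLocation: Sydney\nSalary: 90000 AUD\n\n"
    "Contact: info at example dot org"
)


def identify_spans(document: str) -> List[str]:
    """Step 1: return blank-line-separated blocks that mention a cue word."""
    blocks = [b.strip() for b in document.split("\n\n") if b.strip()]
    return [b for b in blocks if any(cue in b.lower() for cue in SUBJECT_CUES)]


def extract_record(span: str) -> Dict[str, str]:
    """Step 2: pull one text element per category out of a single span."""
    record: Dict[str, str] = {}
    for category, pattern in CATEGORY_PATTERNS.items():
        match = pattern.search(span)
        if match:
            record[category] = match.group(1).strip()
    return record


def extract_records(document: str) -> List[Dict[str, str]]:
    """Run both steps and return one structured record per identified span."""
    return [extract_record(span) for span in identify_spans(document)]


if __name__ == "__main__":
    for record in extract_records(SAMPLE_DOCUMENT):
        print(record)
```

Separating span identification from element extraction mirrors the abstract's two steps: the second step only ever sees text already judged relevant to the subject matter, so each stage can be replaced independently, for instance by a trained classifier and a trained sequence labeler.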