首页> 外国专利> INFORMATION PROCESSING APPARATUS, TEXT STRUCTURALIZATION PROGRAM, AND TEXT STRUCTURALIZATION METHOD

INFORMATION PROCESSING APPARATUS, TEXT STRUCTURALIZATION PROGRAM, AND TEXT STRUCTURALIZATION METHOD

机译:信息处理设备,文本结构化程序和文本结构化方法

摘要

To structuralize natural language data.SOLUTION: An information processing apparatus is provided with: an analysis data generation unit that divides text data into a plurality of terms and generates analysis data obtained by combining the plurality of terms with an anaphoric relation; a verb partition data generation unit that partitions the analysis data for every verb clause being a clause including a verb to generate verb partition data including a plurality of verb partition clauses, the verb partition clauses each including one verb term at its end; and a structuralized data generation unit that detects, from the verb partition data, a no anaphoric relation verb partition clause that is a verb partition clause not having the anaphoric relation with any one of N or less immediately following verb partition clauses, and combines the verb partition clauses other than the no anaphoric relation verb partition clause and a verb partition clause immediately thereafter without combining the no anaphoric relation verb partition clause and a verb partition clause immediately thereafter to generate structuralized data that is data obtained by dividing the analysis data immediately after a verb term included at the end of the no anaphoric relation verb partition clause.SELECTED DRAWING: Figure 11
机译:为了使自然语言数据构造出来一个动词分区数据生成单元,用于为每个动词子句分区的分析数据是包括动词以生成包括多个动词分区条款的动词分区数据的子句,动词分区条款在其结束时包括一个动词术语;和一个结构化数据生成单元,其从动词分区数据中检测到一个不受动词分区条款的无助链接性动词分区子句,该子句与任何一个与n或更少的任何一个以后的动词分区条款,并且组合动词除了不紧接无来的不组合无比视权关系动词分区子句和动词分区子句之外的不紧接无止的Paraphoric关系动词分区子句和动词分区子句的分区子句,以生成通过在a之后立即划分分析数据而获得的结构化数据Verb术语包含在无附加关系动词分区条件的末尾。选择绘图:图11

著录项

  • 公开/公告号JP2021125165A

    专利类型

  • 公开/公告日2021-08-30

    原文格式PDF

  • 申请/专利权人 KYOCERA DOCUMENT SOLUTIONS INC;

    申请/专利号JP20200020329

  • 发明设计人 SAKURAMOTO KENTARO;

    申请日2020-02-10

  • 分类号G06F40/295;G10L15;

  • 国家 JP

  • 入库时间 2022-08-24 22:20:07

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号