Classification of top-level domain (TLD) websites based on a known website classification

Abstract

Systems and methods for classification of web sites and/or their corresponding URLs based on a known web site classification are provided. According to one embodiment, a website URL is received that is known to be associated with a particular content classification. A list of candidate domain names including a host name of the website URL is generated based on a defined TLD list. For each of the candidate domain names it is determined whether an IP address of the candidate domain name is equal to an IP address of the website URL. When the result is affirmative, the particular content classification is associated with the candidate domain name; otherwise, a cosine similarity measurement process is performed between information associated with the candidate domain name and information associated with the website URL to determine whether to associate the particular content classification with the candidate domain name.
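
The decision flow in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the "information" compared is taken to be plain page text, the similarity threshold of 0.8 is arbitrary, the host name is assumed to be everything before the final dot (multi-label suffixes such as co.uk are not handled), and candidates that do not resolve are skipped. The names `candidate_domains`, `classify_candidates`, `resolve_ip`, and `fetch_info` are hypothetical; the last two are passed in as callables so the sketch runs without network access.

```python
import math
import re
from collections import Counter
from typing import Callable, Dict, Iterable, List, Optional
from urllib.parse import urlparse


def candidate_domains(url: str, tlds: Iterable[str]) -> List[str]:
    """Swap the final label of the URL's host name for each TLD in the list."""
    host = urlparse(url).hostname or ""
    base, _, original_tld = host.rpartition(".")
    if not base:
        return []
    # Assumption: the known site's own TLD is not a candidate.
    return [f"{base}.{tld}" for tld in tlds if tld != original_tld]


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between term-frequency vectors of two texts."""
    a = Counter(re.findall(r"\w+", text_a.lower()))
    b = Counter(re.findall(r"\w+", text_b.lower()))
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def classify_candidates(
    url: str,
    category: str,
    tlds: Iterable[str],
    resolve_ip: Callable[[str], Optional[str]],  # hypothetical DNS lookup
    fetch_info: Callable[[str], str],            # hypothetical page-text fetcher
    threshold: float = 0.8,                      # assumed cut-off, not from the source
) -> Dict[str, str]:
    """Propagate a known classification to same-name domains under other TLDs."""
    host = urlparse(url).hostname or ""
    known_ip = resolve_ip(host)
    known_info = fetch_info(host)
    labelled: Dict[str, str] = {}
    for candidate in candidate_domains(url, tlds):
        candidate_ip = resolve_ip(candidate)
        if candidate_ip is None:
            continue  # assumption: unresolved candidates are ignored
        if candidate_ip == known_ip:
            # Same IP address: inherit the classification directly.
            labelled[candidate] = category
        elif cosine_similarity(fetch_info(candidate), known_info) >= threshold:
            # Different IP address: inherit only if the content is similar enough.
            labelled[candidate] = category
    return labelled
```

In practice `resolve_ip` might wrap `socket.gethostbyname` and `fetch_info` might download and strip the landing page; both choices are left open here because the abstract does not specify them.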

Bibliographic data

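  • Publication number: US10148700B2
  • Patent type: (not listed)
  • Publication date: 2018-12-04
  • Original format: PDF
  • Applicant/Assignee: FORTINET INC.
  • Application number: US201615199492
  • Filing date: 2016-06-30
  • Classification: H04L29/06; H04L29/12
  • Country: US
  • Date indexed: 2022-08-21 12:07:21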
