Conference: International Conference on Informatics and Computing

Developing Indonesian corpus of pornography using simple NLP-text mining (NTM) approach to support government anti-pornography program

Abstract

Information technology and telecommunications have advanced with the development of the internet. With the emergence of the internet, pornography has become easy to obtain. Pornography is considered illegal in Indonesia because it is contrary to the country's prevailing laws. Pornography also has a harmful effect on minors; one indication is the number of underage children who have already had sexual intercourse. The approach taken is to collect the titles or content of pornographic videos found on a web page (URL). The method is semi-automatic, combining manual and automatic steps, and uses the K-Nearest Neighbor algorithm. K-Nearest Neighbor is one of the algorithms that can be used to implement classification; with it, data can be classified as pornographic or not. The tools used are web corp and the PHP programming language. The scope of the work is limited to building an Indonesian-language pornography corpus, with data taken from 10 pornography sites that can be accessed via smartphone.
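The abstract does not say how titles are represented or which distance measure and value of k were used. The following is only a minimal sketch of K-Nearest Neighbor classification of titles, written in Python for illustration rather than the PHP named above. It assumes bag-of-words vectors, cosine similarity, and similarity-weighted voting. The names `knn_classify` and `LABELED_TITLES`, the placeholder tokens `kata_x`, `kata_y`, `kata_z`, and the labels are all hypothetical and do not come from the paper.

```python
import math
from collections import Counter, defaultdict


def tokenize(title):
    """Lowercase a title and split it on whitespace."""
    return title.lower().split()


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_classify(title, labeled, k=3):
    """Label a title by a similarity-weighted vote of its k nearest neighbors."""
    query = Counter(tokenize(title))
    scored = sorted(
        ((cosine(query, Counter(tokenize(text))), label) for text, label in labeled),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = defaultdict(float)
    for similarity, label in scored[:k]:
        # Neighbors with zero similarity contribute nothing to the vote.
        votes[label] += similarity
    return max(votes, key=votes.get)


# Hypothetical, manually labeled titles; kata_x/kata_y/kata_z stand in
# for corpus keywords and are placeholders, not data from the paper.
LABELED_TITLES = [
    ("video kata_x kata_y terbaru", "porn"),
    ("kata_x kata_z full video", "porn"),
    ("kata_y kata_z gratis", "porn"),
    ("resep nasi goreng sederhana", "non-porn"),
    ("berita olahraga hari ini", "non-porn"),
    ("tutorial belajar php dasar", "non-porn"),
]

if __name__ == "__main__":
    print(knn_classify("kata_x kata_y gratis", LABELED_TITLES))
    print(knn_classify("resep nasi goreng pedas", LABELED_TITLES))
```

Weighting votes by similarity is a design choice made here so that neighbors sharing no words with the query cannot decide the label; a plain majority vote among the k nearest titles would be an equally valid reading of the abstract.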