首页> 外文会议>International Conference on Telecommunication Systems Services and Applications >The Implementation of K-Means Clustering Method in Classifying Undergraduate Thesis Titles
【24h】

The Implementation of K-Means Clustering Method in Classifying Undergraduate Thesis Titles

机译:K-Means聚类方法在分类本科论文标题中的实施

获取原文

摘要

One of graduation requirements at university is completing undergraduate thesis. At Industrial Engineering Universitas Ahmad Dahlan, undergraduate thesis titles are documented by thesis coordinator. The problem is that students are less knowledgeable on thesis topics, so they do not really know the previous students' thesis topics. Based on the problem, this research aims at developing a program to classify thesis title so the knowledge on the trend of thesis title topic can be got The method used in this research was K-Means clustering, while range measurement method used was cosine similarity. The testing used Silhouette Coefficient method. The phases from text mining were tokenizing, filtering, stemming, similarity, classifying, testing. The result of this research is a program that can process the title data into trend group pattern of thesis title topic. From 138 data obtained, there are three clusters arranged based on the field on Industrial Engineering study program. Silhouette Coefficient testing shows score of 0.5674 that shows the clustering result is classified low. It occurs since the textual data of the thesis title is too widely distributed, so the title has relatively low similarity score.
机译:大学毕业要求之一是完成本科论文。在工业工程艾哈迈德大学大学上,本科论文标题由论文协调员记录。问题是学生在论文主题上不太了解,因此他们并不真正了解以前的学生的论文主题。基于该问题,本研究旨在开发一个程序来分类论文标题,因此可以获得关于论文标题主题的趋势的知识可以获得本研究中使用的方法是K-Means聚类,而使用范围测量方法是余弦相似性。测试使用剪影系数法。来自文本挖掘的阶段是令牌化,过滤,源,相似性,分类,测试。该研究的结果是一个程序,可以将标题数据处理到论文标题主题的趋势组模式。从获得的138个数据,基于工业工程研究计划的领域,有三个集群。剪影系数测试显示得分为0.5674,显示集群结果被归类为低。它发生自论文标题的文本数据太广泛分布,因此标题具有相对低的相似度分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号