The Implementation of K-Means Clustering Method in Classifying Undergraduate Thesis Titles

机译：K-Means聚类方法在分类本科论文标题中的实施

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One of graduation requirements at university is completing undergraduate thesis. At Industrial Engineering Universitas Ahmad Dahlan, undergraduate thesis titles are documented by thesis coordinator. The problem is that students are less knowledgeable on thesis topics, so they do not really know the previous students' thesis topics. Based on the problem, this research aims at developing a program to classify thesis title so the knowledge on the trend of thesis title topic can be got The method used in this research was K-Means clustering, while range measurement method used was cosine similarity. The testing used Silhouette Coefficient method. The phases from text mining were tokenizing, filtering, stemming, similarity, classifying, testing. The result of this research is a program that can process the title data into trend group pattern of thesis title topic. From 138 data obtained, there are three clusters arranged based on the field on Industrial Engineering study program. Silhouette Coefficient testing shows score of 0.5674 that shows the clustering result is classified low. It occurs since the textual data of the thesis title is too widely distributed, so the title has relatively low similarity score.

机译：大学毕业要求之一是完成本科论文。在工业工程艾哈迈德大学大学上，本科论文标题由论文协调员记录。问题是学生在论文主题上不太了解，因此他们并不真正了解以前的学生的论文主题。基于该问题，本研究旨在开发一个程序来分类论文标题，因此可以获得关于论文标题主题的趋势的知识可以获得本研究中使用的方法是K-Means聚类，而使用范围测量方法是余弦相似性。测试使用剪影系数法。来自文本挖掘的阶段是令牌化，过滤，源，相似性，分类，测试。该研究的结果是一个程序，可以将标题数据处理到论文标题主题的趋势组模式。从获得的138个数据，基于工业工程研究计划的领域，有三个集群。剪影系数测试显示得分为0.5674，显示集群结果被归类为低。它发生自论文标题的文本数据太广泛分布，因此标题具有相对低的相似度分数。

著录项

来源
《International Conference on Telecommunication Systems Services and Applications》|2018年|223p|共4页
会议地点
作者
Lisna Zahrotun; Nila Hutami Putri; Arfiani Nur Khusna;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN91-53;
关键词
data mining; educational administrative data processing; educational courses; industrial engineering; pattern classification; pattern clustering; text analysis;

机译：数据挖掘;教育行政数据处理;教育课程;工业工程;模式分类;模式聚类;文本分析;

相似文献

外文文献
中文文献
专利

1. ANALYSIS AND IMPLEMENTATION OF ALGORITHM CLUSTERING AFFINITY PROPAGATION AND K-MEANS AT DATA STUDENT BASED ON GPA AND DURATION OF BACHELOR-THESIS COMPLETION [J] . R.REFIANTI, A.B. MUTIARA, A. JUARNA, Journal of Theoretical and Applied Information Technology . 2012,第1期

机译：基于GPA和学士学位论文完成时间的数据聚类亲和力传播算法和K均值的分析与实现
2. Implementation of Data Mining on Rice Imports by Major Country of Origin Using Algorithm Using K-Means Clustering Method [J] . Agus Perdana Windarto International Journal of Artificial Intelligence Research . 2017,第2期

机译：K-Means聚类算法在主要产地大米进口数据挖掘中的应用
3. Title of the thesis: Multi-sensor Remote Sensing Techniques to Manage Cambodian Forests for Implementation of REDD + policies [J] . 日本リモ—トセンシング学会志 . 2012,第2期

机译：论文标题：利用多传感器遥感技术管理柬埔寨森林以实施REDD +政策
4. The Implementation of K-Means Clustering Method in Classifying Undergraduate Thesis Titles [C] . Lisna Zahrotun, Nila Hutami Putri, Arfiani Nur Khusna International Conference on Telecommunication Systems, Services, and Applications . 2018

机译：K-Means聚类方法在本科论文标题分类中的实现
5. Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms [D] . Singh, Manisha . 2019

机译：K-Means和K-Means ++聚类算法的硬件实现和性能评估
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Classification Of Category Selection Title Undergraduate Thesis Using K-Nearest Neighbor Method [O] . Ratih Kumalasari Niswatin, Ardi Sanjaya 2017

机译：类别选择标题的分类使用k - 最近邻法研究本科论文
8. Algorithmic Transforms in the Implementation of K-Means Clustering onReconfigurable Hardware [R] . Estlick, M., Leeser, M., Szymanskii, J. J., 2000

机译：可重构硬件实现K均值聚类的算法变换

The Implementation of K-Means Clustering Method in Classifying Undergraduate Thesis Titles

摘要

著录项

相似文献

相关主题

期刊订阅