'The 100 Most Influential Persons in History': A Data Mining Perspective

机译：“历史上100人最有影响力的人”：一种数据挖掘观点

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data mining has been widely applied in various domains, however, there have been limited studies into discovering hidden knowledge from factual data about selected groups of people with special characteristics. It is important to mine data about such group of individuals to extract insightful knowledge that could lead to a better understanding of their personalities, in addition to further sociological conclusions. This paper presents the application and outcome of data mining techniques, namely data clustering and association rules extraction, to find common features and relations among social, environmental and socioeconomic factors from the lives of known influential individuals in history. The mining process was initiated by constructing a dataset through defining, extracting, and retrieving important known facts about these individuals from selected and reliable sources. Second, association rules discovery algorithms were applied in order to show interesting patterns and highlight relations between attributes. Finally, the data were clustered into different groups and each cluster was further analyzed to identify its most strongly defining attributes. The extracted association rules showed how some factors are related, such as the effect of environment type and order of birth in the family on the age at which the individual first engaged with their domain of influence. The clustering exercise demonstrated that influential people who grew up in families of a similar size and financial status share many similar characteristics.

机译：数据挖掘已广泛应用于各个领域，然而，研究了从有关特殊特征的所选人群的事实数据中发现隐藏知识的有限研究。除了进一步的社会学结论之外，还可以提取有关这些人的数据，以提取可能导致他们的个性化更好地了解其个性的洞察力。本文介绍了数据挖掘技术的应用和结果，即数据集群和关联规则提取，寻求社会，环境和社会经济因素的共同特征和关系，从历史上的已知有影响力的人的生命。通过通过定义，提取和检索来自所选和可靠的来源的这些个人的重要已知事实来构造数据集来启动挖掘过程。其次，应用关联规则发现算法以显示有趣的模式并突出属性之间的关系。最后，将数据群集为不同的组，并进一步分析每个群集以识别其最强烈的定义属性。提取的关联规则显示了一些因素有关的关系，例如家庭在个人首次与其域名领域与其领域接触的年龄的家庭类型和生育顺序的影响。聚类练习表明，在相似规模和财务状况的家庭中长大的有影响力的人分享了许多相似的特征。

著录项

来源
《IEEE International Conference on Data Mining Workshops》|2011年||共6页
会议地点
作者
Al-Naimi Noora Mohammad; Shaban Khaled Bashir;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274.2-53;
关键词
association rules extraction; data clustering; mining of socioeconomic data; the 100 most influential people;

机译：关联规则提取;数据聚类;社会经济数据的挖掘;100最具影响力的人;

相似文献

外文文献
中文文献
专利

1. Tutorial on practical tips of the most influential data preprocessing algorithms in data mining [J] . Garcia Salvador, Luengo Julian, Herrera Francisco Knowledge-Based Systems . 2016,第Apra15期

机译：数据挖掘中最有影响力的数据预处理算法的实用技巧教程
2. Data mining and influential analysis of gene expression data for plant resistance genes identification in tomato (Solanum lycopersicum) [J] . Torres-Avilés Francisco, Romeo José S., López-Kleine Liliana Electronic Journal of Biotechnology . 2014,第2期

机译：基因表达数据的数据挖掘及对番茄抗性基因鉴定的影响分析（Solanum lycopersicum）
3. PIB: Profiling Influential Blogger in Online Social Networks, A Knowledge Driven Data Mining Approach [J] . G.U. Vasanthakumar, Bagul Prajakta, P. Deepa Shenoy, Procedia Computer Science . 2015,第1期

机译：PIB：在线社交网络中有影响力的Blogger分析，这是一种知识驱动的数据挖掘方法
4. "The 100 Most Influential Persons in History": A Data Mining Perspective [C] . Al-Naimi Noora Mohammad, Shaban Khaled Bashir 11th IEEE International Conference on Data Mining Workshops . 2011

机译：“历史上最具影响力的100个人”：数据挖掘的视角
5. Recovery: A novel data mining approach from mining engineering perspectives. [D] . Liu, Dong. 2005

机译：恢复：从挖掘工程的角度来看，一种新颖的数据挖掘方法。
6. Everything that looks good ain’t good!: Perspectives on Urban Redevelopment among Persons with a History of Injection Drug Use in Baltimore Maryland [O] . Sabriya L. Linton, Caitlin E. Kennedy, Carl A. Latkin, -1

机译：一切看起来都不好！：马里兰州巴尔的摩市有注射吸毒史的人对城市重建的看法
7. Data mining and influential analysis of gene expression data for plant resistance gene identification in tomato (Solanum lycopersicum) [O] . Torres-Avilés Francisco, Romeo José S., López-Kleine Liliana 2014

机译：番茄植物抗性基因鉴定的基因表达数据的数据挖掘和影响力分析

'The 100 Most Influential Persons in History': A Data Mining Perspective

摘要

著录项

相似文献

相关主题

期刊订阅