...
首页> 外文期刊>Emerging Topics in Computing, IEEE Transactions on >Multilevel Graph-Based Decision Making in Big Scholarly Data: An Approach to Identify Expert Reviewer, Finding Quality Impact Factor, Ranking Journals and Researchers
【24h】

Multilevel Graph-Based Decision Making in Big Scholarly Data: An Approach to Identify Expert Reviewer, Finding Quality Impact Factor, Ranking Journals and Researchers

机译:基于多级图形的基于格子数据的决策:一种识别专家评审员的方法,寻找质量影响因素,排名期刊和研究人员

获取原文
获取原文并翻译 | 示例
           

摘要

Digital libraries, such as conference papers, journal documents, books and thesis, research patents, and experiments generate a vast amount of data, named as, Scholarly Big Data. It covers scholarly related information for both researcher's perspective as well as publisher's perspective, such as academic activities, author's demography, academic social networks, etc. The relationships among Big Scholarly Data can be worthy of solving researcher as well as journal related concerns, if they are prudently treated to extract knowledge. The best approach to efficiently process these relationships is the graph. However, with the rapid growth in the number of digital articles by various libraries, the relationships raise exponentially, generating large graphs, which have become increasingly challenging to be handled in order to analyze scholarly information. On the other hand, many researchers and publishers/journals have severe concerns about the ranking control mechanisms and the consideration of quantity rather than quality. Therefore, in this paper, we proposed graph-based mechanisms to perform four critical decisions that are the need of the today's scholarly community. To improve the quality of the article, we proposed a mechanism for selecting and recommending suitable reviewers for a submitted paper based on researchers' expertise and their popularity in that particular field while avoiding conflict of interest. Also, due to shortcomings in the existing journal ranking approaches, we also designed a journal ranking mechanism including its new impact factor and relative ranking by using a modified version of traditional page ranking algorithm and excluding self-authors citations as well as self-journal citations. Similarly, researchers ranking is also important for various motives that is calculated based on the expert's field, citation count, and a number of publications while avoiding any loophole to increase the ranking such as, self-citations and wrong citations. Also, to efficiently process big graphs generated by a massive number of scholarly related relationships, we proposed an architecture that uses the parallel processing mechanism of the Hadoop ecosystem over the real-time analysis approach of Apache Spark with GraphX. Finally, the efficiency of the proposed system is evaluated in terms of processing time and throughput while implementing the designed decision mechanisms.
机译:数字图书馆,如会议论文,期刊文件,书籍和论文,研究专利和实验,并产生了大量数据,命名为学术大数据。它涵盖了研究人员的观点以及出版商的观点以及学术活动,作者的人口统计学,学术社交网络等的学术相关信息。大学学位数据之间的关系可以求解研究人员以及杂志的相关问题谨慎对待提取知识。有效地处理这些关系的最佳方法是图形。然而,随着各种图书馆的数字文章的数量的快速增长,这种关系征集指数,产生大图,这已经变得越来越具有挑战性,以便分析学术信息。另一方面,许多研究人员和出版商/期刊对排名控制机制的严重问题和对数量而不是质量的考虑。因此,在本文中,我们提出了基于图形的机制,以执行今天的学术界需要的四个关键决策。为了提高文章的质量,我们提出了一种为基于研究人员的专业知识和他们的普及而建议为提交文件的合适审查员选择和推荐合适的审查员,同时避免利益冲突。此外,由于现有日记排列方法中的缺点,我们还设计了一种日志排名机制,包括通过使用传统页面排名算法的修改版本和不包括自我提交人以及自我杂志引文的新的影响因子和相对排名。同样,研究人员对根据专家领域,引文计数和许多出版物计算的各种动机也很重要,同时避免任何漏洞,以增加排名,如自我引用和错误的引用。此外,为了有效地处理由大量的学术相关关系生成的大图,我们提出了一种架构,该架构使用Hadoop生态系统的并行处理机制,通过Graphx使用Apache Spark的实时分析方法。最后,在实现设计的决策机制的同时,在处理时间和吞吐量方面评估所提出的系统的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号