首页> 外文会议>IEEE International Symposium on High Performance Computer Architecture >In-network traffic regulation for Transactional Memory
【24h】

In-network traffic regulation for Transactional Memory

机译:交易记忆的网络网络流量监管

获取原文

摘要

Hardware Transactional Memory (HTM) promises to simplify parallel programming on shared-memory chip multiprocessors by providing atomic execution of code blocks. Concurrently, Networks-On-Chip (NOCs) have emerged as an efficient on-chip communication infrastructure but have been largely neglected in HTM designs. In this work, we explore the interaction between the HTM paradigm and NOCs. In the process, we find a huge source of unnecessary network traffic incurred by transactional requests that are unsuccessful. This problem is identified as false forwarding that adversely affects network performance and energy efficiency. Surprisingly, 39% (up to 79% for a specific workload) of the transactional requests have incurred false forwarding over a wide spectrum of workloads. To combat this problem, we propose TMNOC, a novel approach that exploits the co-design of HTM and NOCs to mitigate false forwarding. Transactional requests that have a high probability to fail are filtered out in-network as early as possible to save energy and improve concurrency in the memory system. Experimental results show that our design reduces total network traffic by 20% on average (up to 40%) for a set of high-contention benchmarks representative of future TM workloads, thereby reducing energy consumption by an average of 24% (up to 39%). Meanwhile, the contention in the coherence directory is reduced by 66% on average. These improvements are achieved with only 5% area overhead added to a conventional on-chip router design.
机译:硬件事务内存(HTM)承诺通过提供代码块的原子执行来简化共享存储芯片多处理器上的并行编程。同时,片上网(NOC)被出现为有效的片上通信基础设施,但在HTM设计中已在很大程度上被忽略。在这项工作中,我们探讨了HTM范例与NOC之间的互动。在此过程中,我们找到了不成功的事务请求产生的不必要的网络流量来源。该问题被标识为错误转发,这对网络性能和能效产生了不利影响。令人惊讶的是,交易请求的39%(特定工作量为79%)在广泛的工作量上产生了错误转发。为了解决这个问题,我们提出了一种新的方法,这是一种利用HTM和NOC的共同设计来缓解错误转发的新方法。尽早在网络中过滤出高概率的交易请求,以节省能量并提高内存系统中的并发性。实验结果表明,我们的设计由20%平均减少了总的网络流量(高达40%)为一组高争用的基准测试代表未来TM的工作负载,从而通过平均24%(最多减少能耗39% )。同时,相干目录中的争用平均减少了66%。这些改进是通过添加到传统的片上路由器设计中的5%面积开销来实现的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号