...
首页> 外文期刊>IEEE Transactions on Computers >Modular Design of High-Throughput, Low-Latency Sorting Units
【24h】

Modular Design of High-Throughput, Low-Latency Sorting Units

机译:高通量,低延迟分选单元的模块化设计

获取原文
获取原文并翻译 | 示例
           

摘要

High-throughput and low-latency sorting is a key requirement in many applications that deal with large amounts of data. This paper presents efficient techniques for designing high-throughput, low-latency sorting units. Our sorting architectures utilize modular design techniques that hierarchically construct large sorting units from smaller building blocks. The sorting units are optimized for situations in which only the $(M)$ largest numbers from $(N)$ inputs are needed, because this situation commonly occurs in many applications for scientific computing, data mining, network processing, digital signal processing, and high-energy physics. We utilize our proposed techniques to design parameterized, pipelined, and modular sorting units. A detailed analysis of these sorting units indicates that as the number of inputs increases their resource requirements scale linearly, their latencies scale logarithmically, and their frequencies remain almost constant. When synthesized to a 65-nm TSMC technology, a pipelined 256-to-4 sorting unit with 19 stages can perform more than 2.7 billion sorts per second with a latency of about 7 ns per sort. We also propose iterative sorting techniques, in which a small sorting unit is used several times to find the largest values.
机译:高吞吐量和低延迟排序是许多处理大量数据的应用程序的关键要求。本文介绍了设计高吞吐量,低延迟的排序单元的有效技术。我们的分拣架构利用模块化设计技术,可以从较小的构造块中分层构建大型分拣单元。分类单元针对仅需要$(N)$个输入中最大$(M)$个数的情况进行了优化,因为这种情况通常发生在科学计算,数据挖掘,网络处理,数字信号处理,和高能物理。我们利用我们提出的技术来设计参数化,流水线和模块化分类单元。对这些分类单元的详细分析表明,随着输入数量的增加,其资源需求呈线性增长,其等待时间呈对数增长,其频率几乎保持不变。当合成为65纳米TSMC技术时,具有19个级的流水线式256至4分选单元可以每秒执行超过27亿次分选,每个分选的等待时间约为7 ns。我们还提出了迭代排序技术,其中多次使用一个小的排序单元来找到最大值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号