Design of buffer architecture for multi-media stream microprocessor
-
摘要: 传统微处理器体系结构不能很好地匹配媒体处理应用的特点.针对处理器与存储器之间日益增长的性能间隙问题,分析了传统微处理器对媒体处理应用的通讯瓶颈;通过分析Cache存储器的特点,得出了传统的Cache结构并不适合现代媒体处理应用的结论,讨论了目前针对处理器通讯瓶颈的一些解决办法;提出了一种以大容量流寄存器堆替代Cache作为中间缓冲器,并能适合于媒体处理应用的金字塔存储层次体系结构设计.该体系结构具有三级并行数据带宽存储层次,即片外SDRAM、全局寄存器堆和局部寄存器堆.三级并行存储层次所能提供的带宽依次提高一个数量级,带宽之比为1∶16∶256,从而可以有效地支持卫星遥感图像预处理对数据带宽的需求.Abstract: The characteristics of media processing applications are poorly matched to conventional microprocessor architectures. The growing processor-memory performance gap was solved. The communication bottlenecks of traditional microprocessor, which is used for media processing application, were discussed. It was concluded that the traditional Cache architecture does not adapt to the modern media processing application by analyzing the characteristics of Cache memory. Some current schemes used to solve the communication bottlenecks were discussed. A new architecture design of pyramid memory hierarchy was brought forward, which substituted the larger stream register files for Cache as the medial buffers and could adapt to media processing applications. This architecture provided a three-tiered parallel data bandwidth hierarchy, including memory bandwidth, global register bandwidth, and local register bandwidth, with a ratio of 1∶16∶256. With this bandwidth scaling, the bandwidth requirements of satellite remote sensing image pretreatment may be efficiently matched.
-
[1] Burger, Goodman J R, Kägi A. Memory bandwidth limitations of future microprocessors. In:Proc 23rd Ann Int'l Symp Computer Architecture, Association of Computing Machinery. New York:ACM Press, 1996.79~90 [2] Bakshi A, Gaudiot J L. Memory latency:to tolerate or to reduce. In:Proceedings of the 12th Symposium on Computer Architecture and High Performance Computing. Brazil:Sao Pedro, 2000.24~27 [3] Stanford. Streaming supercomputer. The 2002 Annual Technical Report, Chapter 5,2002 [4] Rixner S. Stream processor architecture[M] Boston:Kluwer Academic Publishers,2001 [5] Seznec A. A case for two-way skewed-associative caches. In:Proceedings of the 20th Annual International Symposium on Computer Architecture. San Diego:CA,1993.169~178 [6] Chen T F, Baer J L. Effective hardware-based data prefetching for high performance multiprocessors[J] IEEE Transactions on Computers,1995,44(5):609~623 [7] Eggers S. Simultaneous multithreading:a platform for next-generation processors[J] IEEE Micro,1997,12~19 [8] Patterson D, Anderson T, Cardwell N, et al. A case for intelligent RAM:IRAM[J] IEEE Micro,1997,17(2) [9] Oskin M, Chong F, Sherwood T. Active pages:a model of computation for intelligent memory. In:Proceedings of the 25th Annual International Symposium on Computer Architecture. Barcelona, 1998.192~203 [10] Rixner S,William J. Memory access scheduling. In:ISCA 2000. 2000. 128~138 [11] Zhou G, Menas Kafatos. Future intelligent earth observing satellites. In:Pecora 15/Land Satellite Information IV/ISPRS Commission I/FIEOS 2002 Conference Proceedings. Hanover:University of Hannover, 2002.1~8 [12] Cloude S, Papathanassiou K, Reigber A, et al. Multi-frequency polarimetric SAR interferometry for vegetation structure extraction. In:Proceedings of IGARSS’ 2000. 2000. 129~131
点击查看大图
计量
- 文章访问数: 2641
- HTML全文浏览量: 240
- PDF下载量: 4
- 被引次数: 0