Volume 30 Issue 11
Nov.  2004
Turn off MathJax
Article Contents
Li Wenlong, Liu Li, Tang Zhizhonget al. Loop unrolling optimization for software pipelining[J]. Journal of Beijing University of Aeronautics and Astronautics, 2004, 30(11): 1111-1115. (in Chinese)
Citation: Li Wenlong, Liu Li, Tang Zhizhonget al. Loop unrolling optimization for software pipelining[J]. Journal of Beijing University of Aeronautics and Astronautics, 2004, 30(11): 1111-1115. (in Chinese)

Loop unrolling optimization for software pipelining

  • Received Date: 25 Jun 2004
  • Publish Date: 30 Nov 2004
  • Loop unrolling can m ake software pipelining achieve fractional initiation interval and improve resou rce utilization. Optimizations based on unrolling can reduce resource requiremen ts and the heights of critical paths. An algorithm named UTBPC(unrolling times based program characteristics) for determining unrolling factors and unrol ling b ased optimization for software data prefetching were proposed. These optimizatio ns were implemented in ORC(open research compiler), and SPEC CPU2000 benchmarks were tested in Itanium processor. The average perf ormance was improv ed by 2.6%. The results show that UTBPC algorithm and unrolling base d optimization for software data prefetching can improve the overall performance of compilers.

     

  • loading
  • [1] Allen V H, Jones R B, Lee R M, et al. Software pipelining [J]. ACM Computing Surveys, 1995,27(3):367~432 [2]Weiss S, Smith J E. A study of scalar compilation techniques for pipelined supercomputers[J]. ACM Transactions on Mathematical Software, 1990, 16(3):223~245 [3]Rau B R. Iterative modulo scheduling . HPL-94-115, 1994 [4]Mowry T C, Lam M S, Gupta A. Design and evaluation of a compiler algorithm for prefetching . In:Proceeding of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems . Massachusetts:ACM Press, 1992. 62~73 [5]Roy J, Sun C, Wu C Y. Open research compiler for itanium processor famil y(IPF) . In:MICRO-34 Tutorial . Texas:ACM Press, 2001 [6]Intel Corporation. Intel IA-64 architecture software developer’s manual. Volume 3:Instruction set reference [M]. Intel Corp, 2000 [7]Intel Corporation. Intel IA-64 architecture software developer’s manual. Volume 1:IA-64 application architecture [M]. Intel Corp, 2000 [8]Sanchez F, Cortadella J,Badia R M. Optimal exploration of the unrolling degree for software pipelining . UPC-DAC-1996-41, 1996 [9]Vivek Sarkar. Optimized unrolling of nested loops . In:Proceedings of the 14th International Conference on Supercomputing . New Mexico:ACM Press, 2000. 153~166
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views(3159) PDF downloads(2658) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return