Software bypass and fine-grained parallel scheduling on VLIW
Abstract: A new very long instruction word (VLIW) machine model is described and analyzed. To exploit the parallelism of this model, a software bypass method based on the static single assignment (SSA) representation is proposed, together with an integrated, operation-based instruction scheduling and register allocation algorithm. Compared with conventional software bypass and cycle-based scheduling, the proposed approach allocates result registers flexibly, schedules micro-operations efficiently, and reduces the number of "hazard value" spills caused by result register reuse, thereby generating high-quality target code.
Key words:
- instruction schedule
- register allocation
- very long instruction word
- software bypass