Authors: Yuri Dotsenko, Derek Sessions, Andy Glaister, Blaise Pascal Tine, Mikhail Lyapunov
DOI:
Keywords:
Abstract: Described herein are optimizations of thread loop intermediate representation (IR) code. One embodiment involves an algorithm that, based on data-flow analysis, computes the sets of temporary variables that are loaded at the beginning of a thread loop and stored upon exit from a thread loop. Another embodiment involves reducing the size of a thread loop trip for the commonly found case where a piece of compute shader is executed by a single thread (or by a compiler-analyzable range of threads). In yet another embodiment, compute shader thread indices are cached to avoid excessive divisions, further improving execution speed.
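
To make the last two ideas concrete, here is a minimal Python sketch of what a CPU-side thread loop with cached thread indices and a reduced trip range might look like. It is an illustration under stated assumptions, not the method claimed by the authors: the names `build_thread_id_cache` and `run_thread_loops` are hypothetical, the flat index is assumed to run x-fastest, and each region is assumed to arrive with a thread range that a compiler has already proven (or `None` when every thread must run).

```python
def build_thread_id_cache(group_size):
    """Compute (x, y, z) for every flat thread index once.

    Assumption: the flat index is laid out x-fastest,
    i.e. t = z * gx * gy + y * gx + x.
    The divisions and remainders happen only here, not in every thread loop.
    """
    gx, gy, gz = group_size
    return [(t % gx, (t // gx) % gy, t // (gx * gy))
            for t in range(gx * gy * gz)]


def run_thread_loops(group_size, regions):
    """Run each region of a shader as one thread loop.

    `regions` is a list of (body, thread_range) pairs (hypothetical shape):
      - body(t, x, y, z) is the code of one region for one thread;
      - thread_range is the range of flat indices known to execute the
        region, or None when all threads execute it.
    """
    cache = build_thread_id_cache(group_size)
    total = len(cache)
    for body, thread_range in regions:
        # Reduced trip: iterate only over the threads that can run the body.
        trips = range(total) if thread_range is None else thread_range
        for t in trips:
            x, y, z = cache[t]  # table lookup instead of div/mod per trip
            body(t, x, y, z)


if __name__ == "__main__":
    visited = []
    single = []
    run_thread_loops(
        (4, 2, 2),
        [
            # Region executed by all 16 threads of the group.
            (lambda t, x, y, z: visited.append((t, x, y, z)), None),
            # Region guarded by "thread 0 only": trip range shrinks to one.
            (lambda t, x, y, z: single.append(t), range(0, 1)),
        ],
    )
    print(len(visited), single)
```

In this sketch the second region iterates once instead of sixteen times, and neither region performs a division inside its loop. A real compiler would apply both transformations to IR rather than to Python callables.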