intel/compiler: fix derivative on y axis implementation
authorLionel Landwerlin <lionel.g.landwerlin@intel.com>
Tue, 25 Jun 2019 08:10:14 +0000 (11:10 +0300)
committerLionel Landwerlin <lionel.g.landwerlin@intel.com>
Thu, 27 Jun 2019 18:14:58 +0000 (18:14 +0000)
commit836225840c21dfb9ee77267e06e14bba781f69a1
treebc8b7b3e5aae7cb3e9442b3d01d66a157eed9a2d
parent53f17c4efdcded0283087ccc9777b31010db2fec
intel/compiler: fix derivative on y axis implementation

This rewrites the ddy in EXECUTE_4 mode with a loop to make it more
obvious what is going on and also sets the group each of the 4 threads
in the groups are supposed to execute.

Fixes the following CTS tests :

   dEQP-VK.glsl.derivate.dfdyfine.dynamic_*

Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Co-Authored-by: Jason Ekstrand <jason@jlekstrand.net>
Reviewed-by: Matt Turner <mattst88@gmail.com>
Fixes: 2134ea380033d5 ("intel/compiler/fs: Implement ddy without using align16 for Gen11+")
src/intel/compiler/brw_fs_generator.cpp