intel/fs: Don't optimize DW*1 MUL if it stores value to the accumulator

author Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com>

Fri, 13 Oct 2023 02:02:22 +0000 (05:02 +0300)

committer Eric Engestrom <eric@engestrom.ch>

Wed, 20 Dec 2023 00:05:06 +0000 (00:05 +0000)
diff --git a/.pick_status.json b/.pick_status.json
index a10aa15..abc560d 100644
--- a/.pick_status.json
+++ b/.pick_status.json
@@ -194,7 +194,7 @@
          "description": "intel/fs: Don't optimize DW*1 MUL if it stores value to the accumulator",
          "nominated": true,
          "nomination_type": 1,
-        "resolution": 0,
+        "resolution": 1,
          "main_sha": null,
          "because_sha": "a8b86459a1bb74cfdf0d63572a9fe194b2b5b53f",
          "notes": null
diff --git a/src/intel/compiler/brw_fs.cpp b/src/intel/compiler/brw_fs.cpp
index db67b33..696b5db 100644
--- a/src/intel/compiler/brw_fs.cpp
+++ b/src/intel/compiler/brw_fs.cpp
@@ -2788,6 +2788,29 @@ fs_visitor::opt_algebraic()
           if (brw_reg_type_is_floating_point(inst->src[1].type))
              break;
  
+         /* From the BDW PRM, Vol 2a, "mul - Multiply":
+          *
+          *    "When multiplying integer datatypes, if src0 is DW and src1
+          *    is W, irrespective of the destination datatype, the
+          *    accumulator maintains full 48-bit precision."
+          *    ...
+          *    "When multiplying integer data types, if one of the sources
+          *    is a DW, the resulting full precision data is stored in
+          *    the accumulator."
+          *
+          * There are also similar notes in earlier PRMs.
+          *
+          * The MOV instruction can copy the bits of the source, but it
+          * does not clear the higher bits of the accumulator. So, because
+          * we might use the full accumulator in the MUL/MACH macro, we
+          * shouldn't replace such MULs with MOVs.
+          */
+         if ((brw_reg_type_to_size(inst->src[0].type) == 4 ||
+              brw_reg_type_to_size(inst->src[1].type) == 4) &&
+             (inst->dst.is_accumulator() ||
+              inst->writes_accumulator_implicitly(devinfo)))
+            break;
+
           /* a * 1.0 = a */
           if (inst->src[1].is_one()) {
              inst->opcode = BRW_OPCODE_MOV;
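
The reasoning in the new comment can be illustrated with a toy model. The sketch below is not Mesa code and not a description of real hardware: it assumes a single accumulator channel represented as a 64-bit integer, uses unsigned values only, and every name in it (ToyAcc, toy_mul_to_acc, toy_mov_to_acc, toy_acc_high) is hypothetical. It only shows why replacing "MUL acc, a, 1" with "MOV acc, a" could leave stale upper bits behind for a later MACH-style read.

```cpp
#include <cstdint>

// Toy model only: one accumulator channel held in a 64-bit integer.
// The real accumulator layout is not modelled here.
struct ToyAcc {
    uint64_t bits = 0;
};

// Hypothetical stand-in for an integer MUL with a DW source that targets
// the accumulator: the full-precision product replaces the whole value.
inline void toy_mul_to_acc(ToyAcc &acc, uint32_t src0, uint16_t src1) {
    acc.bits = static_cast<uint64_t>(src0) * src1;
}

// Hypothetical stand-in for MOV into the accumulator: the low 32 bits are
// overwritten, while the upper bits keep whatever was there before.
inline void toy_mov_to_acc(ToyAcc &acc, uint32_t src) {
    acc.bits = (acc.bits & 0xFFFFFFFF00000000ull) | src;
}

// The part of the accumulator that a MACH-style consumer would rely on.
inline uint32_t toy_acc_high(const ToyAcc &acc) {
    return static_cast<uint32_t>(acc.bits >> 32);
}
```

In this model, starting from an accumulator that still holds 0x00001234ABCDEF01, toy_mul_to_acc(acc, 0x80000000u, 1) leaves the high part at zero, whereas toy_mov_to_acc(acc, 0x80000000u) leaves the stale 0x1234 in place. That difference is what the added check guards against when it skips the a * 1 simplification for accumulator-writing MULs with a DW source.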