[mlir][NVGPU]: Fix op description of nvgpu.device_async_wait.

author Yuan Yao <yuayao@nvidia.com>

Wed, 28 Jun 2023 20:50:25 +0000 (13:50 -0700)

committer Yuan Yao <yuayao@nvidia.com>

Fri, 30 Jun 2023 22:47:46 +0000 (15:47 -0700)
diff --git a/mlir/include/mlir/Dialect/NVGPU/IR/NVGPU.td b/mlir/include/mlir/Dialect/NVGPU/IR/NVGPU.td

index e595e9d..41571fc 100644
--- a/mlir/include/mlir/Dialect/NVGPU/IR/NVGPU.td
+++ b/mlir/include/mlir/Dialect/NVGPU/IR/NVGPU.td
@@ -336,8 +336,11 @@ def NVGPU_DeviceAsyncWaitOp : NVGPU_Op<"device_async_wait", []> {
      The `nvgpu.device_async_wait` op will block the execution thread until the group
      associated with the source token is fully completed.
  
-    The optional `$numGroup` attribute gives a lower bound of the number of
-    groups uncompleted when the wait can unblock the thread.
+    The optional `$numGroups` attribute gives an upper bound of the number of
+    groups uncompleted when the wait can unblock the thread. For example, if
+    16 async groups are pushed and `$numGroups` is set to 12, then the thread
+    will unblock when 12 groups or fewer are in flight (4 groups have
+    completed).
  
      Example:
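
The corrected wording treats `$numGroups` as an upper bound on the groups still in flight when the wait returns. The arithmetic in the 16/12 example can be checked with a small Python model. This is only a sketch of the described semantics, not part of the NVGPU dialect or its lowering, and the function names `wait_can_unblock` and `min_completed_groups` are hypothetical.

```python
def wait_can_unblock(in_flight: int, num_groups: int) -> bool:
    """Model of the documented condition: the wait may return once the
    number of uncompleted groups is at most num_groups."""
    if in_flight < 0 or num_groups < 0:
        raise ValueError("group counts must be non-negative")
    return in_flight <= num_groups


def min_completed_groups(pushed: int, num_groups: int) -> int:
    """Fewest groups that must have completed before the wait can return,
    given how many groups were pushed in total."""
    if pushed < 0 or num_groups < 0:
        raise ValueError("group counts must be non-negative")
    # At most num_groups may remain in flight, so the rest must be done.
    return max(0, pushed - num_groups)


if __name__ == "__main__":
    # The example from the op description: 16 pushed, numGroups = 12.
    print(min_completed_groups(16, 12))  # 4
    print(wait_can_unblock(12, 12))      # True
    print(wait_can_unblock(13, 12))      # False
```

Under this model the old "lower bound" wording would be wrong: with 13 groups still in flight and `$numGroups` set to 12, the thread remains blocked.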