review.tizen.org Git - platform/upstream/llvm.git/commit

author	Yuan Yao <yuayao@nvidia.com>
	Wed, 28 Jun 2023 20:50:25 +0000 (13:50 -0700)
committer	Yuan Yao <yuayao@nvidia.com>
	Fri, 30 Jun 2023 22:47:46 +0000 (15:47 -0700)
commit	fc37f717770acdfe5504bb9b969a01bb16a187f9
tree	1c9a6939332920f33af9e583669a57c1b76d4fca
parent	9dc88651d591f62ddd7f54b98e3c9a8cb81d8bd5

[mlir][NVGPU]: Fix op description of nvgpu.device_async_wait.

According to the NVIDIA documentation on `cp.async.wait_group`
(https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cp-async-wait-group-cp-async-wait-all),
the `numGroups` attribute of `nvgpu.device_async_wait` is an upper bound
(not a lower bound) on the number of async groups that may still be pending
when the executing thread is unblocked.
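
The corrected semantics can be illustrated with a small host-side model. This
is only a sketch: `canUnblock` and `groupsToRetire` are hypothetical helper
names, not part of MLIR or CUDA, and the model assumes that groups complete
oldest first, as the PTX documentation linked above describes for
`cp.async.wait_group`.

```cpp
#include <cstdint>

// Hypothetical model, not an MLIR or CUDA API.
// pendingGroups: async copy groups committed by the executing thread that
// have not completed yet.
// numGroups: value of the numGroups attribute on nvgpu.device_async_wait.
constexpr bool canUnblock(std::uint32_t pendingGroups, std::uint32_t numGroups) {
  // Upper bound: the wait ends once at most numGroups groups are pending.
  return pendingGroups <= numGroups;
}

// Number of the oldest pending groups that must complete before the wait
// returns under this model.
constexpr std::uint32_t groupsToRetire(std::uint32_t pendingGroups,
                                       std::uint32_t numGroups) {
  return pendingGroups > numGroups ? pendingGroups - numGroups : 0;
}
```

Under this model, a thread with three pending groups and `numGroups = 1`
stays blocked until the two oldest groups complete, while `numGroups = 0`
waits for every group.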

Reviewed By: christopherbate

Differential Revision: https://reviews.llvm.org/D154046