drm/amdgpu: fix race condition in amd_sched_entity_push_job
authorNicolai Hähnle <Nicolai.Haehnle@amd.com>
Wed, 2 Dec 2015 16:35:12 +0000 (17:35 +0100)
committerAlex Deucher <alexander.deucher@amd.com>
Wed, 2 Dec 2015 20:04:04 +0000 (15:04 -0500)
commit07df04dfcfe51a45239d056d43cbc09702262c82
treecd4cd9b9a9d38c1dd16ea31b11a4e87766b641b9
parente2f784fa8a3b1fbf6999ba44f7327bd9d29f2c5b
drm/amdgpu: fix race condition in amd_sched_entity_push_job

As soon as we leave the spinlock after the job has been added to the job
queue, we can no longer rely on the job's data to be available.

I have seen a null-pointer dereference due to sched == NULL in
amd_sched_wakeup via amd_sched_entity_push_job and
amd_sched_ib_submit_kernel_helper. Since the latter initializes
sched_job->sched with the address of the ring scheduler, which is
guaranteed to be non-NULL, this race appears to be a likely culprit.

Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
Bugzilla: https://bugs.freedesktop.org/attachment.cgi?bugid=93079
Reviewed-by: Christian König <christian.koenig@amd.com>
drivers/gpu/drm/amd/scheduler/gpu_scheduler.c