platform/upstream/mesa.git
2 years agoradeonsi: unify how ngg_cull_flags are set
Marek Olšák [Sun, 7 Nov 2021 20:49:19 +0000 (15:49 -0500)]
radeonsi: unify how ngg_cull_flags are set

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>

2 years agod3d12: Fix incorrect hash table usage
Jesse Natalie [Tue, 16 Nov 2021 15:05:45 +0000 (07:05 -0800)]
d3d12: Fix incorrect hash table usage

I'd assumed that since insert didn't take a deleter, it was
find-or-insert, not insert-or-replace. This caused a bo reference
leak if the same bo was used more than once in a batch.

Fixes: fde36d79920 ("d3d12: Don't wait for GPU reads to do CPU reads")
Reviewed By: Bill Kristiansen <billkris@microsoft.com>

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13819>

2 years agolima: add native txp support
Vasily Khoruzhick [Thu, 30 Sep 2021 04:34:01 +0000 (21:34 -0700)]
lima: add native txp support

Currently lima uses generic TXP lowering that results in downgrading
coords precision to FP16 since we have to do some calculations with
coords instead of loading them directly from varying.

Mali4x0 has native TXP support, however coords and projector have to
come from a single source.

Add NIR lowering pass that combines coords and projector into a single
backend-specific source and use it instead of generic lowering.

Unfortunately this change regresses one test, but it also fails in blob and
disassembly is now identical.

shader-db diff:

total instructions in shared programs: 15623 -> 15603 (-0.13%)
instructions in affected programs: 877 -> 857 (-2.28%)
helped: 7
HURT: 0
helped stats (abs) min: 2 max: 8 x̄: 2.86 x̃: 2
helped stats (rel) min: 0.87% max: 10.53% x̄: 4.93% x̃: 1.85%
95% mean confidence interval for instructions value: -4.95 -0.76
95% mean confidence interval for instructions %-change: -9.31% -0.55%
Instructions are helped.

total loops in shared programs: 3 -> 3 (0.00%)
loops in affected programs: 0 -> 0
helped: 0
HURT: 0

total spills in shared programs: 136 -> 137 (0.74%)
spills in affected programs: 0 -> 1
helped: 0
HURT: 1

total fills in shared programs: 598 -> 602 (0.67%)
fills in affected programs: 0 -> 4
helped: 0
HURT: 1

Tested-by: Denis Pauk <pauk.denis@gmail.com>
Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de>
Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13111>

2 years agoisaspec: Add prototypes for expr evaluators
Rob Clark [Sun, 14 Nov 2021 20:00:07 +0000 (12:00 -0800)]
isaspec: Add prototypes for expr evaluators

Add function prototypes for generated expr evaluators, to avoid a use-
before-declaration issue if an expression references a derived field.

Signed-off-by: Rob Clark <robdclark@chromium.org>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13787>

2 years agofreedreno/ir3: adjust condition for when to use ldib
Ilia Mirkin [Tue, 16 Nov 2021 04:41:59 +0000 (23:41 -0500)]
freedreno/ir3: adjust condition for when to use ldib

We have to use it any time that the image is writable. Otherwise writes
from the same invocation won't have posted into the texture cache.

See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5629
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13807>

2 years agonir: fix constant expression of ibitfield_extract
Samuel Pitoiset [Mon, 15 Nov 2021 13:37:41 +0000 (14:37 +0100)]
nir: fix constant expression of ibitfield_extract

This fixes dEQP-VK.graphicsfuzz.cov-condition-bitfield-extract-integer.

For example, nir_ibitfield_extract(3, 1, 2) should return 1.

Cc: 21.3 mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13791>

2 years agovulkan: Add a dummy sync type
Jason Ekstrand [Mon, 15 Nov 2021 22:41:52 +0000 (16:41 -0600)]
vulkan: Add a dummy sync type

This is useful in WSI scenarios where you want to trivially signal a
fence or semaphore.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan,anv: Auto-detect syncobj features
Jason Ekstrand [Fri, 12 Nov 2021 15:51:59 +0000 (09:51 -0600)]
vulkan,anv: Auto-detect syncobj features

Instead of having a bunch of const vk_sync_type for each permutation of
vk_drm_syncobj capabilities, have a vk_drm_syncobj_get_type helper which
auto-detects features.  If a driver can't support a feature for some
reason (i915 got timeline support very late, for instance), they can
always mask off feature bits they don't want.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Simplify submit_simple_batch()
Jason Ekstrand [Wed, 10 Nov 2021 15:33:57 +0000 (09:33 -0600)]
anv: Simplify submit_simple_batch()

BO waits aren't going away any time soon so using a syncobj here doesn't
really gain us anything.  It just makes it more complicated.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Remove unnecessary syncobj wrappers
Jason Ekstrand [Wed, 3 Nov 2021 13:24:02 +0000 (08:24 -0500)]
anv: Remove unnecessary syncobj wrappers

These are entirely unused except for the syncobj in submit_simple_batch.
We can use the libdrm wrappers for that as they're basically equivalent
and the core Vulkan sync code already depends on them.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add an emulated binary vk_sync type
Jason Ekstrand [Tue, 2 Nov 2021 15:20:00 +0000 (10:20 -0500)]
vulkan: Add an emulated binary vk_sync type

This wraps a timeline vk_sync type and turns it into a binary one.  This
is useful for, for instance, driver layered on D3D12.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Convert to the common sync and submit framework
Jason Ekstrand [Wed, 20 Oct 2021 14:44:02 +0000 (09:44 -0500)]
anv: Convert to the common sync and submit framework

This is, unfortunately, a large flag-day mega-commit.  However, any
other approach would likely be fragile and involve a lot more churn as
we try to plumb the new vk_fence and vk_semaphore primitives into ANV's
submit code before we delete it all.  Instead, we do it all in one go
and accept the consequences.

While this should be mostly functionally equivalent to the previous
code, there is one potential perf-affecting change.  The command buffer
chaining optimization no longer works across VkSubmitInfo structs.
Within a single VkSubmitInfo, we will attempt to chain all the command
buffers together but we no longer try to chain across a VkSubmitInfo
boundary.  Hopefully, this isn't a significant perf problem.  If it ever
is, we'll have to teach the core runtime code how to combine two or more
VkSubmitInfos into a single vk_queue_submit.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Use helpers in util/os_time.h in the query code
Jason Ekstrand [Wed, 20 Oct 2021 21:44:53 +0000 (16:44 -0500)]
anv: Use helpers in util/os_time.h in the query code

These are about to be the only use of anv_gettime_ns() so lets switch
them over so the next patch can delete the helper outright.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Remove the last remnants of in/out fences
Jason Ekstrand [Wed, 20 Oct 2021 18:38:19 +0000 (13:38 -0500)]
anv: Remove the last remnants of in/out fences

This should have been dropped as part of d44ea09e61f4 ("anv: Drop unused
sync_file and BO semaphore code") but we missed a few bits.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Add a BO sync type
Jason Ekstrand [Wed, 20 Oct 2021 04:55:27 +0000 (23:55 -0500)]
anv: Add a BO sync type

This is copy+paste from the BO implementation of anv_fence

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Delete ANV_SEMAPHORE_TYPE_DUMMY
Jason Ekstrand [Wed, 20 Oct 2021 14:33:11 +0000 (09:33 -0500)]
anv: Delete ANV_SEMAPHORE_TYPE_DUMMY

It's never set as a semaphore type.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/wsi/display: Wrap wsi_display_fence in a vk_sync
Jason Ekstrand [Thu, 21 Oct 2021 16:50:10 +0000 (11:50 -0500)]
vulkan/wsi/display: Wrap wsi_display_fence in a vk_sync

This gets rid of the bespoke vfunc interface in the WSI code.
Eventually, I'd love to get rid of wsi_display_fence entirely but
this at least makes it a lot more palatable.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agowsi/display: Rework wsi_fence a bit
Jason Ekstrand [Thu, 21 Oct 2021 16:32:52 +0000 (11:32 -0500)]
wsi/display: Rework wsi_fence a bit

Get rid of most of the guts of the base class and just leave it as a
vtable.  We can also drop some of wsi_display_fence.  One functional
change here is that we're now using VK_SYSTEM_ALLOCATION_SCOPE_INSTANCE
which is more correct anyway because, thanks to the funky reference
counting we do with destroyed and event_received, its lifetime is tied
to the physical device, at best.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/wsi: Drop wsi_common_get_current_time()
Jason Ekstrand [Thu, 21 Oct 2021 15:52:29 +0000 (10:52 -0500)]
vulkan/wsi: Drop wsi_common_get_current_time()

Use os_time_get_nano() instead.  At this point, it's just a wrapper
around os_time_get_nano() anyway.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/device: Log the timeline mode when lost
Jason Ekstrand [Thu, 28 Oct 2021 16:40:02 +0000 (11:40 -0500)]
vulkan/device: Log the timeline mode when lost

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle
Jason Ekstrand [Tue, 19 Oct 2021 19:27:25 +0000 (14:27 -0500)]
vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle

This adds a new vk_queue_submit object which contains a list of command
buffers as well as wait and signal operations along with a driver hook
which takes a vk_queue and a vk_queue_submit and does the actual submit.
The common code then handles spawning a submit thread if needed, waiting
for timeline points to materialize, dealing with timeline semaphore
emulation via vk_timeline, etc.  All the driver sees are vk_queue.submit
calls with fully materialized vk_sync objects which it can wait on
unconditionally.

This implementation takes a page from RADV's book and only ever spawns
the submit thread if it sees a timeline wait on a time point that has
not yet materialized.  If this never happens, it calls vk_queue.submit
directly from vkQueueSubmit() and the thread is never spawned.

One other nicety of the new framework is that there is no longer a
distinction, from the driver's PoV, between fences and semaphores.  The
fence, if any, is included as just one more signal operation on the
final vk_queue_submit in the batch.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add a common implementation of VkSemaphore
Jason Ekstrand [Tue, 19 Oct 2021 16:12:14 +0000 (11:12 -0500)]
vulkan: Add a common implementation of VkSemaphore

This is built on the new vk_sync primitives.  In the vk_physical_device,
the driver provides a null-terminated array of vk_sync_type pointers in
priority order.  The semaphore implementation then selects the first
type that meets the necessary criterion.  In particular, semaphores may
or may not be timelines depending on the VkSemaphoreType.  It also
auto-selects the semaphore type based on the external handle types
provided and can down-grade as needed to support a particular external
handle.

The implementation itself is mostly copy+pasted from ANV.  The primary
difference is the fact that anv_semaphore_impl has been replaced with
vk_sync.  The permanent vk_sync is still embedded (like ANV) but the
temporary one is a pointer.  This makes stealing the temporary state as
part of VkQueueSubmit a bit easier.

All of the interesting stuff around waits, signals, etc. is implemented
by the vk_sync interface.  All this code does is wrap it all in the
annoyingly detailed VkFence rules so we can provide the correct Vulkan
entrypoint behavior.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add a common implementation of VkFence
Jason Ekstrand [Tue, 19 Oct 2021 03:36:44 +0000 (22:36 -0500)]
vulkan: Add a common implementation of VkFence

This is built on the new vk_sync primitives.  In the vk_physical_device,
the driver provides a null-terminated array of vk_sync_type pointers in
priority order.  The fence implementation then selects the first type
that meets the necessary criterion.  In particular, fences can't be
timelines and need to support reset and CPU wait.  It also auto-selects
the fence type based on the external handle types provided and can
down-grade as needed to support a particular external handle.

The implementation itself is mostly copy+pasted from ANV.  The primary
difference is the fact that anv_fence_impl has been replaced with
vk_sync.  The permanent vk_sync is still embedded (like ANV) but the
temporary one is a pointer.  This makes stealing the temporary state as
part of VkQueueSubmit a bit easier.

All of the interesting stuff around waits, resets, etc. is implemented
by the vk_sync interface.  All this code does is wrap it all in the
annoyingly detailed VkFence rules so we can provide the correct Vulkan
entrypoint behavior.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add an emulated timeline sync type
Jason Ekstrand [Mon, 18 Oct 2021 23:12:59 +0000 (18:12 -0500)]
vulkan: Add an emulated timeline sync type

This is mostly copy+paste (and a bit of re-typing) from ANV.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoc11/threads: Re-align return values for timed waits
Jason Ekstrand [Wed, 27 Oct 2021 21:32:24 +0000 (16:32 -0500)]
c11/threads: Re-align return values for timed waits

They're supposed to return thrd_timedout (which we mistakenly named
thrd_timeout), not thrd_busy.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add a common vk_drm_syncobj struct
Jason Ekstrand [Mon, 18 Oct 2021 21:50:53 +0000 (16:50 -0500)]
vulkan: Add a common vk_drm_syncobj struct

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/vk_device: Add a drm_fd field
Jason Ekstrand [Mon, 18 Oct 2021 21:24:03 +0000 (16:24 -0500)]
vulkan/vk_device: Add a drm_fd field

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/util: Include stdlib.h
Jason Ekstrand [Tue, 2 Nov 2021 16:27:08 +0000 (11:27 -0500)]
vulkan/util: Include stdlib.h

It's needed for malloc() which is used by STACK_ARRAY

Fixes: f695171e383e ("vulkan: add common entrypoints for sparse image requirements/properties")
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/meson: Re-arrange libvulkan_util deps a bit
Jason Ekstrand [Thu, 28 Oct 2021 19:39:09 +0000 (14:39 -0500)]
vulkan/meson: Re-arrange libvulkan_util deps a bit

Rename files_vulkan_runtime to vulkan_runtime_files and add a new
vulkan_runtime_deps array for dependencies.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Add a vk_sync base class
Jason Ekstrand [Mon, 18 Oct 2021 21:24:49 +0000 (16:24 -0500)]
vulkan: Add a vk_sync base class

This doesn't map directly to any particular Vulkan object but is,
instead, a base class for the internal implementations of both VkFence
and VkSemaphore.  Its utility will become evident in later patches.

The design of vk_sync will look familiar to anyone with significant
experience in DRM.  The base object itself is just a pointer to a vfunc
table with function pointers providing the implementation of the various
operations.  Depending on how the vk_sync will be used some of of those
vfuncs are optional.  If it's only going to be used for VkSemaphore, for
instance, there's no need for reset().

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Wire up the new status check
Jason Ekstrand [Mon, 8 Nov 2021 18:53:10 +0000 (12:53 -0600)]
anv: Wire up the new status check

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan/device: Add a check_status hook
Jason Ekstrand [Mon, 8 Nov 2021 18:42:40 +0000 (12:42 -0600)]
vulkan/device: Add a check_status hook

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Use the new common device lost tracking
Jason Ekstrand [Tue, 19 Oct 2021 23:44:01 +0000 (18:44 -0500)]
anv: Use the new common device lost tracking

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agovulkan: Pull the device lost framework from ANV
Jason Ekstrand [Tue, 19 Oct 2021 23:32:28 +0000 (18:32 -0500)]
vulkan: Pull the device lost framework from ANV

It's a bit on the over-complicated side but the objective is to make the
debug log messages show up in the same thread as the first
VK_ERROR_DEVICE_LOST so we don't massively confuse the app.  It's
unknown if this is actually ever a problem but, with submit happening
off on its own thread, logging errors from threads the client doesn't
know about doesn't seem like a massively great plan.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>

2 years agoanv: Move device memory maps back to anv_device_memory
Jason Ekstrand [Mon, 15 Nov 2021 17:32:38 +0000 (11:32 -0600)]
anv: Move device memory maps back to anv_device_memory

This effectively partially reverts 13fe43714c1f ("anv: Add helpers in
anv_allocator for mapping BOs") where we both added helpers and reworked
memory mapping to stash the maps on the BO.  The problem comes with
external memory.  Due to GEM rules, if a memory object is exported and
then imported or imported twice,  we have to deduplicate the anv_bo
struct but, according to Vulkan rules, they are separate VkDeviceMemory
objects.  This means we either need to always map whole objects and
reference-count the map or we need to handle maps separately for
separate VkDeviceMemory objects.  For now, take the later path.

Fixes: 13fe43714c1f ("anv: Add helpers in anv_allocator for mapping BOs")
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5612
Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13795>

2 years agomesa: convert unsupported primtypes during display list compilation
Mike Blumenkrantz [Wed, 10 Nov 2021 15:38:38 +0000 (10:38 -0500)]
mesa: convert unsupported primtypes during display list compilation

this adds primitive type translation in before the draw reaches gallium,
which massively increases performance by avoiding any sort of buffer readback

fixes #5249

Reviewed-by: Jesse Natalie <jenatali@microsoft.com>
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Reviewed-by: Adam Jackson <ajax@redhat.com>
Acked-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13741>

2 years agomove util/indices to core util
Mike Blumenkrantz [Wed, 10 Nov 2021 14:57:48 +0000 (09:57 -0500)]
move util/indices to core util

these are useful tools to have outside of gallium

Reviewed-by: Jesse Natalie <jenatali@microsoft.com>
Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Reviewed-by: Adam Jackson <ajax@redhat.com>
Acked-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13741>

2 years agointel/genxml: Decode VALIGN/HALIGN values in XY_BLOCK_COPY_BLT
Kenneth Graunke [Wed, 10 Nov 2021 02:34:08 +0000 (18:34 -0800)]
intel/genxml: Decode VALIGN/HALIGN values in XY_BLOCK_COPY_BLT

For easier readability in INTEL_DEBUG=bat.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>

2 years agointel/genxml: Fix XY_BLOCK_COPY_BLT destination tiling field type
Kenneth Graunke [Wed, 10 Nov 2021 02:26:35 +0000 (18:26 -0800)]
intel/genxml: Fix XY_BLOCK_COPY_BLT destination tiling field type

Fixes: 2f58a63b2f1 ("intel/genxml: Add XY_BLOCK_COPY_BLT on Tigerlake and later.")
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>

2 years agointel/genxml: Fix MI_FLUSH_DW to actually specify the length properly
Kenneth Graunke [Wed, 10 Nov 2021 02:34:29 +0000 (18:34 -0800)]
intel/genxml: Fix MI_FLUSH_DW to actually specify the length properly

Fixes: 569afd37f15 ("intel/genxml: Copy gen12.xml to gen125.xml")
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>

2 years agointel/genxml: Collapse leading underscores on prefixed value defines
Kenneth Graunke [Fri, 29 Oct 2021 10:33:35 +0000 (03:33 -0700)]
intel/genxml: Collapse leading underscores on prefixed value defines

We prefix names with an underscore to make them "safe" C identifiers
when necessary.  For example, a value of "32x32" would become "_32x32".

However, when specifying something like

   <field ... prefix="BLOCK_SIZE">
     <value name="32x32" value="0"/>
   </field>

we already have a prefix that makes the field name safe.  We'd rather
generate a name with a single underscore, i.e.

    #define BLOCK_SIZE_32x32 0

rather than

    #define BLOCK_SIZE__32x32 0

This also fixes up affected defines in crocus.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>

2 years agointel/genxml: Simplify prefix handling for field value lists
Kenneth Graunke [Fri, 29 Oct 2021 10:05:21 +0000 (03:05 -0700)]
intel/genxml: Simplify prefix handling for field value lists

When a <field> tag has multiple <value> children, listing symbolic names
for possible field values, we generate #defines for each value, with an
optional prefix.  I don't know why, but this code was checking whether
self.default is None.  We want to generate the same list of #defines,
with a prefix, regardless of whether the field has a default value
specified or not.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>

2 years agoiris: Fix parameters to iris_copy_region in reallocate_resource_inplace
Kenneth Graunke [Fri, 1 Oct 2021 20:15:10 +0000 (13:15 -0700)]
iris: Fix parameters to iris_copy_region in reallocate_resource_inplace

We had accidentally passed <x, y, z, l> instead of <l, x, y, z>.

Fixes: b8ef3271c870 ("iris: Move suballocated resources to a dedicated allocation on export")
Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>
Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13815>

2 years agoradv: Don't crash if VkExternalImageFormatProperties isn't provided.
Bas Nieuwenhuizen [Sat, 13 Nov 2021 21:43:37 +0000 (22:43 +0100)]
radv: Don't crash if VkExternalImageFormatProperties isn't provided.

Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5623
Stable: 21.2 21.3
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13786>

2 years agogallium/u_blitter: clean up texcoords ZW when filling up just XY
Alejandro Piñeiro [Thu, 11 Nov 2021 12:04:36 +0000 (13:04 +0100)]
gallium/u_blitter: clean up texcoords ZW when filling up just XY

To avoid a scenario like this:
  * One blit needed the four components => XYZW filled up with 4 values
  * Following blit needing two components => ZW uses the previous values

We detected this using the v3d driver with the
arb_framebuffer_srgb-blit test, specifically:

  ./bin/arb_framebuffer_srgb-blit texture linear_to_srgb msaa enabled render -auto -fbo

The main linear to srgb with msaa (not doing the resolve yet) blit
requires the four components.

At the end (after a resolve copy), the test uses glReadPixels, and
internally it uses the blitter with two components, but the shader
still uses lod on the texel fetch, so it gets the one used for the
main blit, when it should be zero.

Right now v3d works fine even with that wrong value, and I assume that
any other driver too. But we can't ensure that would keep happening on
the future, so let's use correct values.

Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13753>

2 years agonir: Group per-primitive outputs at the end for driver location assign.
Timur Kristóf [Sun, 31 Oct 2021 20:09:08 +0000 (21:09 +0100)]
nir: Group per-primitive outputs at the end for driver location assign.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Acked-by: Caio Oliveira <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir: Don't compact per-vertex and per-primitive outputs together.
Timur Kristóf [Sun, 31 Oct 2021 20:08:08 +0000 (21:08 +0100)]
nir: Don't compact per-vertex and per-primitive outputs together.

Prevent nir_compact_varyings from putting per-vertex and per-primitive
output components in the same slot.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir: Lower cull and clip distance arrays for mesh shaders.
Timur Kristóf [Thu, 28 Oct 2021 14:28:37 +0000 (16:28 +0200)]
nir: Lower cull and clip distance arrays for mesh shaders.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir: Add new option to lower invocation ID from invocation index.
Timur Kristóf [Thu, 28 Oct 2021 14:26:37 +0000 (16:26 +0200)]
nir: Add new option to lower invocation ID from invocation index.

Add this as an option to nir_lower_compute_system_values_options
instead of just relying on the shader's options.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir, spirv: Don't mark NV_mesh_shader primitive indices as per-primitive.
Timur Kristóf [Thu, 21 Oct 2021 09:24:20 +0000 (11:24 +0200)]
nir, spirv: Don't mark NV_mesh_shader primitive indices as per-primitive.

They are not per-primitive in NV_mesh_shader, but a flat array.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir: Fix nir_lower_io with per primitive outputs.
Timur Kristóf [Thu, 21 Oct 2021 09:22:34 +0000 (11:22 +0200)]
nir: Fix nir_lower_io with per primitive outputs.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir: Print Mesh Shader specific info.
Timur Kristóf [Thu, 21 Oct 2021 09:21:50 +0000 (11:21 +0200)]
nir: Print Mesh Shader specific info.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agonir: Rename nir_get_io_vertex_index_src and include per-primitive I/O.
Timur Kristóf [Thu, 14 Oct 2021 16:14:12 +0000 (18:14 +0200)]
nir: Rename nir_get_io_vertex_index_src and include per-primitive I/O.

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>

2 years agoac/rgp: Initialize clock_calibration with memset.
Vinson Lee [Sat, 13 Nov 2021 19:50:07 +0000 (11:50 -0800)]
ac/rgp: Initialize clock_calibration with memset.

Fix defect reported by Coverity Scan.

Uninitialized scalar variable (UNINIT)
uninit_use_in_call: Using uninitialized value clock_calibration.
Field clock_calibration.reserved is uninitialized when calling
fwrite.

Fixes: 1ee85e8babe ("ac/rgp: add support for clock calibration")
Suggested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
Signed-off-by: Vinson Lee <vlee@freedesktop.org>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13783>

2 years agofreedreno/a4xx: hook up sample mask/id, used to determine helper invocs
Ilia Mirkin [Sat, 13 Nov 2021 23:31:10 +0000 (18:31 -0500)]
freedreno/a4xx: hook up sample mask/id, used to determine helper invocs

This fixes the various gl_HelperInvocation-based tests. There's a
lowering pass which converts it to (1 << sampleid) & samplemask.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>

2 years agofreedreno/a4xx: include guesses from a3xx for some of the constid's
Ilia Mirkin [Sat, 13 Nov 2021 09:24:43 +0000 (04:24 -0500)]
freedreno/a4xx: include guesses from a3xx for some of the constid's

The ones that are untested are left as comments. The ones that rename
values were tested manually.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>

2 years agofreedreno/a4xx: indicate whether outputs are uint/sint
Ilia Mirkin [Mon, 15 Nov 2021 06:23:01 +0000 (01:23 -0500)]
freedreno/a4xx: indicate whether outputs are uint/sint

Unclear whether this fixes anything, but the blob does seem to set
these. (Discovered while trying to determine if value clamping was
missing for non-32-bit integer formats, which fail in some tests.)

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>

2 years agofreedreno/a4xx: fix stencil-textured border colors
Ilia Mirkin [Sun, 14 Nov 2021 23:39:38 +0000 (18:39 -0500)]
freedreno/a4xx: fix stencil-textured border colors

These are implemented with unusual sampler formats, so the usual approach
of looking at the format descriptors fails.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>

2 years agofreedreno/a6xx: re-express buffer textures more logically
Ilia Mirkin [Mon, 15 Nov 2021 23:59:07 +0000 (18:59 -0500)]
freedreno/a6xx: re-express buffer textures more logically

Same as a5xx, move one bit into the tex type, one as a separate named
BUFFER bit.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13805>

2 years agofreedreno/a5xx: re-express buffer textures more logically
Ilia Mirkin [Mon, 15 Nov 2021 23:39:31 +0000 (18:39 -0500)]
freedreno/a5xx: re-express buffer textures more logically

Instead of treating it as 2 bits to enable, make BUFFER a type (and
extend the bitfield width), and then add a separate BUFFER bit
(ostensibly to perform the width/height concatenation but who knows).

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13805>

2 years agofreedreno/a4xx: add proper buffer texture support
Ilia Mirkin [Sun, 14 Nov 2021 04:51:54 +0000 (23:51 -0500)]
freedreno/a4xx: add proper buffer texture support

Rather than faking it as a 1d texture, add the buffer texture type, and
allow a full range of sizes.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13805>

2 years agoradeonsi: make si_llvm_emit_clipvertex non-static
Marek Olšák [Sun, 7 Nov 2021 21:30:19 +0000 (16:30 -0500)]
radeonsi: make si_llvm_emit_clipvertex non-static

it will be used in culling code

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: remove an incorrect comment at lds_byte0_accept_flag
Marek Olšák [Sun, 7 Nov 2021 20:06:54 +0000 (15:06 -0500)]
radeonsi: remove an incorrect comment at lds_byte0_accept_flag

Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: improve memory instruction tracking
Marek Olšák [Sun, 7 Nov 2021 16:34:31 +0000 (11:34 -0500)]
radeonsi: improve memory instruction tracking

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: add dcc_msaa option to enable DCC for MSAA
Marek Olšák [Fri, 5 Nov 2021 23:34:03 +0000 (19:34 -0400)]
radeonsi: add dcc_msaa option to enable DCC for MSAA

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: unify GFX9_VSGS_NUM_USER_SGPR and GFX9_TESGS_NUM_USER_SGPR
Marek Olšák [Sun, 7 Nov 2021 04:04:31 +0000 (00:04 -0400)]
radeonsi: unify GFX9_VSGS_NUM_USER_SGPR and GFX9_TESGS_NUM_USER_SGPR

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoac,radeonsi: cull small lines in the shader using the diamond exit rule
Marek Olšák [Sat, 6 Nov 2021 01:56:24 +0000 (21:56 -0400)]
ac,radeonsi: cull small lines in the shader using the diamond exit rule

It also splits clip_half_line_width into X and Y components for tighter
view culling.

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: add si_state_rasterizer::ngg_cull_flags_lines and rename the others
Marek Olšák [Sat, 6 Nov 2021 00:13:35 +0000 (20:13 -0400)]
radeonsi: add si_state_rasterizer::ngg_cull_flags_lines and rename the others

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: set EXTRA_DX_DY_PRECISION for lines where it's supported
Marek Olšák [Thu, 4 Nov 2021 22:28:17 +0000 (18:28 -0400)]
radeonsi: set EXTRA_DX_DY_PRECISION for lines where it's supported

Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: set PERPENDICULAR_ENDCAP_ENA for wide AA lines
Marek Olšák [Thu, 4 Nov 2021 22:09:08 +0000 (18:09 -0400)]
radeonsi: set PERPENDICULAR_ENDCAP_ENA for wide AA lines

This is more correct.

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: make si_get_small_prim_cull_info static
Marek Olšák [Sat, 6 Nov 2021 18:08:39 +0000 (14:08 -0400)]
radeonsi: make si_get_small_prim_cull_info static

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: use ac_build_load_to_sgpr in gfx10_emit_ngg_culling_epilogue
Marek Olšák [Sat, 6 Nov 2021 18:07:49 +0000 (14:07 -0400)]
radeonsi: use ac_build_load_to_sgpr in gfx10_emit_ngg_culling_epilogue

This is more correct because we are loading constants into an SGPR even
though there is no effect on behavior in this case.

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: fix view culling for wide lines
Marek Olšák [Sat, 6 Nov 2021 18:07:25 +0000 (14:07 -0400)]
radeonsi: fix view culling for wide lines

We need to cull wide lines as quads, but only for view culling.

Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agoradeonsi: fix shader culling with integer pixel centers
Marek Olšák [Sat, 6 Nov 2021 03:31:07 +0000 (23:31 -0400)]
radeonsi: fix shader culling with integer pixel centers

Only Nine was using them.

Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>

2 years agonir: remove double-validation of src component counts
Ilia Mirkin [Fri, 12 Nov 2021 23:57:53 +0000 (18:57 -0500)]
nir: remove double-validation of src component counts

The nir_tex_instr_src_size helper already sorts this out correctly, no
need to do it twice, and validate_src takes care of it.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13781>

2 years agoradv: Add more checking of cache sizes.
Bas Nieuwenhuizen [Sun, 14 Nov 2021 22:59:41 +0000 (23:59 +0100)]
radv: Add more checking of cache sizes.

Hopefully prevents things.

Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13789>

2 years agoradv: Fix memory corruption loading RT pipeline cache entries.
Bas Nieuwenhuizen [Sun, 14 Nov 2021 22:57:45 +0000 (23:57 +0100)]
radv: Fix memory corruption loading RT pipeline cache entries.

Oops. Forgot to account for the size here.

Fixes: ca2d96db51e ("radv: Add caching for RT pipelines.")
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13789>

2 years agofreedreno/ir3: fix image-to-tex flags, remove 3d -> array hack
Ilia Mirkin [Mon, 15 Nov 2021 23:31:24 +0000 (18:31 -0500)]
freedreno/ir3: fix image-to-tex flags, remove 3d -> array hack

The function would return both the 3d and array flags set for 2d array,
and would return just 3d for cubes. Fix the flags so that they are
appropriate for images.

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13804>

2 years agoiris: call brw_process_intel_debug_variable() earlier
Paulo Zanoni [Fri, 12 Nov 2021 23:16:38 +0000 (15:16 -0800)]
iris: call brw_process_intel_debug_variable() earlier

We're currently only calling it after creating the screen and the
bufmgr. There are a few cases where Iris checks for the DEBUG_BUFMGR
bit before we call brw_process_intel_debug_variable(), which means
intel_debug is 0 and so we don't run the debug code. Today, these are
all related to the creation of the workaround bo and its mmap.

I found this in a custom branch after I converted to INTEL_DEBUG an
environment variable that I had.

Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13780>

2 years agodocs: update branchpoint instructions
Eric Engestrom [Wed, 10 Nov 2021 22:38:21 +0000 (22:38 +0000)]
docs: update branchpoint instructions

Signed-off-by: Eric Engestrom <eric@engestrom.ch>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13745>

2 years agolima: enable PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS
Vasily Khoruzhick [Fri, 1 Oct 2021 13:32:52 +0000 (06:32 -0700)]
lima: enable PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS

Mali4x0 PP doesn't have a swizzle for load_input, so use POT-aligned
varyings to avoid unnecessary movs for vec3 and precision downgrade
in case if this vec3 is coordinates for a sampler

shader-db:

total instructions in shared programs: 15707 -> 15623 (-0.53%)
instructions in affected programs: 3906 -> 3822 (-2.15%)
helped: 47
HURT: 18
helped stats (abs) min: 1 max: 9 x̄: 3.09 x̃: 2
helped stats (rel) min: 1.49% max: 23.53% x̄: 8.20% x̃: 6.45%
HURT stats (abs)   min: 1 max: 7 x̄: 3.39 x̃: 3
HURT stats (rel)   min: 0.78% max: 20.59% x̄: 10.45% x̃: 10.97%
95% mean confidence interval for instructions value: -2.18 -0.41
95% mean confidence interval for instructions %-change: -5.70% -0.38%
Instructions are helped.

total spills in shared programs: 146 -> 136 (-6.85%)
spills in affected programs: 39 -> 29 (-25.64%)
helped: 6
HURT: 0

total fills in shared programs: 617 -> 598 (-3.08%)
fills in affected programs: 125 -> 106 (-15.20%)
helped: 6
HURT: 0

HURT shaders are vertex shaders where we may need more instructions
for non-packed vec3s. It's acceptable trade-off since we don't get
precision downgrade if this varying is coordinates for a sampler.

Reviewed-by: Erico Nunes <nunes.erico@gmail.com>
Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13151>

2 years agogallium: add PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS
Vasily Khoruzhick [Fri, 1 Oct 2021 13:23:48 +0000 (06:23 -0700)]
gallium: add PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS

Driver should enable this cap if it prefers varyings to be aligned
to power of two in a slot, i.e. vec4 in .xyzw, vec3 in .xyz, vec2 in .xy
or .zw

Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13151>

2 years agodocs/submittingpatches: mention use of the `-x` flag of `git cherry-pick` when backpo...
Eric Engestrom [Wed, 10 Nov 2021 11:08:05 +0000 (11:08 +0000)]
docs/submittingpatches: mention use of the `-x` flag of `git cherry-pick` when backporting a commit

Signed-off-by: Eric Engestrom <eric@engestrom.ch>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13746>

2 years agodocs/submittingpatches: add formatting around the release branches names
Eric Engestrom [Wed, 10 Nov 2021 11:06:44 +0000 (11:06 +0000)]
docs/submittingpatches: add formatting around the release branches names

Signed-off-by: Eric Engestrom <eric@engestrom.ch>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13746>

2 years agodocs/submittingpatches: add link to section describing how to make a backport MR
Eric Engestrom [Wed, 10 Nov 2021 11:05:49 +0000 (11:05 +0000)]
docs/submittingpatches: add link to section describing how to make a backport MR

Signed-off-by: Eric Engestrom <eric@engestrom.ch>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13746>

2 years agofreedreno/a6xx: Fix a bunch of 3D texture layout to match blob behavior.
Emma Anholt [Fri, 5 Nov 2021 18:41:36 +0000 (11:41 -0700)]
freedreno/a6xx: Fix a bunch of 3D texture layout to match blob behavior.

This doesn't get all of the texelfetch sampler3d testcases working, but
it's sure a lot more.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agofreedreno/cffdump: Handle the TILE_ALL flag in unit test generation.
Emma Anholt [Fri, 5 Nov 2021 19:00:33 +0000 (12:00 -0700)]
freedreno/cffdump: Handle the TILE_ALL flag in unit test generation.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agofreedreno/cffdump: Fix up formatting of texturator unit test script output.
Emma Anholt [Fri, 5 Nov 2021 16:56:44 +0000 (09:56 -0700)]
freedreno/cffdump: Fix up formatting of texturator unit test script output.

Now I don't need to re-clang-format as I generate testcases.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agofreedreno/fdl: Add support for unit testing 3D texture array strides.
Emma Anholt [Thu, 4 Nov 2021 23:20:56 +0000 (16:20 -0700)]
freedreno/fdl: Add support for unit testing 3D texture array strides.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agofreedreno/cffdump: Fix 64-bit reg decode in script mode.
Emma Anholt [Thu, 4 Nov 2021 23:03:13 +0000 (16:03 -0700)]
freedreno/cffdump: Fix 64-bit reg decode in script mode.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agofreedreno: Fix the texturator unit test script.
Emma Anholt [Thu, 4 Nov 2021 22:25:25 +0000 (15:25 -0700)]
freedreno: Fix the texturator unit test script.

We no longer have reg defs for the HI fields, so all we can access from
lua is the low 32 bits.  LUA has only double-precision floats for numbers,
so we can't fix that.  However, the high bits are almost always the same,
so it's not that big of a deal to be ignoring them for this script.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agofreedreno/fdl: Dump the generated layout when a layout test fails.
Emma Anholt [Fri, 5 Nov 2021 18:46:23 +0000 (11:46 -0700)]
freedreno/fdl: Dump the generated layout when a layout test fails.

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>

2 years agoutil/vl: move gallium vl_vlc.h and vl_rbsp.h to shared code.
Dave Airlie [Fri, 12 Nov 2021 04:30:37 +0000 (14:30 +1000)]
util/vl: move gallium vl_vlc.h and vl_rbsp.h to shared code.

For vulkan video I need these to parse slice headers, so move
them somewhere easier to get at them.

drops pointer_to_uintptr in favour of a cast.

Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13768>

2 years agointel/dev: Add platform enum with DG2 G10 & G11
Jordan Justen [Tue, 26 Oct 2021 08:03:07 +0000 (01:03 -0700)]
intel/dev: Add platform enum with DG2 G10 & G11

Based on Lionel's "intel/devinfo: store the different kind of DG2".

Ref: 361b3fee3c5 ("intel: move away from booleans to identify platforms")
Ref: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=9e22cfc5e9b92556a56d8a564cdab31045f29010
Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13797>

2 years agozink: always add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for 3D images
Mike Blumenkrantz [Fri, 29 Oct 2021 15:22:44 +0000 (11:22 -0400)]
zink: always add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for 3D images

there's no way to know what an image will be used for, so this bit needs
to always be added

fixes KHR-GL46.packed_pixels.varied_rectangle.compressed_rgb

cc: mesa-stable

Reviewed-by: Dave Airlie <airlied@redhat.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13798>

2 years agozink: stop running discard_if in generated tcs
Mike Blumenkrantz [Thu, 28 Oct 2021 14:18:44 +0000 (10:18 -0400)]
zink: stop running discard_if in generated tcs

just embarrassing smh

Reviewed-by: Dave Airlie <airlied@redhat.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13798>

2 years agozink: skip one GLES31 subset to avoid GPU hangs on Navi10
Samuel Pitoiset [Mon, 15 Nov 2021 17:26:10 +0000 (18:26 +0100)]
zink: skip one GLES31 subset to avoid GPU hangs on Navi10

Weird bug... I will figure out later.

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13796>

2 years agointel/genxml: generate video headers
Dave Airlie [Tue, 9 Nov 2021 00:33:38 +0000 (10:33 +1000)]
intel/genxml: generate video headers

This just generates the video engine pieces.

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>

2 years agointel/genxml: fix gen6 LD->VLD typo.
Dave Airlie [Sun, 14 Nov 2021 23:07:48 +0000 (09:07 +1000)]
intel/genxml: fix gen6 LD->VLD typo.

Pointed out by Ilia

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>

2 years agointel/genxml: capitalize decoder mode select properly
Matt Turner [Mon, 3 Feb 2020 23:50:54 +0000 (15:50 -0800)]
intel/genxml: capitalize decoder mode select properly

Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>