review.tizen.org Git - platform/upstream/mesa.git/log

Marek Olšák [Tue, 30 Dec 2014 17:41:25 +0000 (18:41 +0100)]

radeonsi: emit SURFACE_SYNC last

This fixes a case where a transform feedback buffer is fed back as an index
buffer, because SURFACE_SYNC must be after VS_PARTIAL_FLUSH.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 14:09:22 +0000 (15:09 +0100)]

radeonsi: flush all CB/DB caches unconditionally when changing the framebuffer

This is easier to read and will work better with shader image stores.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 00:25:48 +0000 (01:25 +0100)]

radeonsi: change TC cache flushing strategy for textures

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Tue, 30 Dec 2014 15:45:51 +0000 (16:45 +0100)]

radeonsi: improve and fix streamout flushing

- we don't usually need to flush TC L2
- we should flush KCACHE
  (not really an issue now since we always flush KCACHE when updating
   descriptors, but it could be a problem if we used CE, which doesn't
   require flushing KCACHE)
- add an explicit VS_PARTIAL_FLUSH flag

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 13:53:11 +0000 (14:53 +0100)]

radeonsi: use TC L2 for CP DMA operations with shader resources on CIK

So that TC L2 doesn't need to be flushed.

The only problem is with index buffers, which don't use TC.
A simple solution is added that flushes TC L2 before a draw call (TC_L2_dirty).

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 12:22:00 +0000 (13:22 +0100)]

radeonsi: use TC L2 for updating descriptors on CIK

This allows not flushing TC L2 on CIK later.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 21:16:53 +0000 (22:16 +0100)]

radeonsi: don't use TC L2 for updating descriptors on SI

It's causing problems, because we mix uncached CP DMA with cached WRITE_DATA
when updating the same memory.

The solution for SI is to use uncached access here, because CP DMA doesn't
support cached access.

CIK will be handled in the next patch.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 13:45:49 +0000 (14:45 +0100)]

radeonsi: only flush the right set of caches for CP DMA operations

That's either framebuffer caches or caches for shader resources.
The motivation is that framebuffer caches need to be flushed very rarely
here.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 28 Dec 2014 22:11:38 +0000 (23:11 +0100)]

radeonsi: implement separate ICACHE and KCACHE flush for SI

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Tue, 30 Dec 2014 12:08:32 +0000 (13:08 +0100)]

radeonsi: add a combined flag for flushing a framebuffer

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 13:02:46 +0000 (14:02 +0100)]

radeonsi: rename flush flags, split the TC flag into L1 and L2

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 12:39:42 +0000 (13:39 +0100)]

r600g,radeonsi: separate cache flush flags

I will rename them for radeonsi.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Mon, 29 Dec 2014 12:27:46 +0000 (13:27 +0100)]

r600g: move r6xx-specific streamout flush flagging into r600g

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 21:01:43 +0000 (22:01 +0100)]

radeonsi: only set BC_OPTIMIZE_DISABLE when necessary

SPI_PS_IN_CONTROL is moved into the SPI mapping state.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 20:05:14 +0000 (21:05 +0100)]

radeonsi: do not define FACE as an ordinary PS input

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 19:23:51 +0000 (20:23 +0100)]

radeonsi: remove flatshade from the shader key

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 19:09:51 +0000 (20:09 +0100)]

radeonsi: remove special handling of TGSI_INTERPOLATE_COLOR in shader codegen

It doesn't do anything useful. And colors are floating-point, so we can use
fs.interp, remove "flatshade" from the shader key, and rely on the FLAT_SHADE
state only (in the next patch).

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 13:51:01 +0000 (14:51 +0100)]

radeonsi: implement VERTEXID_NOBASE and BASEVERTEX system values

Only done for completeness. Not used by anything yet.

Tested by advertising PIPE_CAP_VERTEXID_NOBASE.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 13:41:49 +0000 (14:41 +0100)]

radeonsi: fix VertexID for OpenGL

This fixes all failing piglit VertexID tests.

Cc: 10.4 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 28 Dec 2014 20:51:35 +0000 (21:51 +0100)]

radeonsi: clarify a hw bug in shader exports

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 19:45:35 +0000 (20:45 +0100)]

radeonsi: use ordered compares for SSG and face selection

Ordered compares are what you have in C. Unordered compares are the result
of negating ordered compares (they return true if either argument is NaN).

That special NaN behavior is completely useless here, and unordered
compares produce horrible code with all stable LLVM versions.
(I think that has been fixed in LLVM git)

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
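
The ordered/unordered distinction described above can be illustrated in plain C. This is only a sketch, not the radeonsi code: the helper names (cmp_olt, cmp_uge, sign_ordered, check_compare_semantics) are hypothetical, and the NaN result of sign_ordered is an assumption of this example rather than something the commit states.

```c
#include <assert.h>
#include <math.h>
#include <stdbool.h>

/* Ordered "less than": what C's < does. False if either operand is NaN. */
static bool cmp_olt(float a, float b)
{
    return a < b;
}

/* Unordered "greater or equal": the negation of the ordered compare above.
 * True if a >= b, and also true if either operand is NaN. */
static bool cmp_uge(float a, float b)
{
    return !(a < b);
}

/* Hypothetical SSG-style sign function built only from ordered compares.
 * A NaN input fails both tests and falls through to 0 (an assumption of
 * this sketch, not a statement about the driver). */
static float sign_ordered(float x)
{
    if (x > 0.0f)
        return 1.0f;
    if (x < 0.0f)
        return -1.0f;
    return 0.0f;
}

/* Small self-check of the NaN behavior described in the comments. */
static void check_compare_semantics(void)
{
    assert(!cmp_olt(NAN, 1.0f));   /* ordered: NaN makes it false */
    assert(cmp_uge(NAN, 1.0f));    /* unordered: NaN makes it true */
    assert(sign_ordered(NAN) == 0.0f);
}
```

For non-NaN inputs cmp_uge behaves exactly like >=, so the two flavors differ only when a NaN is involved.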

Marek Olšák [Tue, 30 Dec 2014 23:51:27 +0000 (00:51 +0100)]

radeonsi: remove unused and not useful variables

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Tue, 30 Dec 2014 23:42:22 +0000 (00:42 +0100)]

radeonsi: remove init config from states

It really doesn't do anything there.

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Tue, 30 Dec 2014 22:49:59 +0000 (23:49 +0100)]

radeonsi: reduce the size of si_pm4_state

- the relocs array is unused, remove it
- ndw is at most 115 (init), set 140 as the maximum
- compute needs 4 buffers per state, graphics only needs 1; set 4 as the maximum

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>

Marek Olšák [Sun, 4 Jan 2015 20:58:42 +0000 (21:58 +0100)]

tgsi: add uses_centroid into tgsi_shader_info

Marek Olšák [Sun, 4 Jan 2015 14:43:47 +0000 (15:43 +0100)]

st/mesa: fix GL_PRIMITIVE_RESTART_FIXED_INDEX

Cc: 10.2 10.3 10.4 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>

Marek Olšák [Sun, 4 Jan 2015 13:27:33 +0000 (14:27 +0100)]

vbo: ignore primitive restart if FixedIndex is enabled in DrawArrays

From GL 4.4 Core profile:

  If both PRIMITIVE_RESTART and PRIMITIVE_RESTART_FIXED_INDEX are
  enabled, the index value determined by PRIMITIVE_RESTART_FIXED_INDEX is
  used. If PRIMITIVE_RESTART_FIXED_INDEX is enabled, primitive restart is not
  performed for array elements transferred by any drawing command not taking a
  type parameter, including all of the *Draw* commands other than
  *DrawElements*.

Cc: 10.2 10.3 10.4 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
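
The rule quoted above can be sketched as a small decision function. This is a hedged illustration, not the vbo code: struct restart_state and both helpers are hypothetical names. The fixed index value of 2^N - 1 for an N-bit index type comes from the GL specification rather than from the quoted paragraph, and the case where only PRIMITIVE_RESTART is enabled is passed through unchanged because the quote does not address it.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical snapshot of the relevant GL state (not Mesa's struct). */
struct restart_state {
    bool primitive_restart;        /* PRIMITIVE_RESTART enabled */
    bool primitive_restart_fixed;  /* PRIMITIVE_RESTART_FIXED_INDEX enabled */
    uint32_t restart_index;        /* user-specified restart index */
};

/* index_size is the size of one index in bytes (1, 2 or 4), or 0 for a
 * draw command that takes no type parameter (e.g. DrawArrays). */
static bool restart_active(const struct restart_state *s, unsigned index_size)
{
    if (s->primitive_restart_fixed)
        return index_size != 0;    /* never for non-indexed draws */
    return s->primitive_restart;   /* not covered by the quote: passthrough */
}

/* The index that triggers a restart when restart_active() is true. */
static uint32_t effective_restart_index(const struct restart_state *s,
                                        unsigned index_size)
{
    if (s->primitive_restart_fixed) {
        /* The fixed index wins even if PRIMITIVE_RESTART is also enabled:
         * 2^N - 1 for an N-bit index type. Avoid shifting by 32. */
        if (index_size >= 4)
            return UINT32_MAX;
        return (1u << (index_size * 8)) - 1u;
    }
    return s->restart_index;
}
```

With both enables set and a user index of 7, a 16-bit indexed draw would restart on 0xFFFF, and a DrawArrays call would not restart at all.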

Eric Anholt [Tue, 6 Jan 2015 19:30:19 +0000 (11:30 -0800)]

vc4: Fix scaling W projection of the Z coordinate when there's a Z offset.

Fixes piglit glsl-fs-fragcoord-zw-perspective, es3conform
gl_FragCoord_z_frag, and the rest of the piglit glsl 1.10 interpolation
tests.

Eric Anholt [Tue, 6 Jan 2015 00:34:58 +0000 (16:34 -0800)]

vc4: Fix deletion from the program cache.

The key is, oddly enough, in the key field, not in the data field (which
is the vc4_compiled_shader *). Fixes regular failures in fp-long-alu.

Eric Anholt [Sat, 3 Jan 2015 06:55:37 +0000 (22:55 -0800)]

vc4: Skip storing the Z/S contents when it's invalidated.

Improves framerate of 5 seconds of es2gears by 1.57473% +/- 0.669409%
(n=67).

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>

Eric Anholt [Sun, 21 Dec 2014 20:48:59 +0000 (12:48 -0800)]

gallium: Plumb the swap INVALIDATE_ANCILLARY flag through more layers.

v2: Instead of telling the driver that the window system ancillaries have
    been invalidated (when the driver doesn't know which of its buffers
    are the window system's!), introduce a method for invalidating
    specific surfaces.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>

Eric Anholt [Sun, 21 Dec 2014 19:51:33 +0000 (11:51 -0800)]

egl: Inform the client API when ancillary buffers may become undefined.

This is part of the EGL spec, and is useful for a tiled renderer to avoid
the memory bandwidth cost of storing the depth/stencil buffers.

Reviewed-by: Jose Fonseca <jfonseca@vmware.com>

Vinson Lee [Mon, 5 Jan 2015 22:53:03 +0000 (14:53 -0800)]

ax_prog_flex.m4: Merge upstream OpenBSD fixes.

Merge the following upstream autoconf-archive patches.

ax_prog_flex: change grep syntax to accept e.g. "flex.real" in case a wrapper or symlink is used.
AX_PROG_FLEX: avoid use of grep empty string escape extension (fix for OpenBSD)
AX_PROG_FLEX: Also accept gflex.

Signed-off-by: Vinson Lee <vlee@freedesktop.org>
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Jonathan Gray <jsg@openbsd.org>

Tom Stellard [Tue, 23 Dec 2014 15:26:23 +0000 (10:26 -0500)]

radeon/llvm: Use amdgcn triple for SI+ on LLVM >= 3.6

Tom Stellard [Wed, 15 Oct 2014 16:24:30 +0000 (12:24 -0400)]

radeonsi: Cache LLVMTargetMachine object in si_screen

Rather than building a new one every compile. This should reduce some
of the overhead of compiling shaders.

One consequence of this change is that we lose the MachineInstr dumps
when dumping the shaders via R600_DEBUG. The LLVM IR and assembly are
still dumped, and if you still want to see the MachineInstr dump, you
can run the dumped LLVM IR through llc.
