profile/ivi/mesa.git
14 years agor300g: fix context destroy under hyperz
Dave Airlie [Tue, 17 Aug 2010 09:01:18 +0000 (19:01 +1000)]
r300g: fix context destroy under hyperz

we were destroying the mm before unrefing all the objects, so segfault.

Signed-off-by: Dave Airlie <airlied@redhat.com>
14 years agoscons: Add -fno-common for 64-bit builds on Mac OS X.
Vinson Lee [Tue, 17 Aug 2010 07:20:58 +0000 (00:20 -0700)]
scons: Add -fno-common for 64-bit builds on Mac OS X.

This option is also needed for 64-bit builds if llvm is enabled.
Other the build fails during linking.

14 years agor600g: kill event type magic number in winsys
Dave Airlie [Tue, 17 Aug 2010 05:16:53 +0000 (15:16 +1000)]
r600g: kill event type magic number in winsys

these events have names, use them.

14 years agoconfigs: Add -ltalloc to linux-dri.
Vinson Lee [Tue, 17 Aug 2010 05:41:44 +0000 (22:41 -0700)]
configs: Add -ltalloc to linux-dri.

Fixes build after glsl2 branch merge, which added talloc dependency.

14 years agor600g: add user clip plane support.
Dave Airlie [Tue, 17 Aug 2010 03:40:15 +0000 (13:40 +1000)]
r600g: add user clip plane support.

Apart from the fact that the radeon.h/r600_states.h editing is a nightmare, this
wasn't so bad.

passes piglit user-clip test now also trivial tests.

Signed-off-by: Dave Airlie <airlied@redhat.com>
14 years agoi965: Use the implied move available in most brw_wm_emit brw_math() calls.
Eric Anholt [Tue, 17 Aug 2010 02:18:11 +0000 (19:18 -0700)]
i965: Use the implied move available in most brw_wm_emit brw_math() calls.

This saves an extra message reg move in the program, though I'm not
clear on whether it will have any performance impact other than cache
footprint.  It will also fix those math calls on Sandybridge, where
the brw_eu_emit.c brw_math() support relies on the implied move being
used.

14 years agoi965: Add disasm for Compr4 instruction compression.
Eric Anholt [Fri, 9 Jul 2010 00:05:42 +0000 (17:05 -0700)]
i965: Add disasm for Compr4 instruction compression.

14 years agoMerge branch 'glsl2'
Ian Romanick [Tue, 17 Aug 2010 02:08:53 +0000 (19:08 -0700)]
Merge branch 'glsl2'

Conflicts:
src/mesa/program/prog_optimize.c

14 years agoutil: Remove check_os_katmai_support.
Vinson Lee [Tue, 17 Aug 2010 01:52:37 +0000 (18:52 -0700)]
util: Remove check_os_katmai_support.

check_os_katmai_support checks that the operating system running on a
SSE-capable processor supports SSE. This is necessary for unpatched
2.2.x and earlier kernels. 2.4.x and later kernels support SSE.

check_os_katmai_support will disable SSE capabilities for 32-bit x86
operating systems for which there is no code path. Currently, this
function handles Linux, Windows, and several BSDs. Mac OS, Cygwin, and
Solaris are several operating systems with no code paths.

Rather than add code for the unhandled operating systems, remove this
function altogether. This will fix SSE detection on all recent 32-bit
x86 operating systems. This completely breaks functionality on unpatched
2.2.x and earlier kernels, although there are likely no Gallium3D users
on such operating systems.

14 years agor600g: drop libdrm_radeon link
Dave Airlie [Fri, 13 Aug 2010 03:54:25 +0000 (13:54 +1000)]
r600g: drop libdrm_radeon link

14 years agoglsl2: Refresh autogenerated file builtin_function.cpp.
Kenneth Graunke [Mon, 16 Aug 2010 22:18:23 +0000 (15:18 -0700)]
glsl2: Refresh autogenerated file builtin_function.cpp.

14 years agoglsl2: Add builtins profile for GLSL 1.30.
Kenneth Graunke [Mon, 16 Aug 2010 22:14:04 +0000 (15:14 -0700)]
glsl2: Add builtins profile for GLSL 1.30.

Many functions are currently wrapped with #if 0 since we haven't
implemented them yet.

14 years agolinker: Include compiler.h to avoid spurious warnings about INLINE
Ian Romanick [Mon, 16 Aug 2010 20:59:34 +0000 (13:59 -0700)]
linker: Include compiler.h to avoid spurious warnings about INLINE

14 years agoglsl2: Silence unused variable warning
Ian Romanick [Mon, 16 Aug 2010 20:59:01 +0000 (13:59 -0700)]
glsl2: Silence unused variable warning

14 years agotranslate: Move loop variable declaration outside for loop.
Vinson Lee [Mon, 16 Aug 2010 20:52:57 +0000 (13:52 -0700)]
translate: Move loop variable declaration outside for loop.

Fixes MSVC build.

14 years agoglcpp: Refresh autogenerated lexer and parser.
Kenneth Graunke [Mon, 16 Aug 2010 20:43:10 +0000 (13:43 -0700)]
glcpp: Refresh autogenerated lexer and parser.

14 years agoglcpp: Remove spurious newline generated by #version handling.
Kenneth Graunke [Mon, 16 Aug 2010 20:42:04 +0000 (13:42 -0700)]
glcpp: Remove spurious newline generated by #version handling.

This was causing line numbering to be off by one.  The newline comes
from the NEWLINE token at the end of the line; there's no need to
insert one.

14 years agor300g: fix assert in the rasterizer block for r3xx-r4xx
Marek Olšák [Mon, 16 Aug 2010 17:17:02 +0000 (19:17 +0200)]
r300g: fix assert in the rasterizer block for r3xx-r4xx

Reported-by: Niels Ole Salscheider <niels_ole@salscheider-online.de>
14 years agoir_to_mesa: Silence unused variable warnings
Ian Romanick [Mon, 16 Aug 2010 16:45:01 +0000 (09:45 -0700)]
ir_to_mesa: Silence unused variable warnings

14 years agoir_to_mesa: Clean up assertions in ir_to_mesa_visitor::visit(ir_texture *)
Ian Romanick [Mon, 16 Aug 2010 16:43:00 +0000 (09:43 -0700)]
ir_to_mesa: Clean up assertions in ir_to_mesa_visitor::visit(ir_texture *)

14 years agoir_to_mesa: Support texture rectangle targets
Ian Romanick [Mon, 16 Aug 2010 16:39:58 +0000 (09:39 -0700)]
ir_to_mesa: Support texture rectangle targets

14 years agotranslate: Remove unused temporary register.
José Fonseca [Mon, 16 Aug 2010 16:20:54 +0000 (17:20 +0100)]
translate: Remove unused temporary register.

Assuming the side-effect of x86_make_reg is also unnecessary.

14 years agotranslate: Eliminate void pointer arithmetic.
José Fonseca [Mon, 16 Aug 2010 16:18:14 +0000 (17:18 +0100)]
translate: Eliminate void pointer arithmetic.

Non-portable.

14 years agost/egl: Add support for EGL_KHR_fence_sync.
Chia-I Wu [Sun, 15 Aug 2010 09:24:14 +0000 (17:24 +0800)]
st/egl: Add support for EGL_KHR_fence_sync.

The extension is implemented by pipe_fence_handle.

14 years agoegl: Add support for EGL_KHR_fence_sync.
Chia-I Wu [Sun, 15 Aug 2010 09:09:48 +0000 (17:09 +0800)]
egl: Add support for EGL_KHR_fence_sync.

Individual drivers still need to support and enable the extension.

14 years agost/egl: Add support for EGL_KHR_reusable_sync.
Chia-I Wu [Sat, 14 Aug 2010 15:56:46 +0000 (23:56 +0800)]
st/egl: Add support for EGL_KHR_reusable_sync.

The extension is implemented by pipe_condvar.

14 years agoegl: Add support for EGL_KHR_reusable_sync.
Chia-I Wu [Sat, 14 Aug 2010 15:09:12 +0000 (23:09 +0800)]
egl: Add support for EGL_KHR_reusable_sync.

Individual drivers still need to support and enable the extension.

14 years agoegl: Update eglext.h.
Chia-I Wu [Sun, 15 Aug 2010 16:18:30 +0000 (00:18 +0800)]
egl: Update eglext.h.

Update to version 7 for EGL_KHR_fence_sync.

14 years agor600c: blit emit updates
Alex Deucher [Mon, 16 Aug 2010 15:34:17 +0000 (11:34 -0400)]
r600c: blit emit updates

- set VGT_MAX_VTX_INDX to a larger value
- emit PA_SC_AA_CONFIG.  The command checker in 2.6.36+
  requires this reg.

14 years agodraw_llvm: fix segfaults on non-SSE2 CPUs where it is disabled (v2)
Luca Barbieri [Sun, 15 Aug 2010 05:37:31 +0000 (07:37 +0200)]
draw_llvm: fix segfaults on non-SSE2 CPUs where it is disabled (v2)

Changes in v2:
- Change function name

Currently draw_llvm refuses to create itself on non-SSE2 CPUs due to
an alleged LLVM bug.

However, this is implemented improperly, because other parts of draw
still attempt to access draw->llvm, resulting in segfaults.

Instead, put the check in debug_get_option_draw_use_llvm, check that
before calling draw_llvm_create, and then check whether draw->llvm is
non-null everywhere else.

14 years agotranslate_sse: major rewrite (v5)
Luca Barbieri [Tue, 10 Aug 2010 00:14:04 +0000 (02:14 +0200)]
translate_sse: major rewrite (v5)

NOTE: Win64 is untested, and is thus currently disabled.
If you have such a system, please enable it and report whether it works.
To enable it, change src/gallium/auxiliary/translate/translate.c

Changes in v5:
- On Win64, preserve %xmm6 and %xmm7 as required by the ABI
- Use _WIN64 instead of WIN64

Changes in v4:
- Use x86_target() and x86_target_caps()
- Enable translate_sse in x86-64, but not in Win64

Changes in v3:
- Win64 support (untested)
- Use u_cpu_detect.h constants instead of #ifs

Changes in v2:
- Minimize #ifs
- Give a name to magic number CHANNELS_0001
- Add support for CPUs without SSE (only memcpy and swizzles, like non SSE2)
- Fixed comments

translate_sse is currently very limited to the point of
being useless in essentially all cases.

In particular, it only support some float32 and unorm8
formats and doesn't work on x86-64.

This commit rewrites it to support:
1. Dumb memory copy for any pair of identical formats
2. All formats that are swizzles of each other
3. Converting 32/64-bit floats and all 8/16/32-bit integers to 32-bit float
4. Converting unorm8/snorm8 to snorm16 and uscaled8/sscaled8 to sscaled16
5. Support for x86-64 (doesn't take advantage of it in any way though)

This new translate can even be useful to translate index buffers for
cards that lack 8-bit index support.

It passes the testsuite I wrote, but note that this is a major change, and more
testing would be great.

14 years agortasm: add minimal x86-64 support and new instructions (v5)
Luca Barbieri [Thu, 12 Aug 2010 16:27:02 +0000 (18:27 +0200)]
rtasm: add minimal x86-64 support and new instructions (v5)

Changes in v5:
- Add sse2_movdqa

Changes in v4:
- Use _WIN64 instead of WIN64

Changes in v3:
- Add target and target caps functions, so that they could be different in
  principle from the current CPU and they don't need #ifs to check

Changes in v2:
- Win64 support (untested)
- Use u_cpu_detect.h constants instead of #ifs

This commit adds minimal x86-64 support: only movs between registers
are supported for r8-r15, and x64_rexw() must be used to ask for 64-bit
operations.

It also adds several new instructions for the new translate_sse code.

movdqa

14 years agotranslate: add support for 8/16-bit indices
Luca Barbieri [Tue, 10 Aug 2010 08:47:23 +0000 (10:47 +0200)]
translate: add support for 8/16-bit indices

Currently, only 32-bit indices are supported, but some use cases
translate needs support for all types.

14 years agotranslate_sse: remove useless generated function wrappers
Luca Barbieri [Tue, 10 Aug 2010 08:31:48 +0000 (10:31 +0200)]
translate_sse: remove useless generated function wrappers

Currently translate_sse puts two trivial wrappers in the translate vtable.

These slow it down and enlarge the source code for no gain, except perhaps
the ability to set a breakpoint there, so remove them.

Breakpoints can be set on the caller of the translate functions, with no
loss of functionality.

14 years agotranslate_generic: factor out common code between linear and indexed
Luca Barbieri [Tue, 10 Aug 2010 08:27:14 +0000 (10:27 +0200)]
translate_generic: factor out common code between linear and indexed

This moves the common code into a separate ALWAYS_INLINE function.

14 years agotranslate_generic: use memcpy if possible (v3)
Luca Barbieri [Tue, 10 Aug 2010 07:51:20 +0000 (09:51 +0200)]
translate_generic: use memcpy if possible (v3)

Changes in v3:
- If we can do a copy, don't try to get an emit func, as that can assert(0)

Changes in v2:
- Add comment regarding copy_size

When used in GPU drivers, translate can be used to simultaneously
perform a gather operation, and convert away from unsupported formats.

In this use case, input and output formats will often be identical: clearly
it would make sense to use a memcpy in this case.

Instead, translate will insist to convert to and from 32-bit floating point
numbers.

This is not only extremely expensive, but it also loses precision for
32/64-bit integers and 64-bit floating point numbers.

This patch changes translate_generic to just use memcpy if the formats are
identical, non-blocked, and with an integral number of bytes per pixel (note
that all sensible vertex formats are like this).

14 years agodrwa: Fix polygon edge flags.
Chia-I Wu [Mon, 16 Aug 2010 14:00:45 +0000 (22:00 +0800)]
drwa: Fix polygon edge flags.

Fix a copy-and-paste error introduced by
f141abdc8fdbff41e16b0ce53fa3fa8fba32a7f9.

14 years agodraw: No need to make max_vertices even.
Chia-I Wu [Mon, 9 Aug 2010 17:05:25 +0000 (01:05 +0800)]
draw: No need to make max_vertices even.

Triangle strip alternates the front/back orientation of its triangles.
max_vertices was made even so that varray never splitted a triangle
strip at the wrong positions.

It did not work with triangle strips with adjacencies.  And it is no
longer relevant with vsplit.

14 years agodraw: Remove DRAW_PIPE_MAX_VERTICES and DRAW_PIPE_FLAG_MASK.
Chia-I Wu [Sat, 7 Aug 2010 13:02:13 +0000 (21:02 +0800)]
draw: Remove DRAW_PIPE_MAX_VERTICES and DRAW_PIPE_FLAG_MASK.

The higher bits of draw elements are no longer used for the stipple or
edge flags.

14 years agodrwa: Add PRIMITIVE macro to vsplit.
Chia-I Wu [Fri, 13 Aug 2010 16:05:28 +0000 (00:05 +0800)]
drwa: Add PRIMITIVE macro to vsplit.

PRIMITIVE is used by the indexed path to flush the entire primitive with
custom vertex count checks.  It replaces the existing fast path.

14 years agodraw: last_vertex_last is always true for GS and SO.
Chia-I Wu [Mon, 9 Aug 2010 16:39:23 +0000 (00:39 +0800)]
draw: last_vertex_last is always true for GS and SO.

That is, OpenGL decomposition rule is assumed.  There should be a
pipe_context state to specify the rules.

14 years agodraw: Remove varray and vcache.
Chia-I Wu [Sat, 7 Aug 2010 07:18:24 +0000 (15:18 +0800)]
draw: Remove varray and vcache.

They have been deprecated by vsplit.

14 years agodraw: Replace vcache by vsplit.
Chia-I Wu [Sat, 7 Aug 2010 12:44:02 +0000 (20:44 +0800)]
draw: Replace vcache by vsplit.

vcache decomposes primitives while vsplit splits primitives.  Splitting
is generally easier to do and is faster.  More importantly, vcache
depends on flatshade_first to decompose.  The outputs may have incorrect
vertex order which is significant to GS.

14 years agodraw: Replace varray by vsplit.
Chia-I Wu [Sat, 7 Aug 2010 07:12:14 +0000 (15:12 +0800)]
draw: Replace varray by vsplit.

vsplit is a superset of varray.  It sets the split flags comparing to
varray.

14 years agodraw: Add vsplit frontend.
Chia-I Wu [Fri, 6 Aug 2010 19:36:18 +0000 (03:36 +0800)]
draw: Add vsplit frontend.

vsplit is based on varray.  It sets the split flags when a primitive is
splitted.  It also has support for indexed primitives.

For indexed primitives, unlike vcache, vsplit splits the primitives
instead of decomposes them.

14 years agodraw: Add new util function draw_pt_trim_count.
Chia-I Wu [Sat, 7 Aug 2010 06:37:26 +0000 (14:37 +0800)]
draw: Add new util function draw_pt_trim_count.

draw_pt_trim_count is renamed from trim in draw_pt.c.

14 years agodraw: Simplify frontend interface a little.
Chia-I Wu [Sun, 18 Jul 2010 08:53:57 +0000 (16:53 +0800)]
draw: Simplify frontend interface a little.

The run method is simplified to take the start vertex and the vertex
count.

14 years agodraw: Add prim flags to middle ends.
Chia-I Wu [Sat, 7 Aug 2010 16:53:02 +0000 (00:53 +0800)]
draw: Add prim flags to middle ends.

Update the middle end interface to pass the primitive flags from the
frontends to the pipeline.  No frontend sets the flags yet.

14 years agodraw: Add flags to draw_prim_info.
Chia-I Wu [Sat, 7 Aug 2010 17:13:26 +0000 (01:13 +0800)]
draw: Add flags to draw_prim_info.

A primitive may be splitted in frontends.  The splitted primitives
should convey certain flag bits so that the decomposer can correctly
decide the stipple or edge flags.

This commit adds flags to draw_prim_info and updates the decomposer to
honor the flags.  Frontends and middle ends will be updated later.

14 years agost/mesa: test for FEATURE defines
nobled [Fri, 13 Aug 2010 20:23:11 +0000 (20:23 +0000)]
st/mesa: test for FEATURE defines

'struct dd_function_table' only conditionally contains
the function pointer NewFramebuffer and friends based on
FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h)

Fixes the build when the features are disabled and the vfuncs
don't exist.

14 years agodri/nouveau: test for FEATURE defines
nobled [Tue, 13 Jul 2010 02:53:32 +0000 (22:53 -0400)]
dri/nouveau: test for FEATURE defines

'struct dd_function_table' only conditionally contains
the function pointer NewFramebuffer and friends based on
FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h)

Fixes the build when the features are disabled and the vfuncs
don't exist.

14 years agodri/radeon: test for FEATURE defines
nobled [Tue, 13 Jul 2010 01:22:08 +0000 (21:22 -0400)]
dri/radeon: test for FEATURE defines

'struct dd_function_table' only conditionally contains
the function pointer NewFramebuffer and friends based on
FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h)

Fixes the build when the features are disabled and the vfuncs
don't exist.

14 years agox86: Remove unnecessary header from sse.h.
Vinson Lee [Mon, 16 Aug 2010 07:26:10 +0000 (00:26 -0700)]
x86: Remove unnecessary header from sse.h.

14 years agox86: Include missing headers in mmx.h.
Vinson Lee [Mon, 16 Aug 2010 06:29:09 +0000 (23:29 -0700)]
x86: Include missing headers in mmx.h.

Include compiler.h for _ASMAPI symbol.
Include mtypes.h for GLcontext symbol.

14 years agox86: Remove unnecessary header from 3dnow.h.
Vinson Lee [Mon, 16 Aug 2010 06:10:42 +0000 (23:10 -0700)]
x86: Remove unnecessary header from 3dnow.h.

14 years agoglsl: Clean up header file inclusion in slang_typeinfo.h.
Vinson Lee [Mon, 16 Aug 2010 05:44:29 +0000 (22:44 -0700)]
glsl: Clean up header file inclusion in slang_typeinfo.h.

Remove imports.h, mtypes.h, and slang_vartable.h.
Include glheader.h for GL symbols.

14 years agoglsl: Include missing header in slang_codegen.h.
Vinson Lee [Mon, 16 Aug 2010 05:39:51 +0000 (22:39 -0700)]
glsl: Include missing header in slang_codegen.h.

Include slang_vartable.h for slang_var_table symbol.

14 years agoglsl: Include missing header in slang_compile_operation.h.
Vinson Lee [Mon, 16 Aug 2010 05:38:23 +0000 (22:38 -0700)]
glsl: Include missing header in slang_compile_operation.h.

Include compiler.h for INLINE symbol.

14 years agoglsl: Clean up header file inclusion in slang_storage.h.
Vinson Lee [Mon, 16 Aug 2010 05:26:27 +0000 (22:26 -0700)]
glsl: Clean up header file inclusion in slang_storage.h.

Remove slang_compile.h.
Include glheader.h for GL symbols.
Include slang_compile_function.h for slang_function_scope symbol.
Include slang_compile_struct.h for slang_struct_scope symbol.
Include slang_compile_variable.h for slang_variable_scope symbol.
Include slang_typeinfo.h for slang_type_specifier symbol.
Include slang_utility.h for slang_atom_pool symbol.

14 years agoglsl: Clean up header file inclusion in slang_mem.h.
Vinson Lee [Mon, 16 Aug 2010 05:07:04 +0000 (22:07 -0700)]
glsl: Clean up header file inclusion in slang_mem.h.

slang_mem.h
Remove imports.h.
Include glheader.h for GL symbols.

slang_label.c
Include imports.h now that slang_mem.c does not include it.

14 years agoglsl: Clean up header file inclusion in slang_ir.h.
Vinson Lee [Mon, 16 Aug 2010 04:17:50 +0000 (21:17 -0700)]
glsl: Clean up header file inclusion in slang_ir.h.

Remove imports.h and slang_compile.h.
Include glheader.h for GL symbols.
Include slang_compile_variable.h for slang_variable symbol.

14 years agoglsl: Include missing header in slang_link.c.
Vinson Lee [Mon, 16 Aug 2010 04:13:32 +0000 (21:13 -0700)]
glsl: Include missing header in slang_link.c.

Include slang_compile.h for _slang_compile function.

14 years agoglsl: Include missing headers in slang_builtin.c.
Vinson Lee [Mon, 16 Aug 2010 04:10:16 +0000 (21:10 -0700)]
glsl: Include missing headers in slang_builtin.c.

Include slang_typeinfo.h for slang_type_specifier symbol.
Include slang_compiler_struct.h for slang_struct_ symbol.

14 years agor300g: fix an invalid pointer in free
Marek Olšák [Mon, 16 Aug 2010 03:05:43 +0000 (05:05 +0200)]
r300g: fix an invalid pointer in free

14 years agor300g: Let hyperz init fail
nobled [Sun, 15 Aug 2010 03:53:18 +0000 (03:53 +0000)]
r300g: Let hyperz init fail

Signed-off-by: Marek Olšák <maraeo@gmail.com>
14 years agor300g: Fix leaks in failed context creation
nobled [Sun, 15 Aug 2010 03:59:15 +0000 (03:59 +0000)]
r300g: Fix leaks in failed context creation

This changes r300_destroy_context() so it can be called
on a partially-initialized context, and uses it when
r300_create_context() hits a fatal error.

This makes sure r300_create_context() doesn't leak memory
or neglect to call r300_update_num_contexts() when it fails.

Signed-off-by: Marek Olšák <maraeo@gmail.com>
14 years agor300g: Fix macro
nobled [Sun, 15 Aug 2010 02:50:04 +0000 (02:50 +0000)]
r300g: Fix macro

This fixes a potential bug if (has_hyperz) is false
(it would still init the atom as if has_hyperz were true).

Signed-off-by: Marek Olšák <maraeo@gmail.com>
14 years agost/mesa: remove output register reads inside shaders
Marek Olšák [Sun, 15 Aug 2010 23:53:30 +0000 (01:53 +0200)]
st/mesa: remove output register reads inside shaders

This is a GLSL2 regression fix.

14 years agor300/compiler: implement DP2 opcode
Marek Olšák [Sun, 15 Aug 2010 21:44:16 +0000 (23:44 +0200)]
r300/compiler: implement DP2 opcode

14 years agor300/compiler: implement SSG opcode
Marek Olšák [Sun, 15 Aug 2010 20:53:17 +0000 (22:53 +0200)]
r300/compiler: implement SSG opcode

14 years agor300/compiler: fix allocation of temporaries in radeonTransformTEX
Marek Olšák [Sun, 15 Aug 2010 20:52:07 +0000 (22:52 +0200)]
r300/compiler: fix allocation of temporaries in radeonTransformTEX

14 years agollvmpipe: special case triangles which fall in a single 16x16 block
Keith Whitwell [Sun, 15 Aug 2010 21:58:26 +0000 (22:58 +0100)]
llvmpipe: special case triangles which fall in a single 16x16 block

Check for these and route them to a dedicated handler with one fewer
levels of recursive rasterization.

14 years agomesa: Check that _XOPEN_SOURCE is defined before using it.
Vinson Lee [Sun, 15 Aug 2010 20:08:15 +0000 (13:08 -0700)]
mesa: Check that _XOPEN_SOURCE is defined before using it.

14 years agollvmpipe: consolidate several loops in lp_rast_triangle
Keith Whitwell [Sun, 15 Aug 2010 16:40:39 +0000 (17:40 +0100)]
llvmpipe: consolidate several loops in lp_rast_triangle

14 years agollvmpipe: remove all traces of step arrays, pos_tables
Keith Whitwell [Sun, 15 Aug 2010 16:31:13 +0000 (17:31 +0100)]
llvmpipe: remove all traces of step arrays, pos_tables

No need to calculate these values any longer, nor to store them in the
bin data.  Improves isosurf a bit more, 115->123 fps.

14 years agollvmpipe: eliminate last usage of step array in rast_tmp.h
Keith Whitwell [Sun, 15 Aug 2010 16:24:54 +0000 (17:24 +0100)]
llvmpipe: eliminate last usage of step array in rast_tmp.h

For 16 and 64 pixel levels, calculate a mask which is linear in x and
y (ie not in the swizzle layout).

When iterating over full and partial masks, figure out position by
manipulating the bit number set in the mask, rather than relying on
postion arrays.

Similarly, calculate the lower-level c values from dcdx, dcdy and the
position rather than relying on the step array.

14 years agollvmpipe: don't refer to plane->step when dcdx or dcdy would do
Keith Whitwell [Sun, 15 Aug 2010 16:22:25 +0000 (17:22 +0100)]
llvmpipe: don't refer to plane->step when dcdx or dcdy would do

14 years agollvmpipe: also use build_mask at 16, 64 pixel levels
Keith Whitwell [Sun, 15 Aug 2010 15:49:26 +0000 (16:49 +0100)]
llvmpipe: also use build_mask at 16, 64 pixel levels

14 years agollvmpipe: version of block4 which doesn't need the full step array
Keith Whitwell [Sun, 15 Aug 2010 15:32:45 +0000 (16:32 +0100)]
llvmpipe: version of block4 which doesn't need the full step array

No noticable slowdown with isosurf.

14 years agollvmpipe: reorganize block4 loop, nice speedup
Keith Whitwell [Sun, 15 Aug 2010 15:21:46 +0000 (16:21 +0100)]
llvmpipe: reorganize block4 loop, nice speedup

isosurf 95->115 fps just by exchanging the two inner loops in this
function...

14 years agogallium/docs: Cleanup debugging. Spelling, grammar, organization.
Corbin Simpson [Sun, 15 Aug 2010 10:26:58 +0000 (03:26 -0700)]
gallium/docs: Cleanup debugging. Spelling, grammar, organization.

14 years agogallium/docs: Add formatting for envvar role; change debugging.
Corbin Simpson [Sun, 15 Aug 2010 10:05:18 +0000 (03:05 -0700)]
gallium/docs: Add formatting for envvar role; change debugging.

Per Jakob's request. Not super-pretty, but it's a good point for modding
later.

14 years agoglsl: Fix self inclusion in slang_compile_function.h.
Vinson Lee [Sun, 15 Aug 2010 08:30:02 +0000 (01:30 -0700)]
glsl: Fix self inclusion in slang_compile_function.h.

Fix self inclusion introduced by commit
4fef77c7c5455b983daa93cacf90d2b3baa3967e.

14 years agoglsl: Include missing header in slang_ir.h.
Vinson Lee [Sun, 15 Aug 2010 08:23:10 +0000 (01:23 -0700)]
glsl: Include missing header in slang_ir.h.

Include prog_instruction.h for gl_inst_opcode symbol.

14 years agoglsl: Clean up header file inclusion in slang_link.h.
Vinson Lee [Sun, 15 Aug 2010 08:06:43 +0000 (01:06 -0700)]
glsl: Clean up header file inclusion in slang_link.h.

Remove slang_compile.h.
Include mtypes.h for GLcontext symbol.

14 years agoglsl: Clean up header file inclusion in slang_label.h.
Vinson Lee [Sun, 15 Aug 2010 07:59:34 +0000 (00:59 -0700)]
glsl: Clean up header file inclusion in slang_label.h.

Move mtypes.h and prog_instruction.h to slang_label.c.
Remove imports.h.
Include glheader.h from GL symbols.

14 years agor300g: mark HiZ/ZMask_clear atoms as non-dirty after emission in clear
Marek Olšák [Sun, 15 Aug 2010 07:13:50 +0000 (09:13 +0200)]
r300g: mark HiZ/ZMask_clear atoms as non-dirty after emission in clear

14 years agoglsl: Clean up header file inclusion in slang_emit.h.
Vinson Lee [Sun, 15 Aug 2010 06:40:42 +0000 (23:40 -0700)]
glsl: Clean up header file inclusion in slang_emit.h.

Remove imports.h.
Remove mtypes.h.
Remove slang_compile.h.
Include glheader.h for GL symbols.
Include slang_vartable.h for slang_var_table symbol.

14 years agoglsl: Clean up header file inclusion in slang_compile.h.
Vinson Lee [Sun, 15 Aug 2010 06:32:08 +0000 (23:32 -0700)]
glsl: Clean up header file inclusion in slang_compile.h.

Remove imports.h.
Remove slang_typeinfo.h.
Remove slang_compile_operation.h.
Include glheader.h for GL symbols.
Include slang_utility.h for slang_atom_pool symbol.

14 years agoglsl: Include missing headers in slang_compile_struct.h.
Vinson Lee [Sun, 15 Aug 2010 06:18:29 +0000 (23:18 -0700)]
glsl: Include missing headers in slang_compile_struct.h.

Include glheader.h for GL symbols.
Include slang_utility.h for slang_atom symbol.

14 years agor300g: do not use HiZ if HiZ RAM is not properly initialized
Marek Olšák [Sun, 15 Aug 2010 01:14:25 +0000 (03:14 +0200)]
r300g: do not use HiZ if HiZ RAM is not properly initialized

14 years agor300g: rename dirty_zmask -> zmask_in_use
Marek Olšák [Sun, 15 Aug 2010 00:18:15 +0000 (02:18 +0200)]
r300g: rename dirty_zmask -> zmask_in_use

14 years agor300g: do not clear with blitter if we clear just the ZMask RAM
Marek Olšák [Fri, 13 Aug 2010 02:57:30 +0000 (04:57 +0200)]
r300g: do not clear with blitter if we clear just the ZMask RAM

This skips the blitter clear path entirely if the color is not cleared and
the depth+stencil is cleared with the ZMask.

14 years agor300g: do not use fastfill if ZMask RAM is not properly initialized
Marek Olšák [Sat, 14 Aug 2010 16:37:04 +0000 (18:37 +0200)]
r300g: do not use fastfill if ZMask RAM is not properly initialized

z_fastfill -> dirty_zmask[level].

14 years agor300g: separate num_cs_end_dwords out from prepare_for_rendering
Marek Olšák [Sat, 14 Aug 2010 20:35:23 +0000 (22:35 +0200)]
r300g: separate num_cs_end_dwords out from prepare_for_rendering

14 years agoglsl: Include missing headers in slang_print.h.
Vinson Lee [Sun, 15 Aug 2010 06:14:42 +0000 (23:14 -0700)]
glsl: Include missing headers in slang_print.h.

Include glheader.h for GL symbols.
Include slang_compile_function.h for slang_function symbol.
Include slang_compile_operation.h for slang_operation symbol.
Include slang_compile_variable.h for slang_variable and slang_variable_scope symbols.
Include slang_typeinfo.h for slang_type_qualifer and slang_fully_specified_type symbols.

14 years agoglsl: Include missing headers in slang_compile_function.h
Vinson Lee [Sun, 15 Aug 2010 05:57:47 +0000 (22:57 -0700)]
glsl: Include missing headers in slang_compile_function.h

Include glheader.h for GL symbols.
Include slang_compile_function.h for slang_function symbol.
Include slang_compile_operation.h for slang_operation symbol.
Include slang_compile_variable.h for slang_variable and
slang_variable_scope symbols.
Include slang_log.h for slang_info_log symbols.
Include slang_utility.h for slang_atom and slang_atom_pool symbols.

14 years agoglsl: Include missing headers in slang_compiler_operation.h.
Vinson Lee [Sun, 15 Aug 2010 05:43:26 +0000 (22:43 -0700)]
glsl: Include missing headers in slang_compiler_operation.h.

Include glheader.h for GL symbols.
Include slang_compile_variable.h for slang_variable_scope symbol.
Include slang_utility.h for slang_atom symbol.

14 years agoglsl: Include missing headers in slang_simplify.h.
Vinson Lee [Sun, 15 Aug 2010 04:21:10 +0000 (21:21 -0700)]
glsl: Include missing headers in slang_simplify.h.

Include glheader.h for GL symbols.
Include slang_compile.h for slang_name_space symbol.
Include slang_compile_function.h for slang_function symbol.
Include slang_compile_operation.h for slang_operation symbol.
Include slang_log.h for slang_info_log symbol.
Include slang_utility.h for slang_atom_pool symbol.

14 years agoglsl: Include missing header in slang_utility.h.
Vinson Lee [Sun, 15 Aug 2010 01:56:17 +0000 (18:56 -0700)]
glsl: Include missing header in slang_utility.h.

Include glheader.h for GL symbols.