Søren Sandmann Pedersen [Sat, 19 Sep 2009 06:32:28 +0000 (02:32 -0400)]
Store the operator table more compactly.
The four cases for each operator:
none-are-opaque, src-is-opaque, dest-is-opaque, both-are-opaque
are packed into one uint32_t per operator. The relevant strength
reduced operator can then be found by packing the source-is-opaque and
dest-is-opaque into two bits and shifting that number of bytes.
Chris Wilson pointed out a bug in the original version of this commit:
dest_is_opaque and source_is_opaque were used as booleans, but their
actual values were the results of a logical AND with the
FAST_PATH_OPAQUE flag, so the shift value was wildly wrong.
The only reason it actually passed the test suite (on x86) was that
the compiler computed the shift amount in the cl register, and the low
byte of FAST_PATH_OPAQUE happens to be 0, so no shifting actually took
place, and the original operator was returned.
Søren Sandmann Pedersen [Fri, 18 Sep 2009 18:10:31 +0000 (14:10 -0400)]
Make the operator strength reduction constant time.
By extending the operator information table to cover all operators we
can replace the loop with a table look-up. At the same time, base the
operator optimization on the computed flags rather than the ones in
the image struct.
Finally, as an extra optimization, we no longer ignore the case where
there is a mask. Instead we consider the source opaque if both source
and mask are opaque, or if the source is opaque and the mask is
missing.
Søren Sandmann Pedersen [Thu, 17 Sep 2009 07:16:27 +0000 (03:16 -0400)]
Add a fast path cache
This patch adds a cache in front of the fast path tables to reduce the
overhead of pixman_composite(). It is fixed size with move-to-front to
make sure the most popular fast paths are at the beginning of the cache.
The cache is thread local to avoid locking.
Søren Sandmann Pedersen [Sat, 6 Mar 2010 01:40:41 +0000 (20:40 -0500)]
Post-release version bump to 0.17.11
Søren Sandmann Pedersen [Sat, 6 Mar 2010 01:06:08 +0000 (20:06 -0500)]
Pre-release version bump to 0.17.10
Søren Sandmann Pedersen [Fri, 26 Feb 2010 19:15:22 +0000 (14:15 -0500)]
Move __force_align_arg_pointer workaround before composite32()
Since otherwise the workaround won't take effect when you call
pixman_image_composite32() directly.
Søren Sandmann Pedersen [Thu, 4 Mar 2010 07:30:22 +0000 (02:30 -0500)]
Merge branch 'more-flags'
Søren Sandmann Pedersen [Sat, 27 Feb 2010 05:37:19 +0000 (00:37 -0500)]
test: Remove obsolete comment
Siarhei Siamashka [Wed, 24 Feb 2010 02:14:45 +0000 (04:14 +0200)]
ARM: added 'neon_composite_over_reverse_n_8888' fast path
This fast path function improves performance of 'poppler' cairo-perf trace.
Benchmark from ARM Cortex-A8 @720MHz
before:
[ # ] backend test min(s) median(s) stddev. count
[ 0] image poppler 38.986 39.158 0.23% 6/6
after:
[ # ] backend test min(s) median(s) stddev. count
[ 0] image poppler 24.981 25.136 0.28% 6/6
Siarhei Siamashka [Wed, 24 Feb 2010 00:26:57 +0000 (02:26 +0200)]
ARM: added 'neon_composite_src_x888_8888' fast path
This fast path function improves performance of 'gnome-system-monitor'
cairo-perf trace.
Benchmark from ARM Cortex-A8 @720MHz
before:
[ # ] backend test min(s) median(s) stddev. count
[ 0] image gnome-system-monitor 68.838 68.899 0.05% 5/6
after:
[ # ] backend test min(s) median(s) stddev. count
[ 0] image gnome-system-monitor 53.336 53.384 0.09% 6/6
Siarhei Siamashka [Tue, 23 Feb 2010 23:44:00 +0000 (01:44 +0200)]
ARM: added 'neon_composite_over_n_8888_8888_ca' fast path
This fast path function improves performance of 'firefox-talos-gfx'
cairo-perf trace.
Benchmark from ARM Cortex-A8 @720MHz
before:
[ # ] backend test min(s) median(s) stddev. count
[ 0] image firefox-talos-gfx 139.969 141.176 0.35% 6/6
after:
[ # ] backend test min(s) median(s) stddev. count
[ 0] image firefox-talos-gfx 111.810 112.196 0.23% 6/6
Søren Sandmann Pedersen [Mon, 15 Feb 2010 00:18:35 +0000 (19:18 -0500)]
Restructure the flags computation in compute_image_info().
Restructure the code to use switches instead of ifs. This saves a few
comparisons and make the code slightly easier to follow. Also add some
comments.
Søren Sandmann Pedersen [Mon, 15 Feb 2010 00:14:44 +0000 (19:14 -0500)]
Move workaround code to pixman-image.c
It is more natural to put it where all the other flags are computed.
Søren Sandmann Pedersen [Mon, 22 Feb 2010 11:06:22 +0000 (06:06 -0500)]
Turn need_workaround into another flag.
Instead of storing it as a boolean in the image struct, just use
another flag for it.
Søren Sandmann Pedersen [Fri, 18 Sep 2009 15:54:21 +0000 (11:54 -0400)]
Eliminate _pixman_image_is_opaque() in favor of a new FAST_PATH_IS_OPAQUE flag
The new FAST_PATH_IS_OPAQUE flag is computed along with the others in
_pixman_image_validate().
Søren Sandmann Pedersen [Fri, 18 Sep 2009 08:17:57 +0000 (04:17 -0400)]
Eliminate _pixman_image_is_solid()
Instead of calling this function in compute_image_info(), just do the
relevant checks when the extended format is computed.
Move computation of solidness to validate
Søren Sandmann Pedersen [Fri, 18 Sep 2009 08:06:30 +0000 (04:06 -0400)]
Move computation of extended format code to validate.
Instead of computing the extended format on every composite, just
compute it once and store it in the image.
Søren Sandmann Pedersen [Mon, 22 Feb 2010 10:16:27 +0000 (05:16 -0500)]
Add new FAST_PATH_SIMPLE_REPEAT flag
This flags indicates that the image is untransformed an
repeating. Such images can be composited quickly by simply repeating
the composite operation.
Søren Sandmann Pedersen [Fri, 18 Sep 2009 07:54:36 +0000 (03:54 -0400)]
Compute the image flags at validation time instead of composite time
Instead of computing all the image flags at composite time, we compute
them once in _pixman_image_validate() and cache them in the image.
Søren Sandmann Pedersen [Thu, 25 Feb 2010 03:09:41 +0000 (22:09 -0500)]
RELEASING: Update the release instructions.
Søren Sandmann Pedersen [Thu, 25 Feb 2010 03:02:13 +0000 (22:02 -0500)]
Post-release version bump
Søren Sandmann Pedersen [Thu, 25 Feb 2010 02:52:30 +0000 (21:52 -0500)]
Pre-release version bump
Søren Sandmann Pedersen [Thu, 25 Feb 2010 02:01:29 +0000 (21:01 -0500)]
Merge branch 'trap-fixes'
Søren Sandmann Pedersen [Thu, 25 Feb 2010 01:51:25 +0000 (20:51 -0500)]
Add a1-trap-test
When a trapezoid sample point is exactly on a polygon edge, the rule
is that it is considered inside the trapezoid if the edge is a top or
left edge, but outside for bottom and right edges.
This program tests that for a1 trapezoids.
Søren Sandmann Pedersen [Sun, 21 Feb 2010 07:01:16 +0000 (02:01 -0500)]
Hide the C++ extern "C" declarations behind macros.
That way they don't confuse the indenting algorithm in editors such as
Emacs.
Søren Sandmann Pedersen [Sat, 20 Feb 2010 18:09:01 +0000 (13:09 -0500)]
Merge branch 'eliminate-composite'
Conflicts:
pixman/pixman-sse2.c
Søren Sandmann Pedersen [Sun, 14 Feb 2010 01:08:13 +0000 (20:08 -0500)]
Move all code to do debugging spew into pixman-private.
Rather than the region code having its own little debug system, move
all of it into pixman-private where there is already return_if_fail()
macros etc. These macros are now enabled in development snapshots and
nowhere else. Previously they were never enabled unless you modified
the code.
At the same time, remove all the asserts from the region code since we
can never turn them on anyway, and replace them with
critical_if_fail() macros that will print spew to standard error when
DEBUG is defined.
Finally, also change the debugging spew in pixman-bits-image.c to use
return_val_if_fail() instead of its own fprintf().
Alexander Larsson [Fri, 19 Feb 2010 10:22:52 +0000 (11:22 +0100)]
Test pixman_region32_init_from_image in region-test
Alexander Larsson [Mon, 15 Feb 2010 08:40:50 +0000 (09:40 +0100)]
Add pixman_region{32}_init_from_image
This creates a region from an image in PIXMAN_a1 format.
Alexander Larsson [Mon, 15 Feb 2010 08:39:59 +0000 (09:39 +0100)]
Move SCREEN_SHIFT_LEFT/RIGHT to pixman-private.h
This is needed for later use in other code.
Makoto Kato [Thu, 18 Feb 2010 05:30:01 +0000 (14:30 +0900)]
Compile by USE_SSE2 only without USE_MMX
Although we added MMX emulation for Microsoft Visual C++ compiler for x64,
USE_SSE2 still requires USE_MMX. So we remove dependency of USE_MMX
for Windows x64.
Signed-off-by: Makoto Kato <m_kato@ga2.so-net.ne.jp>
Søren Sandmann Pedersen [Wed, 16 Sep 2009 11:29:08 +0000 (07:29 -0400)]
Move NULL check out of get_image_info()
The NULL check is only necessary for masks, so there is no reason to
do it for destinations and sources.
Søren Sandmann Pedersen [Wed, 16 Sep 2009 10:54:43 +0000 (06:54 -0400)]
Add a fast path for non-repeating sources in walk_region_internal().
In the common case where there is no repeating, the loop in
walk_region_internal() reduces to just walking of the boxes involved
and calling the composite function.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 05:34:33 +0000 (01:34 -0400)]
Move more things out of the inner loop in do_composite().
Specifically,
- the src_ and mask_repeat computations
- the check for whether the involved images cover the composite
region.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 05:16:40 +0000 (01:16 -0400)]
Move region computation out of the loop in do_composite()
We only need to compute the composite region once, not on every
iteration.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 05:14:41 +0000 (01:14 -0400)]
Move get_image_info() out of the loop in do_composite
The computation of image formats and flags is invariant to the loop,
so it can all be moved out.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 05:13:36 +0000 (01:13 -0400)]
Manually inline _pixman_run_fast_path()
Move all of the code into do_composite().
Søren Sandmann Pedersen [Tue, 15 Sep 2009 05:11:04 +0000 (01:11 -0400)]
Move compositing functionality from pixman-utils.c into pixman.c
_pixman_run_fast_path() and pixman_compute_composite_region() are both
moved to pixman-image, since at this point that's the only place they
are being called from.
Søren Sandmann Pedersen [Sat, 7 Nov 2009 20:13:03 +0000 (15:13 -0500)]
Move compositing to its own function, do_composite()
Søren Sandmann Pedersen [Tue, 15 Sep 2009 04:58:51 +0000 (00:58 -0400)]
Optimize for the common case wrt. the workaround.
In the common case no images need the workaround, so we check for that
first, and only if an image does need a workaround do we check which
one of the images actually need it.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 04:48:12 +0000 (00:48 -0400)]
Eliminate all the composite methods.
They are no longer necessary because we will just walk the fast path
tables, and the general composite path is treated as another fast
path.
This unfortunately means that sse2_composite() can no longer be
responsible for realigning the stack to 16 bytes, so we have to move
that to pixman_image_composite().
Søren Sandmann Pedersen [Tue, 15 Sep 2009 04:23:11 +0000 (00:23 -0400)]
Delete unused _pixman_walk_composite_region() function
Søren Sandmann Pedersen [Sat, 7 Nov 2009 20:28:57 +0000 (15:28 -0500)]
Don't call _pixman_implementation_composite() anymore.
Instead just call _pixman_run_fast_path(). Since we view
general_composite() as a fast path now, we know that it will find
*some* compositing routine.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 03:58:40 +0000 (23:58 -0400)]
Delete unused sources_cover() function
Søren Sandmann Pedersen [Tue, 15 Sep 2009 03:47:39 +0000 (23:47 -0400)]
Store a pointer to the array of fast paths in the implementation struct.
Also add an empty fast path table to the vmx implementation, so that
we can assume sure the pointer is never NULL.
Søren Sandmann Pedersen [Tue, 15 Sep 2009 04:20:22 +0000 (00:20 -0400)]
Make fast_composite_scaled_nearest() another fast path.
This requires another couple of flags
FAST_PATH_SCALE_TRANSFORM
FAST_PATH_NEAREST_FILTER
Søren Sandmann Pedersen [Sun, 13 Sep 2009 09:29:48 +0000 (05:29 -0400)]
Make general_composite_rect() just another fast path.
We introduce a new PIXMAN_OP_any fake operator and a PIXMAN_any fake
format that match anything. Then general_composite_rect() can be used
as another fast path.
Because general_composite_rect() does not require the sources to cover
the clip region, we add a new flag FAST_PATH_COVERS_CLIP which is part
of the set of standard flags for fast paths.
Because this flag cannot be computed until after the clip region is
available, we have to call pixman_compute_composite_region32() before
checking for fast paths. This will resolve itself when we get to the
point where _pixman_run_fast_path() is only called once per composite
operation.
Søren Sandmann Pedersen [Sat, 13 Feb 2010 23:23:34 +0000 (18:23 -0500)]
Post-release version bump
Søren Sandmann Pedersen [Sat, 13 Feb 2010 23:12:32 +0000 (18:12 -0500)]
Pre-release version bump
Søren Sandmann Pedersen [Sun, 31 May 2009 19:06:33 +0000 (15:06 -0400)]
Once unrolled version of fast_path_composite_nearest_scaled()
Separate out the fetching and combining code in two inline
functions. Then do two pixels per iteration.
Søren Sandmann Pedersen [Sat, 30 May 2009 06:50:38 +0000 (02:50 -0400)]
Generalize and optimize fast_composite_src_scaled_nearest()
- Make it work for PIXMAN_OP_OVER
- Split repeat computation for x and y, and only the x part in the
inner loop.
- Move stride multiplication outside of inner loop
Søren Sandmann Pedersen [Sat, 13 Feb 2010 17:26:09 +0000 (12:26 -0500)]
Merge branch 'bitmasks'
Søren Sandmann Pedersen [Sat, 13 Feb 2010 16:18:13 +0000 (11:18 -0500)]
Makefile.am: Remove 'check' from release-check
It's already included in distcheck.
Søren Sandmann Pedersen [Sat, 13 Feb 2010 14:40:33 +0000 (09:40 -0500)]
Turn off asserts in development snapshots (bug 26314).
There is not much real benefit in having asserts turned on in
snapshots because it doesn't lead to any new bug reports, just to
people not installing development snapshots since they case X server
crashes. So just turn them off.
While we are at it, limit the number of messages to stderr to 5
instead of 50.
Siarhei Siamashka [Sun, 7 Feb 2010 05:33:52 +0000 (07:33 +0200)]
ARM: Remove any use of environment variables for cpu features detection
Old code assumed that all ARMv7 processors support NEON instructions
unless overrided by environment variable ARM_TRUST_HWCAP. This causes
X server to die with SIGILL if NEON support is disabled in the kernel
configuration. Additionally, ARMv7 processors lacking NEON unit are
going to become available eventually.
The problem was reported by user bearsh at irc.freenode.net #gentoo-embedded
Alexander Larsson [Tue, 9 Feb 2010 14:57:18 +0000 (15:57 +0100)]
Add pixman_image_get_destroy_data()
This way you can get back user data that was set using
pixman_image_set_destroy_function().
Alexander Larsson [Tue, 9 Feb 2010 12:22:38 +0000 (13:22 +0100)]
Add extern "C" guards for c++
Søren Sandmann Pedersen [Sun, 13 Sep 2009 08:28:20 +0000 (04:28 -0400)]
Move checks for src/mask repeat right before walking the region.
Also add a couple of additional checks to the src/mask repeat check.
Søren Sandmann Pedersen [Sun, 13 Sep 2009 07:43:16 +0000 (03:43 -0400)]
Compute src, mask, dest flags and base fast path decisions on them.
This makes sets the stage for caching the information by image instead
of computing it on each composite invocation.
This patch also computes format codes for images such as PIXMAN_solid,
so that we can no longer end up in the situation that a fast path is
selected for a 1x1 solid image, when that fast path doesn't actually
understand repeating.
Søren Sandmann Pedersen [Sun, 13 Sep 2009 06:34:32 +0000 (02:34 -0400)]
Add src_, mask_, and dest_flags fields to fast path arrays
Update all the fast path tables to match using a new
PIXMAN_STD_FAST_PATH macro.
For now, use 0 for the flags fields.
Søren Sandmann Pedersen [Sat, 12 Sep 2009 09:26:50 +0000 (05:26 -0400)]
Move calls to source_is_fastpathable() into get_source_format()
Søren Sandmann Pedersen [Sat, 12 Sep 2009 09:09:53 +0000 (05:09 -0400)]
Fold get_fast_path() into _pixman_run_fast_path()
Also factor out the source format code computation to its own
function.
Søren Sandmann Pedersen [Sat, 12 Sep 2009 08:30:22 +0000 (04:30 -0400)]
Consolidate the source and mask sanity checks in a function
Søren Sandmann Pedersen [Sat, 12 Sep 2009 08:10:30 +0000 (04:10 -0400)]
Move pixbuf checks after src_format and mask_format have been computed.
Søren Sandmann Pedersen [Sat, 12 Sep 2009 08:03:25 +0000 (04:03 -0400)]
Move the sanity checks for src, mask and destination into get_fast_path()
Søren Sandmann Pedersen [Wed, 27 Jan 2010 12:11:11 +0000 (07:11 -0500)]
Turn some uint16_t variables to int32_t in the fast paths.
This is necessary now that we have a 32 bit version of
pixman_image_composite().
Søren Sandmann Pedersen [Thu, 21 Jan 2010 18:43:53 +0000 (13:43 -0500)]
Implement get_scanline_64() correctly for solid fill images.
Previously they would be evaluated at 8 bits and then expanded.
Benjamin Otte [Tue, 26 Jan 2010 18:37:34 +0000 (19:37 +0100)]
Make pixman_image_fill_rectangles() call pixman_image_fill_boxes()
Avoids duplication of code
Benjamin Otte [Tue, 26 Jan 2010 18:03:38 +0000 (19:03 +0100)]
Add pixman_image_fill_boxes() API
It's basically the 32bit version of pixman_image_fill_rectangles(), just
with a saner data type.
Benjamin Otte [Tue, 26 Jan 2010 17:52:27 +0000 (18:52 +0100)]
Add pixman_image_composite32()
This is equal to pixman_image_composite(), just with 32bit parameters.
pixman_image_composite() now just calls pixman_image_composite32()
Benjamin Otte [Tue, 26 Jan 2010 18:09:56 +0000 (19:09 +0100)]
Make region argument to pixman_region(32)_init_rects() const
No indenting of the header to keep git blame working
Benjamin Otte [Tue, 26 Jan 2010 18:08:29 +0000 (19:08 +0100)]
Fix typo
Søren Sandmann Pedersen [Tue, 19 Jan 2010 19:23:57 +0000 (14:23 -0500)]
Fix some warnings
Søren Sandmann Pedersen [Mon, 18 Jan 2010 00:34:27 +0000 (19:34 -0500)]
Post-release version bump
Søren Sandmann Pedersen [Sun, 17 Jan 2010 23:56:11 +0000 (18:56 -0500)]
Pre-release version bump
Søren Sandmann Pedersen [Sun, 17 Jan 2010 21:45:23 +0000 (16:45 -0500)]
bits: Print an error if someone tries to create an image with bpp < depth
Something in the X server apparently does this.
Søren Sandmann Pedersen [Sat, 16 Jan 2010 15:31:22 +0000 (10:31 -0500)]
When fetching from an alpha map, replace the alpha channel of the image
Previously it would be multiplied onto the image pixel, but the Render
specification is pretty clear that the alpha map should be used
*instead* of any alpha channel within the image.
This makes the assumption that the pixels in the image are already
premultiplied with the alpha channel from the alpha map. If we don't
make this assumption and the image has an alpha channel of its own, we
would have to first unpremultiply that pixel, and then premultiply the
alpha value onto the color channels, and then replace the alpha
channel.
Søren Sandmann Pedersen [Sat, 16 Jan 2010 15:09:25 +0000 (10:09 -0500)]
pixman_image_validate() needs to also validate the alpha map.
This is the other half of bug 25950.
Søren Sandmann Pedersen [Sat, 16 Jan 2010 15:07:48 +0000 (10:07 -0500)]
When fetching from an alpha map, use the alpha map's fetch function.
Don't use the one from the image. This is the first half of bug 25950.
Søren Sandmann Pedersen [Sat, 16 Jan 2010 14:48:45 +0000 (09:48 -0500)]
test: Add new alphamap test program.
This program demonstrates three bugs relating to alpha maps:
- When fetching from an alpha map into 32 bit intermediates, we use
the fetcher from the image, and not the one from the alpha map.
- For 64 bit intermediates we call fetch_pixel_generic_lossy_32()
which then calls fetch_pixel_raw_64, which is NULL because alpha
images are never validated.
- The alpha map should be used *in place* of any existing alpha
channel, but we are actually multiplying it onto the image.
Søren Sandmann Pedersen [Sat, 16 Jan 2010 12:40:07 +0000 (07:40 -0500)]
fetch-test: Fix spelling error (pallete -> palette)
Alan Coopersmith [Thu, 14 Jan 2010 17:42:34 +0000 (09:42 -0800)]
Update Sun license notices to current X.Org standard form
Signed-off-by: Alan Coopersmith <alan.coopersmith@sun.com>
Søren Sandmann Pedersen [Sun, 10 Jan 2010 14:15:24 +0000 (09:15 -0500)]
fetch-test: Various formatting fixes
Pierre-Loup A. Griffais [Tue, 5 Jan 2010 23:26:07 +0000 (01:26 +0200)]
Interpret the angle of a conical gradient in degrees.
The conical gradient angle's fixed point degrees to
radians conversion code is missing a factor of pi.
Søren Sandmann Pedersen [Fri, 11 Dec 2009 16:14:19 +0000 (11:14 -0500)]
region: Enable or disable fatal errors and selfchecks based on version number
There is a couple of bugs in bugzilla where bugs in the X server
triggered asserts in the pixman region code. It is probably better to
let the X server survive this. (In fact, I thought I had disabled them
for 0.16.0, but apparently not).
The patch below uses these rules:
- In _stable_ pixman releases, assertions and selfchecks are turned
off. Assertions, so that the X server doesn't die. Selfchecks,
for performance reasons.
- In _unstable_ pixman releases, both assertions and selfcheck are
turned on. These releases are what get added to development
distributions such as rawhide, so we want as much self-checking
as possible.
- In _random git checkouts_, assertions are enabled, so that bugs
are caught, but selfchecks are disabled so that you can use them
for performance work without having to fiddle with turning
selfchecks off.
Søren Sandmann Pedersen [Sat, 12 Sep 2009 07:27:44 +0000 (03:27 -0400)]
Some minor formatting fixes.
Søren Sandmann Pedersen [Wed, 16 Dec 2009 22:54:41 +0000 (17:54 -0500)]
arm-simd: Whitespace fixes
Søren Sandmann Pedersen [Wed, 16 Dec 2009 22:49:44 +0000 (17:49 -0500)]
mmx: Eliminate trailing whitespace.
Søren Sandmann Pedersen [Wed, 16 Dec 2009 20:23:50 +0000 (15:23 -0500)]
Add 'check' to release-check make target
Søren Sandmann Pedersen [Wed, 16 Dec 2009 20:15:17 +0000 (15:15 -0500)]
Reorder tests so that they fastest ones run first.
Marvin Schmidt [Sun, 15 Nov 2009 15:04:09 +0000 (16:04 +0100)]
Build tests and run non-GTK+ ones on `make check`
Setting TESTS will run the tests on `make check`
Bug 25131
Siarhei Siamashka [Wed, 9 Dec 2009 22:25:58 +0000 (00:25 +0200)]
ARM: added 'neon_combine_add_u' function
Siarhei Siamashka [Wed, 9 Dec 2009 22:22:12 +0000 (00:22 +0200)]
ARM: added 'neon_combine_over_u' function
Siarhei Siamashka [Wed, 9 Dec 2009 21:49:04 +0000 (23:49 +0200)]
ARM: macro template for single scanline compositing functions
Existing template already supports 2D images processing,
but pixman also needs some NEON optimized functions for
improving performance when compositing is decoupled
into "fetch -> process -> store" stages and done via
temporary scanline buffer. That's why a new simplified
template which deals only with the generation of single
scanline processing functions is handy.
Siarhei Siamashka [Mon, 14 Dec 2009 17:14:36 +0000 (19:14 +0200)]
Use canonical pixman license notice for recently added ARM NEON assembly files
Siarhei Siamashka [Fri, 6 Nov 2009 00:25:47 +0000 (02:25 +0200)]
ARM: added 'neon_composite_src_pixbuf_8888' fast path
This is ARM NEON optimized conversion of native RGBA format used by
GTK/GDK into native 32bpp RGBA format used by cairo/pixman.
Siarhei Siamashka [Thu, 5 Nov 2009 18:27:38 +0000 (20:27 +0200)]
ARM: added 'neon_composite_src_0888_0565_rev' fast path
This is ARM NEON optimized conversion of native RGB format used by
GTK/GDK into r5g6b5 format.
Siarhei Siamashka [Thu, 5 Nov 2009 17:43:09 +0000 (19:43 +0200)]
ARM: added 'neon_src_0888_8888_rev' fast path
This is ARM NEON optimized conversion of native RGB format used by
GTK/GDK into native 32bpp RGB format used by cairo/pixman.
Siarhei Siamashka [Wed, 9 Dec 2009 09:29:13 +0000 (11:29 +0200)]
ARM: added 'neon_composite_over_n_8888' fast path
Siarhei Siamashka [Wed, 9 Dec 2009 09:02:04 +0000 (11:02 +0200)]
ARM: added 'neon_composite_over_n_0565' fast path