platform/upstream/orc.git
23 months agotestsuite: fix memory leak in parse test
Tim-Philipp Müller [Fri, 11 Nov 2022 23:56:06 +0000 (23:56 +0000)]
testsuite: fix memory leak in parse test

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/84>

2 years agoAdd NetBSD/powerpc support.
Thomas Klausner [Wed, 9 Nov 2022 17:02:39 +0000 (18:02 +0100)]
Add NetBSD/powerpc support.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/83>

2 years agoAdd missing include.
Thomas Klausner [Wed, 9 Nov 2022 17:02:25 +0000 (18:02 +0100)]
Add missing include.

Fixes
../orc/orccodemem.c:227:71: error: 'errno' undeclared (first use in this function)

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/83>

2 years agotestsuite: fix compiler warning about unused variable
Tim-Philipp Müller [Tue, 8 Nov 2022 15:35:53 +0000 (15:35 +0000)]
testsuite: fix compiler warning about unused variable

Happens on gst-macos-12.3 with --werror with clang 13.1.6
"Apple clang version 13.1.6 (clang-1316.0.21.2)":
../testsuite/memcpy_speed.c:25:23: error: variable 'sum' set but not used

2 years agoci: bump macOS/iOS versions
Tim-Philipp Müller [Tue, 8 Nov 2022 15:16:16 +0000 (15:16 +0000)]
ci: bump macOS/iOS versions

2 years agoBack to development
Tim-Philipp Müller [Tue, 1 Nov 2022 13:13:14 +0000 (13:13 +0000)]
Back to development

2 years agoRelease 0.4.33 orc-0.4.33
Tim-Philipp Müller [Sat, 29 Oct 2022 22:34:59 +0000 (23:34 +0100)]
Release 0.4.33

Fixes #42

2 years agoFix build on iOS simulator
Tim-Philipp Müller [Mon, 31 Oct 2022 15:35:42 +0000 (15:35 +0000)]
Fix build on iOS simulator

pthread_jit_write_protect_np() is available but unusable on
iOS simulator:

../orc/orccompiler.c:460:3: error: 'pthread_jit_write_protect_np' is unavailable: not available on iOS

Fixes #43

2 years agomeson: Enable only SSE and MMX backends for Windows
Seungha Yang [Mon, 20 Jun 2022 18:14:39 +0000 (03:14 +0900)]
meson: Enable only SSE and MMX backends for Windows

Other backends would not work or do not make sense

2 years agoci: move deploy stage behind build stage
Tim-Philipp Müller [Mon, 31 Oct 2022 12:23:46 +0000 (12:23 +0000)]
ci: move deploy stage behind build stage

2 years agoorcarm.c: Implement orc_arm_flush_cache on Windows
Chun-wei Fan [Wed, 22 Sep 2021 07:40:38 +0000 (15:40 +0800)]
orcarm.c: Implement orc_arm_flush_cache on Windows

Use the Windows API FlushInstructionCache() to flush the CPU cache on ARM64
Windows.  As a consequence, include windows.h with WIN32_LEAN_AND_MEAN defined.

2 years agoorccpu-arm.c: Assume ARMv8 on Windows ARM64
Chun-wei Fan [Wed, 22 Sep 2021 07:37:46 +0000 (15:37 +0800)]
orccpu-arm.c: Assume ARMv8 on Windows ARM64

ARM64 Windows are supported on ARMv8 CPUs only, so just assume that we have
the NEON and EDSP ARM instructions.

2 years agoorccpu-arm.c: Include headers conditionally
Chun-wei Fan [Wed, 22 Sep 2021 07:33:17 +0000 (15:33 +0800)]
orccpu-arm.c: Include headers conditionally

unistd.h and sys/time.h may not be universally available, so only include them
if they were found at configure time.

2 years agomeson: don't depend on pthreads when compiling for windows
Steve Lhomme [Fri, 19 Aug 2022 09:04:08 +0000 (11:04 +0200)]
meson: don't depend on pthreads when compiling for windows

On UNIX toolchains cross compiling for Windows, winpthread will be detected and
added as a dependency even though it's not used.

2 years agomeson: export ORC_STATIC_COMPILATION in the pkgconfig file
Steve Lhomme [Fri, 19 Aug 2022 09:02:15 +0000 (11:02 +0200)]
meson: export ORC_STATIC_COMPILATION in the pkgconfig file

If the library is compiled statically the define also needs to be set in the
orc-0.4.pc file so that users of the library (in a UNIX toolchain cross
compiling to Windows) will not import the functions as DLL imports.

2 years agoorc: fix ORC_RESTRICT definition for MSVC
Tim-Philipp Müller [Sat, 29 Oct 2022 22:44:27 +0000 (23:44 +0100)]
orc: fix ORC_RESTRICT definition for MSVC

Fixes #40

2 years agomeson: bump req to >= 0.55 and use meson.can_run_host_binaries()
Tim-Philipp Müller [Sat, 29 Oct 2022 22:32:58 +0000 (23:32 +0100)]
meson: bump req to >= 0.55 and use meson.can_run_host_binaries()

.. instead of the deprecated meson.has_exe_wrapper()

2 years agoci: build docs and deploy to gitlab pages
Tim-Philipp Müller [Sat, 29 Oct 2022 15:13:08 +0000 (16:13 +0100)]
ci: build docs and deploy to gitlab pages

https://gitlab.freedesktop.org/gstreamer/orc/-/issues/27

2 years agoneon: Fix discrepancy when using loadupdb in 32-bit
Gaetan Bahl [Mon, 22 Aug 2022 17:32:29 +0000 (19:32 +0200)]
neon: Fix discrepancy when using loadupdb in 32-bit

Fix shifted outputs when output array is 8-byte aligned but not 16-byte aligned and loop shift is 1.

Fixes #32

Signed-off-by: Gaetan Bahl <gaetan.bahl@nxp.com>
2 years agoneon: Fix testsuite not passing on arm CPUs
Gaetan Bahl [Fri, 20 May 2022 13:41:27 +0000 (15:41 +0200)]
neon: Fix testsuite not passing on arm CPUs

Set the FPCR.FZ bit before running tests using ARM NEON,
in order to make tests pass for most opcodes.

Add a way to check for expected failures in the test suite,
since ARM NEON does not comply to IEEE754.

Errors are expected when using divf (resp. sqrtf) on large
(resp. small) numbers.

Fixes #33, #20

Signed-off-by: Gaetan Bahl <gaetan.bahl@nxp.com>
2 years agoneon: Fix sqrtf only processing two inputs out of four
Gaetan Bahl [Fri, 20 May 2022 13:47:53 +0000 (15:47 +0200)]
neon: Fix sqrtf only processing two inputs out of four

This solves an issue where two out of four inputs are not processed
by passing the correct value of vec shift.

Fixes #33, #20.

Signed-off-by: Gaetan Bahl <gaetan.bahl@nxp.com>
2 years agoneon: Fix divf only processing two inputs out of four
Gaetan Bahl [Fri, 20 May 2022 13:41:27 +0000 (15:41 +0200)]
neon: Fix divf only processing two inputs out of four

This solves an issue where two out of four input values are not processed
by passing the correct value of vec shift.

Fixes #33, #20.

Signed-off-by: Gaetan Bahl <gaetan.bahl@nxp.com>
2 years agoneon: Fix out-of-shift errors when compiling certain float/double opcodes using neon
Gaetan Bahl [Tue, 17 May 2022 16:15:51 +0000 (18:15 +0200)]
neon: Fix out-of-shift errors when compiling certain float/double opcodes using neon

Setting the correct shift values solve the "out-of-shift" errors
and allow the following operators to successfully compile:
addf, subf, mulf, maxf, minf, cmpeqf, convfl, convlf, addd, subd, muld, divd

Fixes #33, #20, #2.

Signed-off-by: Gaetan Bahl <gaetan.bahl@nxp.com>
2 years agoorccodemem: Fix use-after-free in error paths
Bastien Nocera [Fri, 6 May 2022 13:56:17 +0000 (15:56 +0200)]
orccodemem: Fix use-after-free in error paths

In file included from gstreamer/subprojects/orc/orc/orc.h:7,
                 from gstreamer/subprojects/orc/orc/orcprogram.h:5,
                 from gstreamer/subprojects/orc/orc/orccodemem.c:30:
gstreamer/subprojects/orc/orc/orccodemem.c: In function ‘orc_code_region_allocate_codemem_dual_map.constprop’:
gstreamer/subprojects/orc/orc/orcdebug.h:138:3: warning: pointer ‘filename’ may be used after ‘free’ [-Wuse-after-free]
  138 |   orc_debug_print((level), __FILE__, ORC_FUNCTION, __LINE__, __VA_ARGS__); \
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
gstreamer/subprojects/orc/orc/orcdebug.h:92:26: note: in expansion of macro ‘ORC_DEBUG_PRINT’
   92 | #define ORC_WARNING(...) ORC_DEBUG_PRINT(ORC_DEBUG_WARNING, __VA_ARGS__)
      |                          ^~~~~~~~~~~~~~~
gstreamer/subprojects/orc/orc/orccodemem.c:252:5: note: in expansion of macro ‘ORC_WARNING’
  252 |     ORC_WARNING ("failed to create write map '%s'. err=%i", filename, errno);
      |     ^~~~~~~~~~~
gstreamer/subprojects/orc/orc/orccodemem.c:234:3: note: call to ‘free’ here
  234 |   free (filename);
      |   ^~~~~~~~~~~~~~~
gstreamer/subprojects/orc/orc/orcdebug.h:138:3: warning: pointer ‘filename’ may be used after ‘free’ [-Wuse-after-free]
  138 |   orc_debug_print((level), __FILE__, ORC_FUNCTION, __LINE__, __VA_ARGS__); \
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
gstreamer/subprojects/orc/orc/orcdebug.h:92:26: note: in expansion of macro ‘ORC_DEBUG_PRINT’
   92 | #define ORC_WARNING(...) ORC_DEBUG_PRINT(ORC_DEBUG_WARNING, __VA_ARGS__)
      |                          ^~~~~~~~~~~~~~~
gstreamer/subprojects/orc/orc/orccodemem.c:245:5: note: in expansion of macro ‘ORC_WARNING’
  245 |     ORC_WARNING("failed to create exec map '%s'. err=%i", filename, errno);
      |     ^~~~~~~~~~~
gstreamer/subprojects/orc/orc/orccodemem.c:234:3: note: call to ‘free’ here
  234 |   free (filename);
      |   ^~~~~~~~~~~~~~~

Fixes: bb5fcb31 ("orccodemem: Report errno during failures to create mmap codemap.")

2 years agoci: Add ARM64 and ARM Windows UWP jobs
Nirbheek Chauhan [Fri, 28 Jan 2022 07:07:12 +0000 (12:37 +0530)]
ci: Add ARM64 and ARM Windows UWP jobs

Cross and native files are modified versions of the files used in the
gstreamer CI since we use the same Docker image.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/64>

2 years agoci: Update Windows image and macos/ios tags
Nirbheek Chauhan [Thu, 27 Jan 2022 21:42:00 +0000 (03:12 +0530)]
ci: Update Windows image and macos/ios tags

The latest image contains VS 2019, and was built in
https://gitlab.freedesktop.org/gstreamer/gstreamer/-/merge_requests/1570

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/63>

2 years agoneon: Fix unsigned only implementation of loadoffb, loadoffw and loadoffl
Knobe, Daniel [Tue, 7 Dec 2021 09:22:15 +0000 (10:22 +0100)]
neon: Fix unsigned only implementation of loadoffb, loadoffw and loadoffl

This has direct impact on bayer2rgb performance. Tested on i.MX8mm aarch64 -> Speedboost of ~17%.
Reason:
The line loadoffw t, s, -1 results in orc silent compile error
Pipeline:
gst-launch-1.0 -v videotestsrc ! video/x-bayer,width=1920,height=1080 ! bayer2rgb ! fpsdisplaysink video-sink=fakesink sync=0
Average performance with fix: 25.21fps
Average performance without fix: 21.60fps

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/62>

3 years agomeson: Disable ARM64 support for Windows
Seungha Yang [Tue, 28 Sep 2021 14:04:29 +0000 (23:04 +0900)]
meson: Disable ARM64 support for Windows

Disable Windows ARM64 support/build for now, because it doesn't work.

Fixes: https://gitlab.freedesktop.org/gstreamer/orc/-/issues/36
Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/61>

3 years agoexecutor: Fix orc_executor_set_program() to save OrcCode
Doug Nazar [Tue, 17 Aug 2021 23:49:08 +0000 (19:49 -0400)]
executor: Fix orc_executor_set_program() to save OrcCode

orc_executor_set_program() was missed when it was added to
orc_executor_new().

3 years agomeson: Set recursive for extract_all_objects()
Doug Nazar [Wed, 18 Aug 2021 00:15:05 +0000 (20:15 -0400)]
meson: Set recursive for extract_all_objects()

Fixes the following warning:

WARNING: extract_all_objects called without setting recursive
keyword argument. Meson currently defaults to
non-recursive to maintain backward compatibility but
the default will be changed in the future.
orc-test\meson.build:16:0: ERROR: Fatal warnings enabled, aborting

3 years agoorc: Add support for MacOS Hardened runtime.
Doug Nazar [Fri, 9 Apr 2021 08:59:53 +0000 (04:59 -0400)]
orc: Add support for MacOS Hardened runtime.

3 years agoorccodemem: Report errno during failures to create mmap codemap.
Doug Nazar [Fri, 9 Apr 2021 08:57:05 +0000 (04:57 -0400)]
orccodemem: Report errno during failures to create mmap codemap.

3 years agoorc/orccpu-powerpc.c: fix build with kernel < 4.11
Fabrice Fontaine [Mon, 15 Feb 2021 19:17:43 +0000 (20:17 +0100)]
orc/orccpu-powerpc.c: fix build with kernel < 4.11

Build with powerpc and kernel < 4.11 is broken since version 0.4.30 and
https://gitlab.freedesktop.org/gstreamer/orc/-/commit/a999325abea6a5549d60d99ddeb0271d2aa00235:

FAILED: orc/liborc-0.4.so.0.32.0.p/orccpu-powerpc.c.o
/home/giuliobenetti/autobuild/run/instance-3/output-1/host/bin/powerpc-linux-gcc -Iorc/liborc-0.4.so.0.32.0.p -Iorc -I../orc -I. -I.. -fdiagnostics-color=always -pipe -Wall -Winvalid-pch -std=gnu99 -O3 -DHAVE_CONFIG_H -fvisibility=hidden -D_LARGEFILE_SOURCE -D_LARGEFILE64_SOURCE -D_FILE_OFFSET_BITS=64 -O2 -fPIC -pthread -DORC_ENABLE_UNSTABLE_API -D_GNU_SOURCE -DBUILDING_ORC -MD -MQ orc/liborc-0.4.so.0.32.0.p/orccpu-powerpc.c.o -MF orc/liborc-0.4.so.0.32.0.p/orccpu-powerpc.c.o.d -o orc/liborc-0.4.so.0.32.0.p/orccpu-powerpc.c.o -c ../orc/orccpu-powerpc.c
../orc/orccpu-powerpc.c: In function 'orc_check_powerpc_proc_auxv':
../orc/orccpu-powerpc.c:164:21: error: 'AT_L1D_CACHESIZE' undeclared (first use in this function); did you mean 'AT_DCACHEBSIZE'?
  164 |       if (buf[i] == AT_L1D_CACHESIZE) {
      |                     ^~~~~~~~~~~~~~~~
      |                     AT_DCACHEBSIZE
../orc/orccpu-powerpc.c:164:21: note: each undeclared identifier is reported only once for each function it appears in
../orc/orccpu-powerpc.c:168:21: error: 'AT_L2_CACHESIZE' undeclared (first use in this function); did you mean 'AT_ICACHEBSIZE'?
  168 |       if (buf[i] == AT_L2_CACHESIZE) {
      |                     ^~~~~~~~~~~~~~~
      |                     AT_ICACHEBSIZE
../orc/orccpu-powerpc.c:172:21: error: 'AT_L3_CACHESIZE' undeclared (first use in this function); did you mean 'AT_ICACHEBSIZE'?
  172 |       if (buf[i] == AT_L3_CACHESIZE) {
      |                     ^~~~~~~~~~~~~~~
      |                     AT_ICACHEBSIZE

Indeed, AT_{L1D,L2,L3}_CACHESIZE is only defined since kernel 4.11 and
https://github.com/torvalds/linux/commit/98a5f361b8625c6f4841d6ba013bbf0e80d08147

Fixes:
 - http://autobuild.buildroot.org/results/0821e96cba3e455edd47b87485501d892fc7ac6a

Signed-off-by: Fabrice Fontaine <fontaine.fabrice@gmail.com>
Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/56>

4 years agomeson: don't set gnu99 as default c_std
Tim-Philipp Müller [Sat, 7 Nov 2020 13:24:18 +0000 (13:24 +0000)]
meson: don't set gnu99 as default c_std

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/55>

4 years agoaarch32: Implement loadupdb instruction
Marek Vasut [Mon, 31 Aug 2020 00:26:33 +0000 (02:26 +0200)]
aarch32: Implement loadupdb instruction

Fill in aarch32 opcodes for loadupdb instruction, which is used
by various color space conversion programs. There is likely still
some space for optimization.

4 years agoaarch64: Implement loadupdb instruction
Marek Vasut [Fri, 26 Jun 2020 14:31:02 +0000 (16:31 +0200)]
aarch64: Implement loadupdb instruction

Fill in aarch64 opcodes for loadupdb instruction, which is used
by various color space conversion programs. This is thus far only
available on aarch64, but arm32 port should be easy.

4 years agoaarch64: Implement sqrtf instruction
Marek Vasut [Sun, 30 Aug 2020 16:26:15 +0000 (18:26 +0200)]
aarch64: Implement sqrtf instruction

Fill in aarch64 opcodes for sqrtf instruction.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement divf instruction
Marek Vasut [Sun, 30 Aug 2020 16:18:38 +0000 (18:18 +0200)]
aarch64: Implement divf instruction

Fill in aarch64 opcodes for divf instruction.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement double-precision floating-point arithmetic instructions
Marek Vasut [Sun, 30 Aug 2020 16:18:31 +0000 (18:18 +0200)]
aarch64: Implement double-precision floating-point arithmetic instructions

Fill in aarch64 opcodes for double-precision floating point
arithmetic instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement flags2d
Marek Vasut [Sat, 29 Aug 2020 23:23:05 +0000 (01:23 +0200)]
aarch64: Implement flags2d

Implement support for .flags 2d by adding code for handling the loop
counter. The implementation is very similar to aarch32.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement const64 loadiq
Marek Vasut [Sat, 29 Aug 2020 22:33:07 +0000 (00:33 +0200)]
aarch64: Implement const64 loadiq

Add support for loading 64bit constants on aarch64 by emitting a
sequence of pc-relative load, branch past the literal pool and a
constant as a literal pool entry.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement accumulator store
Marek Vasut [Sat, 29 Aug 2020 15:27:20 +0000 (17:27 +0200)]
aarch64: Implement accumulator store

Add code to store the accumulator register.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement floating-point arithmetic instructions
Marek Vasut [Sat, 27 Jun 2020 21:26:30 +0000 (23:26 +0200)]
aarch64: Implement floating-point arithmetic instructions

Fill in aarch64 opcodes for floating point arithmetic instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement andn instruction
Marek Vasut [Sat, 27 Jun 2020 21:36:22 +0000 (23:36 +0200)]
aarch64: Implement andn instruction

Fill in aarch64 opcode for andn instruction.
This is bic instruction with reversed operands.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement splatw3q instruction
Marek Vasut [Sat, 27 Jun 2020 20:55:17 +0000 (22:55 +0200)]
aarch64: Implement splatw3q instruction

This uses TRN2 instruction twice on the same data, first to expand
top half-word of SRC register into TMP register and then to expand
top world of TMP register to DST register. The following scheme is
implemented: src[ABCDEFGH] -> tmp[ABABEFEF] -> dst[ABABABAB] .

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement swapX instructions
Marek Vasut [Sat, 27 Jun 2020 17:35:45 +0000 (19:35 +0200)]
aarch64: Implement swapX instructions

Fill in aarch64 opcodes for swapX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement div255w instruction
Marek Vasut [Fri, 26 Jun 2020 16:27:40 +0000 (18:27 +0200)]
aarch64: Implement div255w instruction

Fill in aarch64 opcodes for div255w instruction.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement mulX instructions
Marek Vasut [Fri, 26 Jun 2020 16:46:52 +0000 (18:46 +0200)]
aarch64: Implement mulX instructions

Fill in aarch64 opcodes for mulX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement cmpX instructions
Marek Vasut [Fri, 26 Jun 2020 16:44:15 +0000 (18:44 +0200)]
aarch64: Implement cmpX instructions

Fill in aarch64 opcodes for cmpX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement avgX instructions
Marek Vasut [Fri, 26 Jun 2020 16:39:29 +0000 (18:39 +0200)]
aarch64: Implement avgX instructions

Fill in aarch64 opcodes for avgX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement splitX/splatX instructions
Marek Vasut [Fri, 26 Jun 2020 11:38:52 +0000 (13:38 +0200)]
aarch64: Implement splitX/splatX instructions

Fill in aarch64 opcodes for splitX/splatX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement signX instructions
Marek Vasut [Fri, 26 Jun 2020 10:35:57 +0000 (12:35 +0200)]
aarch64: Implement signX instructions

Fill in aarch64 opcodes for signX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement vminX/vmaxX instructions
Marek Vasut [Fri, 26 Jun 2020 10:27:12 +0000 (12:27 +0200)]
aarch64: Implement vminX/vmaxX instructions

Fill in aarch64 opcodes for vminX/vmaxX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement accX instructions
Marek Vasut [Fri, 26 Jun 2020 03:08:14 +0000 (05:08 +0200)]
aarch64: Implement accX instructions

Fill in aarch64 opcodes for accX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement loadiX instructions
Marek Vasut [Fri, 26 Jun 2020 00:22:41 +0000 (02:22 +0200)]
aarch64: Implement loadiX instructions

Fill in aarch64 opcodes for loadiX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement subX instructions
Marek Vasut [Fri, 26 Jun 2020 01:43:41 +0000 (03:43 +0200)]
aarch64: Implement subX instructions

Fill in aarch64 opcodes for subX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement andX instructions
Marek Vasut [Fri, 26 Jun 2020 01:37:02 +0000 (03:37 +0200)]
aarch64: Implement andX instructions

Fill in aarch64 opcodes for andX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement absX instructions
Marek Vasut [Fri, 26 Jun 2020 01:35:22 +0000 (03:35 +0200)]
aarch64: Implement absX instructions

Fill in aarch64 opcodes for absX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement xorX instructions
Marek Vasut [Fri, 26 Jun 2020 01:32:34 +0000 (03:32 +0200)]
aarch64: Implement xorX instructions

Fill in aarch64 opcodes for xorX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement copyX/orX instructions
Marek Vasut [Thu, 25 Jun 2020 22:30:12 +0000 (00:30 +0200)]
aarch64: Implement copyX/orX instructions

Fill in aarch64 opcodes for copyX/orX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement mergeX instructions
Marek Vasut [Thu, 25 Jun 2020 19:43:03 +0000 (21:43 +0200)]
aarch64: Implement mergeX instructions

Fill in aarch64 opcodes for mergeX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Clean up mergeX/splatX instructions
Marek Vasut [Thu, 25 Jun 2020 19:22:35 +0000 (21:22 +0200)]
aarch64: Clean up mergeX/splatX instructions

Factor out common code from the mergeX/splatX instructions so it
is easier to add the aarch64 counterpart.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement loadX instructions
Marek Vasut [Wed, 24 Jun 2020 19:43:56 +0000 (21:43 +0200)]
aarch64: Implement loadX instructions

Fill in aarch64 opcodes for loadX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement shift instructions
Marek Vasut [Thu, 25 Jun 2020 18:28:30 +0000 (20:28 +0200)]
aarch64: Implement shift instructions

Fill in aarch64 opcodes for shift instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement mov instructions
Marek Vasut [Thu, 25 Jun 2020 17:09:42 +0000 (19:09 +0200)]
aarch64: Implement mov instructions

Fill in aarch64 opcodes for mov instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement mulhX instructions
Marek Vasut [Thu, 25 Jun 2020 16:20:39 +0000 (18:20 +0200)]
aarch64: Implement mulhX instructions

Fill in aarch64 opcodes for mulhX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement select{0,1}X instructions
Marek Vasut [Thu, 25 Jun 2020 18:58:09 +0000 (20:58 +0200)]
aarch64: Implement select{0,1}X instructions

Fill in aarch64 opcodes for select{0,1}X instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement convX instructions
Marek Vasut [Thu, 25 Jun 2020 19:09:46 +0000 (21:09 +0200)]
aarch64: Implement convX instructions

Fill in aarch64 opcodes for convX instructions.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Implement unary instruction emit
Marek Vasut [Thu, 25 Jun 2020 18:57:42 +0000 (20:57 +0200)]
aarch64: Implement unary instruction emit

Fill in missing code to emit unary instructions for aarch64 neon.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Fix MSVC warnings
Marek Vasut [Sun, 30 Aug 2020 22:52:09 +0000 (00:52 +0200)]
aarch64: Fix MSVC warnings

This fixes various warnings reported by the MSVC in CI.
No functional change.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Repair storeX instructions
Marek Vasut [Sun, 28 Jun 2020 00:09:11 +0000 (02:09 +0200)]
aarch64: Repair storeX instructions

The storeX instructions are a mirror image of loadX instructions, however
the current implementation is a copy and the storeX arguments are swapped.
This does lead to incorrect store and either to segfault or invalid data.
Fix this by correcting the storeX instruction arguments.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Repair emit for imm 1
Marek Vasut [Sat, 27 Jun 2020 20:52:02 +0000 (22:52 +0200)]
aarch64: Repair emit for imm 1

The logical immediate encoding of 1 is 0, so encode_logical_imm()
would return 0 in that case. However, 0 is treated as a failure.
Adjust encode_logical_imm() such that it return a negative return
value in case of failure, 0 in case of success and the logical
encoding of immediate value is returned via parameter.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Repair 8bit load/store opcode
Marek Vasut [Fri, 26 Jun 2020 19:15:41 +0000 (21:15 +0200)]
aarch64: Repair 8bit load/store opcode

The opcode for 8bit load/store is 0 instead of 1, fix this.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: Use 64bit operations on 64bit pointers
Marek Vasut [Wed, 24 Jun 2020 19:43:56 +0000 (21:43 +0200)]
aarch64: Use 64bit operations on 64bit pointers

This fixes loading values from arrays, where the array pointers
are 64bit, but the operation incrementing the pointer was 32bit
and thus the top 32bits of the pointer were set to zero. Fix it
by using 64bit operations on the pointer.

Signed-off-by: Marek Vasut <marex@denx.de>
4 years agoaarch64: orcprogram-neon porting to aarch64
Dongju Chae [Mon, 21 Oct 2019 11:21:47 +0000 (20:21 +0900)]
aarch64: orcprogram-neon porting to aarch64

This PR ports orcprogram-neon.c for AArch64 support.
It makes the orc compiler generate aarch64 assembly codes.

Currently, example1 succesfully runs on aarch64 (tested on an Android device).

Signed-off-by: Dongju Chae <dongju.chae@samsung.com>
4 years agoaarch64: implement emits for some vector instructions and ORC ops (add)
Dongju Chae [Mon, 21 Oct 2019 11:18:35 +0000 (20:18 +0900)]
aarch64: implement emits for some vector instructions and ORC ops (add)

This commit implements vector load/store/add instructions and ORC 'add' ops.
They reuse existing macros like 'BINARY', and aarch64-specific arguments are added.

Signed-off-by: Dongju Chae <dongju.chae@samsung.com>
4 years agoaarch64: implement emits for general instructions
Dongju Chae [Mon, 21 Oct 2019 10:48:49 +0000 (19:48 +0900)]
aarch64: implement emits for general instructions

This commit implement emits for general instructions.
Also, it defines a bunch of macros that can be used in orcprogram-neon.c

Signed-off-by: Dongju Chae <dongju.chae@samsung.com>
4 years agoaarch64: make some setups for aarch64 support
Dongju Chae [Mon, 21 Oct 2019 10:31:14 +0000 (19:31 +0900)]
aarch64: make some setups for aarch64 support

This commit adds the setup codes for aarch64 support including meson build and aarch64 selection.

Signed-off-by: Dongju Chae <dongju.chae@samsung.com>
4 years agomeson: Get rid of outdated version_compare checks
Nirbheek Chauhan [Wed, 1 Jul 2020 01:51:30 +0000 (07:21 +0530)]
meson: Get rid of outdated version_compare checks

4 years agoci: Also pass --fatal-meson-warnings
Nirbheek Chauhan [Wed, 1 Jul 2020 00:38:19 +0000 (06:08 +0530)]
ci: Also pass --fatal-meson-warnings

To ensure that warnings don't sneak in.

4 years agomeson: Bump version to 0.54 to fix warnings
Nirbheek Chauhan [Wed, 1 Jul 2020 00:45:16 +0000 (06:15 +0530)]
meson: Bump version to 0.54 to fix warnings

4 years agoRelease 0.4.32
Tim-Philipp Müller [Fri, 11 Sep 2020 09:30:15 +0000 (10:30 +0100)]
Release 0.4.32

Fixes #31

4 years agoci: Add an Android arm64 job run on Fedora
Nirbheek Chauhan [Mon, 31 Aug 2020 08:20:28 +0000 (13:50 +0530)]
ci: Add an Android arm64 job run on Fedora

Uses the same image as the android examples job on the cerbero CI.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/51>

4 years agoci: Add iOS CI with a dynamic cross file
Nirbheek Chauhan [Mon, 31 Aug 2020 07:38:05 +0000 (13:08 +0530)]
ci: Add iOS CI with a dynamic cross file

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/51>

4 years agoci: Use the latest meson syntax
Nirbheek Chauhan [Mon, 31 Aug 2020 06:57:01 +0000 (12:27 +0530)]
ci: Use the latest meson syntax

Also rename macos job for consistency. Install ninja using pip on
Windows to workaround https://github.com/mesonbuild/meson/pull/7669

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/51>

4 years agoci: Use werror on macOS too
Nirbheek Chauhan [Mon, 31 Aug 2020 07:28:24 +0000 (12:58 +0530)]
ci: Use werror on macOS too

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/51>

4 years agomeson: Use pkgconfig.generate
Jan Alexander Steffens (heftig) [Tue, 28 Jul 2020 14:19:23 +0000 (16:19 +0200)]
meson: Use pkgconfig.generate

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/49>

4 years agoci: mark the msys job as manual and allowed to fail
Jordan Petridis [Wed, 29 Jul 2020 11:59:00 +0000 (11:59 +0000)]
ci: mark the msys job as manual and allowed to fail

the msys setup is not deterministic due to issues with msys installation,
so it keeps breaking every now and then.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/48>

4 years agoorc: Use page size aligned memory block for Windows
Seungha Yang [Tue, 28 Jul 2020 09:00:09 +0000 (18:00 +0900)]
orc: Use page size aligned memory block for Windows

VirtualProtect will change protection attribute of all memory range
which belong to the same page range.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/48>

4 years agoorc: Remove trailing white spaces
Seungha Yang [Tue, 28 Jul 2020 09:00:38 +0000 (18:00 +0900)]
orc: Remove trailing white spaces

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/48>

4 years agoci: add basic macOS CI
Tim-Philipp Müller [Mon, 20 Jul 2020 23:19:13 +0000 (00:19 +0100)]
ci: add basic macOS CI

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/46>

4 years agomeson: use / instead of join_paths() for readability
Tim-Philipp Müller [Tue, 21 Jul 2020 09:20:03 +0000 (10:20 +0100)]
meson: use / instead of join_paths() for readability

4 years agoci: run benchmark tests as well
Tim-Philipp Müller [Tue, 21 Jul 2020 00:19:01 +0000 (01:19 +0100)]
ci: run benchmark tests as well

4 years agomeson: hook up benchmark as benchmark
Tim-Philipp Müller [Tue, 21 Jul 2020 00:16:47 +0000 (01:16 +0100)]
meson: hook up benchmark as benchmark

4 years agobenchmorc: fix file path to test orc file on windows
Tim-Philipp Müller [Mon, 20 Jul 2020 23:48:48 +0000 (00:48 +0100)]
benchmorc: fix file path to test orc file on windows

Fix compiler warning on msys2:
enchmorc.c:35:31: error: unknown escape sequence: '\o'

The absolute path returned by current_source_dir()
returns backward slashes, which don't work well when
used as a C string constant. join_paths() will make
it all forward-slashes.

4 years agoorcc: use #pragma once in generated headers
Tim-Philipp Müller [Sat, 4 Jul 2020 10:49:09 +0000 (11:49 +0100)]
orcc: use #pragma once in generated headers

Simplifies code and removes build path fragments from
generated file (paths may be different if generated
in a gst-build setup).

There shouldn't be any portability issues with this. Meson
has been using this in its generated config.h for years.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/45>

4 years agoorc: Don't set memory protection if there's no code
Nirbheek Chauhan [Thu, 2 Jul 2020 09:32:57 +0000 (15:02 +0530)]
orc: Don't set memory protection if there's no code

If the size of the JIT code is 0, there's no code and the *mem is
uninitialized. This can happen when orcc.exe is used to generate
backup C code.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/44>

4 years agoorc: Print hex value directly instead of a constant
Nirbheek Chauhan [Thu, 2 Jul 2020 09:32:12 +0000 (15:02 +0530)]
orc: Print hex value directly instead of a constant

This is actually more useful because the constants are all bitfields
and it's fairly straightforward to look it up:

https://docs.microsoft.com/en-us/windows/win32/memory/memory-protection-constants

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/44>

4 years agoorc: Fix indentation
Nirbheek Chauhan [Thu, 2 Jul 2020 09:31:40 +0000 (15:01 +0530)]
orc: Fix indentation

Only whitespace changes.

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/44>

4 years agoci: Attempt to fix broken Windows CI
Nirbheek Chauhan [Tue, 30 Jun 2020 11:14:59 +0000 (16:44 +0530)]
ci: Attempt to fix broken Windows CI

https://gitlab.freedesktop.org/nirbheek/orc/-/pipelines/169274

Update to latest image and use the gstreamer runner tag. Matches the
gst-ci template.

Also try to make MSYS2 CI more resilient by following:
https://github.com/msys2/setup-msys2/blob/master/main.js#L98

Part-of: <https://gitlab.freedesktop.org/gstreamer/orc/-/merge_requests/42>