platform/upstream/openblas.git
7 years agoTHUNDERX2T99: Add Optimized CNRM2 Implementation
Ashwin Sekhar T K [Thu, 19 Jan 2017 10:27:13 +0000 (15:57 +0530)]
THUNDERX2T99: Add Optimized CNRM2 Implementation

7 years agoTHUNDERX2T99: Add Optimized SNRM2 Implementation
Ashwin Sekhar T K [Thu, 19 Jan 2017 08:57:02 +0000 (00:57 -0800)]
THUNDERX2T99: Add Optimized SNRM2 Implementation

7 years agoUpdate .gitignore
Ashwin Sekhar T K [Wed, 18 Jan 2017 08:39:04 +0000 (00:39 -0800)]
Update .gitignore

7 years agoTHUNDERX2T99: Add threaded DDOT Implementation
Ashwin Sekhar T K [Thu, 19 Jan 2017 05:26:17 +0000 (10:56 +0530)]
THUNDERX2T99: Add threaded DDOT Implementation

7 years agoTHUNDERX2T99: Add Optimized DDOT Implementation
Ashwin Sekhar T K [Thu, 19 Jan 2017 05:23:48 +0000 (10:53 +0530)]
THUNDERX2T99: Add Optimized DDOT Implementation

7 years agoTHUNDERX2T99: Improve SGEMM
Ashwin Sekhar T K [Wed, 18 Jan 2017 08:57:11 +0000 (00:57 -0800)]
THUNDERX2T99: Improve SGEMM

7 years agoTHUNDERX2T99: Improve DGEMM
Ashwin Sekhar T K [Tue, 17 Jan 2017 07:16:23 +0000 (23:16 -0800)]
THUNDERX2T99: Improve DGEMM

7 years agoTHUNDERX2T99: Add Optimized DAXPY Implementation
Ashwin Sekhar T K [Tue, 17 Jan 2017 08:28:54 +0000 (00:28 -0800)]
THUNDERX2T99: Add Optimized DAXPY Implementation

7 years agoTHUNDERX2T99: Add Optimized SGEMM Implementation
Ashwin Sekhar T K [Wed, 11 Jan 2017 09:37:11 +0000 (15:07 +0530)]
THUNDERX2T99: Add Optimized SGEMM Implementation

7 years agoARM64: Let target VULCAN inherit THUNDERX2T99 properties
Ashwin Sekhar T K [Wed, 11 Jan 2017 07:47:10 +0000 (13:17 +0530)]
ARM64: Let target VULCAN inherit THUNDERX2T99 properties

7 years agoMerge pull request #1067 from martin-frbg/msysinst
Martin Kroeker [Mon, 16 Jan 2017 15:03:53 +0000 (16:03 +0100)]
Merge pull request #1067 from martin-frbg/msysinst

Fix DESTDIR support for cygwin/msys2 install

7 years agoFix DESTDIR support for cygwin/msys2 install
Martin Kroeker [Mon, 16 Jan 2017 14:15:46 +0000 (15:15 +0100)]
Fix DESTDIR support for cygwin/msys2 install

fixes #1066

7 years agoMerge pull request #1061 from ashwinyes/develop_aarch64_vulcan_thunderx_patch
Zhang Xianyi [Mon, 16 Jan 2017 05:20:10 +0000 (13:20 +0800)]
Merge pull request #1061 from ashwinyes/develop_aarch64_vulcan_thunderx_patch

Add new targets for ARM64

7 years agoUpdate Makefile.install (#1064)
Martin Kroeker [Wed, 11 Jan 2017 16:40:06 +0000 (17:40 +0100)]
Update Makefile.install (#1064)

* Update Makefile.install to reflect name change of lapacke_mangling.h source

7 years agoMerge pull request #1063 from wernsaar/develop
Werner Saar [Wed, 11 Jan 2017 11:37:45 +0000 (12:37 +0100)]
Merge pull request #1063 from wernsaar/develop

prepared kernel/setparam-ref.c for UNROLL values, that are not a power of two

7 years agoprepared kernel/setparam-ref.c for UNROLL values, that are not a power of two
Werner Saar [Wed, 11 Jan 2017 10:56:50 +0000 (11:56 +0100)]
prepared kernel/setparam-ref.c for UNROLL values, that are not a power of two

7 years agoMerge pull request #1062 from wernsaar/develop
Werner Saar [Wed, 11 Jan 2017 09:30:46 +0000 (10:30 +0100)]
Merge pull request #1062 from wernsaar/develop

prepared parameter.c for UNROLL values, that are not a power of two

7 years agoprepared parameter.c for UNROLL values, that are not a power of two
Werner Saar [Wed, 11 Jan 2017 08:50:28 +0000 (09:50 +0100)]
prepared parameter.c for UNROLL values, that are not a power of two

7 years agoprepared lapack/lauum for UNROLL values, that are not a power of two
Werner Saar [Wed, 11 Jan 2017 06:29:17 +0000 (07:29 +0100)]
prepared lapack/lauum for UNROLL values, that are not a power of two

7 years agoARM64: Add Cavium THUNDERX2T99 Target
Ashwin Sekhar T K [Tue, 10 Jan 2017 08:55:55 +0000 (14:25 +0530)]
ARM64: Add Cavium THUNDERX2T99 Target

7 years agoARM64: Fix auto detect of ARM64 cpus
Ashwin Sekhar T K [Tue, 10 Jan 2017 07:23:47 +0000 (12:53 +0530)]
ARM64: Fix auto detect of ARM64 cpus

7 years agoTHUNDERX: Add optimized version of daxpy
Andrew Pinski [Fri, 17 Jul 2015 04:08:03 +0000 (00:08 -0400)]
THUNDERX: Add optimized version of daxpy

This is better for single core but does not change anything for multiple cores

7 years agoMerge pull request #1060 from martin-frbg/lapacke-mingw
Martin Kroeker [Tue, 10 Jan 2017 18:09:49 +0000 (19:09 +0100)]
Merge pull request #1060 from martin-frbg/lapacke-mingw

Split LAPACKE 3.7.0 obj list (take 2, missed splitting the actual ar command invocation)

7 years agoSplit LAPACKE 3.7.0 obj list (take 2)
Martin Kroeker [Tue, 10 Jan 2017 16:11:35 +0000 (17:11 +0100)]
Split LAPACKE 3.7.0 obj list (take 2)

Missed the splitting of the actual ar call

7 years agoMerge pull request #1059 from wernsaar/develop
Werner Saar [Tue, 10 Jan 2017 15:00:28 +0000 (16:00 +0100)]
Merge pull request #1059 from wernsaar/develop

updated some level1 funcions, that are not thread save

7 years agoupdated some level1 funcions, that are not thread save
Werner Saar [Tue, 10 Jan 2017 13:05:07 +0000 (14:05 +0100)]
updated some level1 funcions, that are not thread save

7 years agoMerge pull request #1058 from wernsaar/develop
Werner Saar [Tue, 10 Jan 2017 10:30:08 +0000 (11:30 +0100)]
Merge pull request #1058 from wernsaar/develop

prepared lapack/potrf functions for UNROLL values, that are not a pow…

7 years agoprepared lapack/potrf functions for UNROLL values, that are not a power of two
Werner Saar [Tue, 10 Jan 2017 09:50:28 +0000 (10:50 +0100)]
prepared lapack/potrf functions for UNROLL values, that are not a power of two

7 years agoTHUNDERX: Add an optimized version of ddot
Andrew Pinski [Thu, 16 Jul 2015 07:30:16 +0000 (03:30 -0400)]
THUNDERX: Add an optimized version of ddot

7 years agoARM64: Add Cavium THUNDERX Target
Andrew Pinski [Tue, 10 Jan 2017 06:27:36 +0000 (11:57 +0530)]
ARM64: Add Cavium THUNDERX Target

7 years agoVULCAN: Add optimized DGEMM implementation
Ashwin Sekhar T K [Mon, 9 Jan 2017 13:18:39 +0000 (18:48 +0530)]
VULCAN: Add optimized DGEMM implementation

7 years agoARM64: Add the VULCAN Target
Ashwin Sekhar T K [Tue, 4 Oct 2016 08:50:20 +0000 (01:50 -0700)]
ARM64: Add the VULCAN Target

7 years agoCORTEXA57: Add assembly kernels for copy routines
Ashwin Sekhar T K [Tue, 4 Oct 2016 08:24:28 +0000 (01:24 -0700)]
CORTEXA57: Add assembly kernels for copy routines

7 years agoMerge pull request #1055 from ksraste/develop
Zhang Xianyi [Tue, 10 Jan 2017 05:58:26 +0000 (13:58 +0800)]
Merge pull request #1055 from ksraste/develop

Add msa optimization for AXPY, COPY, SCALE, SWAP

7 years agoAdding multi-threading for copy, dot, rot, and asum funcitons
jiahaipeng [Sun, 11 Dec 2016 09:09:50 +0000 (09:09 +0000)]
Adding multi-threading for copy, dot, rot, and asum funcitons

7 years agomodify the blas_l1_thread.c for support multi-threded for L1 fuction with return...
jiahaipeng [Sun, 11 Dec 2016 09:02:18 +0000 (09:02 +0000)]
modify the blas_l1_thread.c for support multi-threded for L1 fuction with return value

7 years agoMerge pull request #1057 from martin-frbg/lapacke-mingw
Martin Kroeker [Mon, 9 Jan 2017 19:45:26 +0000 (20:45 +0100)]
Merge pull request #1057 from martin-frbg/lapacke-mingw

Split the obj list of LAPACKE 3.7.0

7 years agoSplit the obj list of LAPACKE 3.7.0
Martin Kroeker [Mon, 9 Jan 2017 17:29:53 +0000 (18:29 +0100)]
Split the obj list of LAPACKE 3.7.0

Split obj list to allow building with mingw (argument list too long for the msys ar)

7 years agoAdd msa optimization for AXPY, COPY, SCALE, SWAP
kaustubh [Mon, 9 Jan 2017 12:57:23 +0000 (18:27 +0530)]
Add msa optimization for AXPY, COPY, SCALE, SWAP

Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
7 years agoAdd msa optimization for AXPY, COPY, SCALE, SWAP
kaustubh [Mon, 9 Jan 2017 12:52:09 +0000 (18:22 +0530)]
Add msa optimization for AXPY, COPY, SCALE, SWAP

Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
7 years agoMerge pull request #1054 from wernsaar/develop
Werner Saar [Mon, 9 Jan 2017 12:38:56 +0000 (13:38 +0100)]
Merge pull request #1054 from wernsaar/develop

prepared lapack/getrf functions for UNROLL values, that are not a pow…

7 years agoprepared lapack/getrf functions for UNROLL values, that are not a power of two
Werner Saar [Mon, 9 Jan 2017 11:57:26 +0000 (12:57 +0100)]
prepared lapack/getrf functions for UNROLL values, that are not a power of two

7 years agoMerge branch 'z13' into develop
Zhang Xianyi [Mon, 9 Jan 2017 10:52:42 +0000 (05:52 -0500)]
Merge branch 'z13' into develop

Conflicts:
CONTRIBUTORS.md

7 years agoAdd USE_TRMM=1 for IBM z13 in kernel/Makefile.L3
Zhang Xianyi [Mon, 9 Jan 2017 10:48:09 +0000 (05:48 -0500)]
Add USE_TRMM=1 for IBM z13 in kernel/Makefile.L3

7 years agoMerge pull request #1053 from wernsaar/develop
Werner Saar [Mon, 9 Jan 2017 10:17:38 +0000 (11:17 +0100)]
Merge pull request #1053 from wernsaar/develop

prepared driver/level3 functions for UNROLL values, that are not a po…

7 years agoprepared driver/level3 functions for UNROLL values, that are not a power of two
Werner Saar [Mon, 9 Jan 2017 09:38:15 +0000 (10:38 +0100)]
prepared driver/level3 functions for UNROLL values, that are not a power of two

7 years agoMerge pull request #1050 from martin-frbg/fflags
Zhang Xianyi [Mon, 9 Jan 2017 08:23:22 +0000 (16:23 +0800)]
Merge pull request #1050 from martin-frbg/fflags

Apply COMMON_OPT to default FFLAGS

7 years agoMerge pull request #1052 from martin-frbg/locking
Zhang Xianyi [Mon, 9 Jan 2017 08:22:58 +0000 (16:22 +0800)]
Merge pull request #1052 from martin-frbg/locking

Fix thread data races detected by helgrind 3.12

7 years agoRelocate declaration of alloc_lock outside ifdef block
Martin Kroeker [Mon, 9 Jan 2017 00:10:43 +0000 (01:10 +0100)]
Relocate declaration of alloc_lock outside ifdef block

7 years agoFix thread data races detected by helgrind 3.12
Martin Kroeker [Sun, 8 Jan 2017 22:33:51 +0000 (23:33 +0100)]
Fix thread data races detected by helgrind 3.12

Ref. #995, may possibly help solve issues seen in 660,883

7 years agoApply COMMON_OPT to default FFLAGS to avoid building non-optimized LAPACK by mistake
Martin Kroeker [Sun, 8 Jan 2017 20:17:22 +0000 (21:17 +0100)]
Apply COMMON_OPT to default FFLAGS to avoid building non-optimized LAPACK by mistake

7 years agoMerge pull request #1049 from wernsaar/develop
Werner Saar [Sun, 8 Jan 2017 08:30:19 +0000 (09:30 +0100)]
Merge pull request #1049 from wernsaar/develop

removed blas_thread_shutdown from gensymbol

7 years agoremoved blas_thread_shutdown from gensymbol
Werner Saar [Sun, 8 Jan 2017 07:51:30 +0000 (08:51 +0100)]
removed blas_thread_shutdown from gensymbol

7 years agoMerge pull request #1047 from brada4/erre
Zhang Xianyi [Sun, 8 Jan 2017 03:19:06 +0000 (11:19 +0800)]
Merge pull request #1047 from brada4/erre

Improve R benchmark timing

7 years agoMerge pull request #1040 from martin-frbg/develop
Zhang Xianyi [Sun, 8 Jan 2017 03:18:38 +0000 (11:18 +0800)]
Merge pull request #1040 from martin-frbg/develop

Use appropriate int32/int64 format for error number in message string

7 years agoMerge pull request #1036 from sva-img/develop
Zhang Xianyi [Sun, 8 Jan 2017 03:18:05 +0000 (11:18 +0800)]
Merge pull request #1036 from sva-img/develop

Added prefetch to CGEMV and ZGEMV.

7 years agoanti GC and reflow
Andrew [Sat, 7 Jan 2017 18:01:42 +0000 (19:01 +0100)]
anti GC and reflow

7 years agoinit
Andrew [Sat, 7 Jan 2017 18:01:21 +0000 (19:01 +0100)]
init

7 years agoMerge pull request #1046 from wernsaar/develop
Werner Saar [Sat, 7 Jan 2017 14:09:56 +0000 (15:09 +0100)]
Merge pull request #1046 from wernsaar/develop

updated lapack to version 3.7.0 with latest patches from git

7 years agofix for appveyor test
Werner Saar [Sat, 7 Jan 2017 13:27:08 +0000 (14:27 +0100)]
fix for appveyor test

7 years agoupdated exports/gensymbol for lapack-3.7.0
Werner Saar [Sat, 7 Jan 2017 12:20:28 +0000 (13:20 +0100)]
updated exports/gensymbol for lapack-3.7.0

7 years agofiltered out -fopenmp and fix for mingw
Werner Saar [Sat, 7 Jan 2017 07:41:42 +0000 (08:41 +0100)]
filtered out -fopenmp and fix for mingw

7 years agoremoved xerbla and lsame for Makefile
Werner Saar [Fri, 6 Jan 2017 15:35:20 +0000 (16:35 +0100)]
removed xerbla and lsame for Makefile

7 years agoremoved obj-files, that are moved to lapack 3.7.0
Werner Saar [Fri, 6 Jan 2017 15:14:53 +0000 (16:14 +0100)]
removed obj-files, that are moved to lapack 3.7.0

7 years agofiltered out optimized functions
Werner Saar [Fri, 6 Jan 2017 12:42:31 +0000 (13:42 +0100)]
filtered out optimized functions

7 years agoadded lapack 3.7.0 with latest patches from git
Werner Saar [Fri, 6 Jan 2017 10:48:40 +0000 (11:48 +0100)]
added lapack 3.7.0 with latest patches from git

7 years agoremoved lapack-devel.log
Werner Saar [Fri, 6 Jan 2017 10:46:58 +0000 (11:46 +0100)]
removed lapack-devel.log

7 years agoremoved lapack 3.6.0
Werner Saar [Fri, 6 Jan 2017 10:44:57 +0000 (11:44 +0100)]
removed lapack 3.6.0

7 years agoMerge pull request #1043 from quickwritereader/z13
Martin Kroeker [Thu, 5 Jan 2017 18:15:36 +0000 (19:15 +0100)]
Merge pull request #1043 from quickwritereader/z13

Z13

7 years agoUpdate xerbla.c
Martin Kroeker [Wed, 4 Jan 2017 22:16:48 +0000 (23:16 +0100)]
Update xerbla.c

7 years agoUpdate README.md
Abdurrauf [Wed, 4 Jan 2017 15:41:24 +0000 (19:41 +0400)]
Update README.md

7 years agodtrmm and dgemm for z13
Abdurrauf [Wed, 4 Jan 2017 15:32:33 +0000 (19:32 +0400)]
dtrmm and dgemm for z13

7 years agoUse appropriate int32/int64 format for error number in message string
Martin Kroeker [Thu, 29 Dec 2016 23:45:59 +0000 (00:45 +0100)]
Use appropriate int32/int64 format for error number in message string

7 years agoAdded prefetch to CGEMV and ZGEMV.
Shivraj Patil [Tue, 27 Dec 2016 06:03:51 +0000 (11:33 +0530)]
Added prefetch to CGEMV and ZGEMV.

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
7 years agoMerge pull request #1032 from kiwifb/OSX_target
Zhang Xianyi [Sun, 18 Dec 2016 06:48:22 +0000 (14:48 +0800)]
Merge pull request #1032 from kiwifb/OSX_target

Do not override MACOSX_DEPLOYMENT_TARGET if it is already defined.

7 years agoMerge pull request #1025 from mfoster96/develop
Zhang Xianyi [Sun, 18 Dec 2016 06:47:59 +0000 (14:47 +0800)]
Merge pull request #1025 from mfoster96/develop

Fix for issue #1024: arm-linux-androideabi-g++ Compiler Error in /cpu…

7 years agoMerge pull request #1031 from kiwifb/make
Zhang Xianyi [Sun, 18 Dec 2016 06:46:52 +0000 (14:46 +0800)]
Merge pull request #1031 from kiwifb/make

Never use "make" in makefiles. Only $(MAKE).

7 years agoMerge pull request #1030 from ksraste/develop
Zhang Xianyi [Sun, 18 Dec 2016 06:46:16 +0000 (14:46 +0800)]
Merge pull request #1030 from ksraste/develop

Updated data prefetch in TRSM, ASUM, DOT functions

7 years agoDo not override MACOSX_DEPLOYMENT_TARGET if it is already defined.
François Bissey [Wed, 14 Dec 2016 22:42:17 +0000 (11:42 +1300)]
Do not override MACOSX_DEPLOYMENT_TARGET if it is already defined.

7 years agoNever use "make" in makefiles. Only $(MAKE).
François Bissey [Wed, 14 Dec 2016 22:38:23 +0000 (11:38 +1300)]
Never use "make" in makefiles. Only $(MAKE).

7 years agoUpdated data prefetch in TRSM, ASUM, DOT functions
kaustubh [Wed, 14 Dec 2016 08:35:11 +0000 (14:05 +0530)]
Updated data prefetch in TRSM, ASUM, DOT functions

Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
7 years agoUpdated data prefetch in TRSM, ASUM, DOT functions
kaustubh [Tue, 13 Dec 2016 08:32:14 +0000 (14:02 +0530)]
Updated data prefetch in TRSM, ASUM, DOT functions

Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
7 years agoUpdated data prefetch in TRSM, ASUM, DOT functions
kaustubh [Tue, 13 Dec 2016 06:11:17 +0000 (11:41 +0530)]
Updated data prefetch in TRSM, ASUM, DOT functions

Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
7 years agoMerge pull request #1017 from martin-frbg/develop
Zhang Xianyi [Fri, 9 Dec 2016 08:55:13 +0000 (16:55 +0800)]
Merge pull request #1017 from martin-frbg/develop

Make c_check, f_check convert any --exclude-libs arguments to linker flags

7 years agoFix for issue #1024: arm-linux-androideabi-g++ Compiler Error in /cpuid_arm.c
Michael Foster [Fri, 2 Dec 2016 17:28:31 +0000 (09:28 -0800)]
Fix for issue #1024: arm-linux-androideabi-g++ Compiler Error in /cpuid_arm.c
Line 77: Compiler requires non-void function to return a value

7 years agoMerge pull request #1015 from ararslan/aa/freebsd
Zhang Xianyi [Fri, 2 Dec 2016 02:28:57 +0000 (10:28 +0800)]
Merge pull request #1015 from ararslan/aa/freebsd

Include system headers for blas_server on FreeBSD

7 years agoMerge pull request #996 from grisuthedragon/lapack-3.6.1
Zhang Xianyi [Fri, 2 Dec 2016 02:27:51 +0000 (10:27 +0800)]
Merge pull request #996 from grisuthedragon/lapack-3.6.1

Lapack 3.6.1

7 years agoConvert --exclude-libs argument to linker flag
Martin Kroeker [Tue, 22 Nov 2016 08:17:03 +0000 (09:17 +0100)]
Convert --exclude-libs argument to linker flag

Fixes build with TDM-GCC

7 years agoConvert --exclude-libs argument to linker flag
Martin Kroeker [Tue, 22 Nov 2016 08:14:55 +0000 (09:14 +0100)]
Convert --exclude-libs argument to linker flag

Fixes build with TDM-GCC

7 years agoMerge pull request #1016 from ksraste/develop
Zhang Xianyi [Tue, 22 Nov 2016 07:54:56 +0000 (15:54 +0800)]
Merge pull request #1016 from ksraste/develop

Add data prefetch in DOT and ASUM functions

7 years agoAdd data prefetch in DOT and ASUM functions
kaustubh [Tue, 22 Nov 2016 05:51:03 +0000 (11:21 +0530)]
Add data prefetch in DOT and ASUM functions

Signed-off-by: kaustubh <kaustubh.raste@imgtec.com>
7 years agoInclude system headers on FreeBSD
Alex Arslan [Thu, 17 Nov 2016 05:58:20 +0000 (21:58 -0800)]
Include system headers on FreeBSD

7 years agoMerge pull request #1002 from brada4/limpio
Zhang Xianyi [Mon, 7 Nov 2016 02:41:20 +0000 (10:41 +0800)]
Merge pull request #1002 from brada4/limpio

Remove few lines of dead code.

7 years agoMerge pull request #1010 from martin-frbg/cpuid
Zhang Xianyi [Mon, 7 Nov 2016 02:26:13 +0000 (10:26 +0800)]
Merge pull request #1010 from martin-frbg/cpuid

Add TARGETs for newer Intel CPUs - Kaby Lake, Knights Landing, Apollo Lake

7 years agoMerge pull request #1009 from martin-frbg/getarch-newline-fix
Zhang Xianyi [Mon, 7 Nov 2016 02:25:51 +0000 (10:25 +0800)]
Merge pull request #1009 from martin-frbg/getarch-newline-fix

Getarch newline fix

7 years agoAdd files via upload
Martin Kroeker [Sun, 6 Nov 2016 22:27:30 +0000 (23:27 +0100)]
Add files via upload

7 years agoAdd files via upload
Martin Kroeker [Sun, 6 Nov 2016 22:26:39 +0000 (23:26 +0100)]
Add files via upload

7 years agoAdd files via upload
Martin Kroeker [Sun, 6 Nov 2016 22:26:04 +0000 (23:26 +0100)]
Add files via upload

7 years agoAdd files via upload
Martin Kroeker [Sun, 6 Nov 2016 16:38:20 +0000 (17:38 +0100)]
Add files via upload

7 years agoDelete CMakeLists.txt
Martin Kroeker [Sun, 6 Nov 2016 16:37:37 +0000 (17:37 +0100)]
Delete CMakeLists.txt