platform/upstream/openblas.git
7 years agoMerge pull request #1043 from quickwritereader/z13
Martin Kroeker [Thu, 5 Jan 2017 18:15:36 +0000 (19:15 +0100)]
Merge pull request #1043 from quickwritereader/z13

Z13

7 years agoUpdate README.md
Abdurrauf [Wed, 4 Jan 2017 15:41:24 +0000 (19:41 +0400)]
Update README.md

7 years agodtrmm and dgemm for z13
Abdurrauf [Wed, 4 Jan 2017 15:32:33 +0000 (19:32 +0400)]
dtrmm and dgemm for z13

8 years agoInit IBM z system (s390x) porting.
Zhang Xianyi [Fri, 15 Apr 2016 22:02:24 +0000 (18:02 -0400)]
Init IBM z system (s390x) porting.

8 years agoBump to 0.2.19.dev.
Zhang Xianyi [Tue, 12 Apr 2016 19:32:10 +0000 (15:32 -0400)]
Bump to 0.2.19.dev.

8 years agoUpdate doc for 0.2.18 version.
Zhang Xianyi [Tue, 12 Apr 2016 19:28:31 +0000 (15:28 -0400)]
Update doc for 0.2.18 version.

8 years agoDelete LOCAL_BUFFER_SIZE for other architectures.
Zhang Xianyi [Tue, 12 Apr 2016 15:49:28 +0000 (11:49 -0400)]
Delete LOCAL_BUFFER_SIZE for other architectures.

8 years agoRefs #834. Fix zgemv config bug on Steamroller.
Zhang Xianyi [Tue, 12 Apr 2016 14:26:11 +0000 (22:26 +0800)]
Refs #834. Fix zgemv config bug on Steamroller.

8 years agoMerge pull request #837 from wernsaar/develop
wernsaar [Fri, 8 Apr 2016 09:13:27 +0000 (11:13 +0200)]
Merge pull request #837 from wernsaar/develop

updated zgemm- and ztrmm-kernel for POWER8

8 years agoupdated benchmark Makefile for ESSL
Werner Saar [Fri, 8 Apr 2016 08:37:59 +0000 (10:37 +0200)]
updated benchmark Makefile for ESSL

8 years agoupdated zgemm- and ztrmm-kernel for POWER8
Werner Saar [Fri, 8 Apr 2016 07:05:37 +0000 (09:05 +0200)]
updated zgemm- and ztrmm-kernel for POWER8

8 years agoUpdated cgemm- and sgemm-kernel for POWER8 SMP
Werner Saar [Thu, 7 Apr 2016 13:08:15 +0000 (15:08 +0200)]
Updated cgemm- and sgemm-kernel for POWER8 SMP

8 years agoRefs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test...
Zhang Xianyi [Wed, 6 Apr 2016 17:44:18 +0000 (01:44 +0800)]
Refs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test failure on AMD bulldozer and piledriver.

8 years agobugfixes for sgemm- and cgemm-kernel
Werner Saar [Wed, 6 Apr 2016 09:15:21 +0000 (11:15 +0200)]
bugfixes for sgemm- and cgemm-kernel

8 years agoMerge pull request #833 from wernsaar/develop
wernsaar [Mon, 4 Apr 2016 10:29:51 +0000 (12:29 +0200)]
Merge pull request #833 from wernsaar/develop

updated optimized cgemm- and ctrmm-kernel for POWER8

8 years agoupdated optimized cgemm- and ctrmm-kernel for POWER8
Werner Saar [Mon, 4 Apr 2016 07:12:08 +0000 (09:12 +0200)]
updated optimized cgemm- and ctrmm-kernel for POWER8

8 years agoMerge pull request #832 from wernsaar/develop
wernsaar [Sun, 3 Apr 2016 13:05:25 +0000 (15:05 +0200)]
Merge pull request #832 from wernsaar/develop

updated cgemm- and ctrmm-kernel for POWER8

8 years agoupdated cgemm- and ctrmm-kernel for POWER8
Werner Saar [Sun, 3 Apr 2016 12:30:49 +0000 (14:30 +0200)]
updated cgemm- and ctrmm-kernel for POWER8

8 years agoadded ESSL to Makefile for benchmarks
Werner Saar [Sun, 3 Apr 2016 05:21:48 +0000 (07:21 +0200)]
added ESSL to Makefile for benchmarks

8 years agoMerge pull request #831 from wernsaar/develop
wernsaar [Sat, 2 Apr 2016 16:05:44 +0000 (18:05 +0200)]
Merge pull request #831 from wernsaar/develop

updated sgemm- and strmm-kernel for POWER8

8 years agoupdated sgemm- and strmm-kernel for POWER8
Werner Saar [Sat, 2 Apr 2016 15:16:36 +0000 (17:16 +0200)]
updated sgemm- and strmm-kernel for POWER8

8 years agoMerge pull request #830 from eschnett/patch-1
Zhang Xianyi [Fri, 1 Apr 2016 21:35:22 +0000 (17:35 -0400)]
Merge pull request #830 from eschnett/patch-1

Correct small typo in comment

8 years agoCorrect small typo in comment
Erik Schnetter [Fri, 1 Apr 2016 17:49:33 +0000 (13:49 -0400)]
Correct small typo in comment

8 years agoMerge pull request #829 from jeromerobert/bug828
Zhang Xianyi [Fri, 1 Apr 2016 01:59:40 +0000 (21:59 -0400)]
Merge pull request #829 from jeromerobert/bug828

Allow to force to do not use -j as make argument

8 years agoAllow to force to do not use -j as make argument
Jerome Robert [Thu, 31 Mar 2016 21:03:52 +0000 (23:03 +0200)]
Allow to force to do not use -j as make argument

Close #828 (hopefully)

8 years agoMerge pull request #827 from wernsaar/develop
wernsaar [Wed, 30 Mar 2016 10:04:49 +0000 (12:04 +0200)]
Merge pull request #827 from wernsaar/develop

added optimized dgemv_n kernel for POWER8

8 years agoadded optimized dgemv_n kernel for POWER8
Werner Saar [Wed, 30 Mar 2016 09:10:53 +0000 (11:10 +0200)]
added optimized dgemv_n kernel for POWER8

8 years agoMerge pull request #826 from wernsaar/develop
wernsaar [Mon, 28 Mar 2016 13:09:52 +0000 (15:09 +0200)]
Merge pull request #826 from wernsaar/develop

added optimized asum kernels for POWER8

8 years agoadded optimized casum kernel for POWER8
Werner Saar [Mon, 28 Mar 2016 12:12:08 +0000 (14:12 +0200)]
added optimized casum kernel for POWER8

8 years agoadded optimized zasum kernel for POWER8
Werner Saar [Mon, 28 Mar 2016 11:37:32 +0000 (13:37 +0200)]
added optimized zasum kernel for POWER8

8 years agoadded optimized sasum kernel for POWER8
Werner Saar [Mon, 28 Mar 2016 10:44:25 +0000 (12:44 +0200)]
added optimized sasum kernel for POWER8

8 years agoadded optimized dasum kernel for POWER8
Werner Saar [Mon, 28 Mar 2016 10:17:15 +0000 (12:17 +0200)]
added optimized dasum kernel for POWER8

8 years agoMerge pull request #825 from wernsaar/develop
wernsaar [Sun, 27 Mar 2016 17:04:06 +0000 (19:04 +0200)]
Merge pull request #825 from wernsaar/develop

added optimized cswap and zswap kernel for POWER8

8 years agoadded otimized cswap and zswap kernels for POWER8
Werner Saar [Sun, 27 Mar 2016 16:31:37 +0000 (18:31 +0200)]
added otimized cswap and zswap kernels for POWER8

8 years agoadded optimized zscal kernel for POWER8
Werner Saar [Sun, 27 Mar 2016 14:31:50 +0000 (16:31 +0200)]
added optimized zscal kernel for POWER8

8 years agoadded optimized sscal kernel for POWER8
Werner Saar [Sun, 27 Mar 2016 09:05:56 +0000 (11:05 +0200)]
added optimized sscal kernel for POWER8

8 years agoMerge pull request #824 from wernsaar/develop
wernsaar [Sun, 27 Mar 2016 08:43:17 +0000 (10:43 +0200)]
Merge pull request #824 from wernsaar/develop

added optimized drot-kernel and srot-kernel for POWER8

8 years agoadded drot- and srot-kernel optimimized for POWER8
Werner Saar [Sun, 27 Mar 2016 06:57:11 +0000 (08:57 +0200)]
added drot- and srot-kernel optimimized for POWER8

8 years agoMerge pull request #819 from ashwinyes/develop_20160324_fixes_optimizations
Zhang Xianyi [Sun, 27 Mar 2016 04:04:20 +0000 (00:04 -0400)]
Merge pull request #819 from ashwinyes/develop_20160324_fixes_optimizations

Cortex-A57: Fixes and Optimizations

8 years agoadded benchmark test for srot and drot
Werner Saar [Sat, 26 Mar 2016 06:14:13 +0000 (07:14 +0100)]
added benchmark test for srot and drot

8 years agoMerge pull request #823 from wernsaar/develop
wernsaar [Fri, 25 Mar 2016 17:08:48 +0000 (18:08 +0100)]
Merge pull request #823 from wernsaar/develop

added optimized copy and swap kernels for POWER8

8 years agoadded optimized sswap kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 16:34:55 +0000 (17:34 +0100)]
added optimized sswap kernel for POWER8

8 years agoadded optimized ccopy kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 15:54:25 +0000 (16:54 +0100)]
added optimized ccopy kernel for POWER8

8 years agoadded optimized scopy kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 15:06:56 +0000 (16:06 +0100)]
added optimized scopy kernel for POWER8

8 years agoadded optimized zswap kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 14:27:34 +0000 (15:27 +0100)]
added optimized zswap kernel for POWER8

8 years agoadded optimized dswap kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 13:35:43 +0000 (14:35 +0100)]
added optimized dswap kernel for POWER8

8 years agoadded optimized dcopy kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 12:03:02 +0000 (13:03 +0100)]
added optimized dcopy kernel for POWER8

8 years agoMerge pull request #822 from wernsaar/develop
wernsaar [Fri, 25 Mar 2016 09:15:51 +0000 (10:15 +0100)]
Merge pull request #822 from wernsaar/develop

added optimized dscal kernel for POWER8

8 years agoadded optimized dscal kernel for POWER8
Werner Saar [Fri, 25 Mar 2016 08:42:08 +0000 (09:42 +0100)]
added optimized dscal kernel for POWER8

8 years agoCortex-A57: Fix clang compilation errors
Ashwin Sekhar T K [Thu, 24 Mar 2016 05:01:28 +0000 (10:31 +0530)]
Cortex-A57: Fix clang compilation errors

8 years agoCortex-A57: Improve DGEMM 8x4 Implementation
Ashwin Sekhar T K [Thu, 17 Mar 2016 04:53:51 +0000 (10:23 +0530)]
Cortex-A57: Improve DGEMM 8x4 Implementation

8 years agoMerge pull request #817 from wernsaar/develop
wernsaar [Wed, 23 Mar 2016 12:37:04 +0000 (13:37 +0100)]
Merge pull request #817 from wernsaar/develop

added optimized zaxpy kernel for POWER8

8 years agoadded optimized zaxpy kernel for POWER8
Werner Saar [Wed, 23 Mar 2016 10:20:23 +0000 (11:20 +0100)]
added optimized zaxpy kernel for POWER8

8 years agoUpdate appveyor version.
Zhang Xianyi [Tue, 22 Mar 2016 15:37:35 +0000 (11:37 -0400)]
Update appveyor version.

8 years agoMerge pull request #813 from theoractice/develop
Zhang Xianyi [Tue, 22 Mar 2016 15:31:37 +0000 (11:31 -0400)]
Merge pull request #813 from theoractice/develop

Fix access violation on Windows while static linking in MSVC

8 years agoMerge pull request #814 from wernsaar/develop
wernsaar [Tue, 22 Mar 2016 14:24:59 +0000 (15:24 +0100)]
Merge pull request #814 from wernsaar/develop

added optimized daxpy kernel for POWER8

8 years agoadded optimized daxpy kernel for POWER8
Werner Saar [Tue, 22 Mar 2016 13:50:03 +0000 (14:50 +0100)]
added optimized daxpy kernel for POWER8

8 years agoUpdate memory.c
Theoractice [Tue, 22 Mar 2016 12:02:37 +0000 (20:02 +0800)]
Update memory.c

8 years agoFix access violation on Windows while static linking
theoractice [Tue, 22 Mar 2016 11:14:54 +0000 (19:14 +0800)]
Fix access violation on Windows while static linking

8 years agoMerge pull request #1 from xianyi/develop
Theoractice [Tue, 22 Mar 2016 10:33:20 +0000 (05:33 -0500)]
Merge pull request #1 from xianyi/develop

upd

8 years agoMerge pull request #812 from wernsaar/develop
wernsaar [Mon, 21 Mar 2016 12:59:44 +0000 (13:59 +0100)]
Merge pull request #812 from wernsaar/develop

added optimized sdot kernel for POWER8

8 years agoadded optimized sdot kernel for POWER8
Werner Saar [Mon, 21 Mar 2016 12:18:23 +0000 (13:18 +0100)]
added optimized sdot kernel for POWER8

8 years agoMerge pull request #811 from wernsaar/develop
wernsaar [Mon, 21 Mar 2016 09:48:41 +0000 (10:48 +0100)]
Merge pull request #811 from wernsaar/develop

added optimized zdot kernel for POWER8

8 years agoadded optimized zdot kernel for POWER8
Werner Saar [Mon, 21 Mar 2016 09:12:07 +0000 (10:12 +0100)]
added optimized zdot kernel for POWER8

8 years agoMerge branch 'release-0.2.17' into develop
Zhang Xianyi [Mon, 21 Mar 2016 00:52:43 +0000 (20:52 -0400)]
Merge branch 'release-0.2.17' into develop

8 years agoFix change log typo. v0.2.17
Zhang Xianyi [Mon, 21 Mar 2016 00:52:15 +0000 (20:52 -0400)]
Fix change log typo.

8 years agoMerge branch 'master' into develop
Zhang Xianyi [Mon, 21 Mar 2016 00:48:21 +0000 (20:48 -0400)]
Merge branch 'master' into develop
Bump to 0.2.18.dev

Conflicts:
CMakeLists.txt
Makefile.rule

8 years agoMerge branch 'release-0.2.17'
Zhang Xianyi [Mon, 21 Mar 2016 00:44:01 +0000 (20:44 -0400)]
Merge branch 'release-0.2.17'

8 years agoUpdate doc for 0.2.17.
Zhang Xianyi [Mon, 21 Mar 2016 00:43:42 +0000 (20:43 -0400)]
Update doc for 0.2.17.

8 years agoMerge branch 'release-0.2.17' into develop
Zhang Xianyi [Sun, 20 Mar 2016 13:24:28 +0000 (09:24 -0400)]
Merge branch 'release-0.2.17' into develop

8 years agoRefs #807. Enable BUILD_LAPACK_DEPRECATED=1 by default.
Zhang Xianyi [Sun, 20 Mar 2016 13:22:56 +0000 (09:22 -0400)]
Refs #807. Enable BUILD_LAPACK_DEPRECATED=1 by default.

8 years agoMerge pull request #808 from theoractice/develop
Zhang Xianyi [Sun, 20 Mar 2016 13:07:47 +0000 (09:07 -0400)]
Merge pull request #808 from theoractice/develop

Fix a minor compiler error in VisualStudio with CMake

8 years agoMerge pull request #809 from wernsaar/develop
wernsaar [Sun, 20 Mar 2016 12:16:41 +0000 (13:16 +0100)]
Merge pull request #809 from wernsaar/develop

Ref #795: added optimized ddot kernel for POWER8

8 years agoFix a minor compiler error in VisualStudio with CMake
theoractice [Sun, 20 Mar 2016 10:58:18 +0000 (18:58 +0800)]
Fix a minor compiler error in VisualStudio with CMake

8 years agoddot for POWER8: updated licence information
Werner Saar [Sun, 20 Mar 2016 10:19:27 +0000 (11:19 +0100)]
ddot for POWER8: updated licence information

8 years agoadded optimized ddot kernel for POWER8
Werner Saar [Sun, 20 Mar 2016 10:06:06 +0000 (11:06 +0100)]
added optimized ddot kernel for POWER8

8 years agoMerge pull request #806 from wernsaar/develop
wernsaar [Fri, 18 Mar 2016 11:46:16 +0000 (12:46 +0100)]
Merge pull request #806 from wernsaar/develop

adding optimized single precision blas level3 kernels for POWER8

8 years agofixed sgemm- and strmm-kernel
Werner Saar [Fri, 18 Mar 2016 11:12:03 +0000 (12:12 +0100)]
fixed sgemm- and strmm-kernel

8 years agoadd optimized cgemm- and ctrmm-kernel for POWER8
Werner Saar [Fri, 18 Mar 2016 07:17:25 +0000 (08:17 +0100)]
add optimized cgemm- and ctrmm-kernel for POWER8

8 years agoBump devlop version to 0.2.17.dev.
Zhang Xianyi [Tue, 15 Mar 2016 18:52:01 +0000 (14:52 -0400)]
Bump devlop version to 0.2.17.dev.

8 years agoMerge branch 'release-0.2.16' v0.2.16
Zhang Xianyi [Tue, 15 Mar 2016 18:49:10 +0000 (14:49 -0400)]
Merge branch 'release-0.2.16'

8 years agoUpdate 0.2.16 doc
Zhang Xianyi [Tue, 15 Mar 2016 18:48:41 +0000 (14:48 -0400)]
Update 0.2.16 doc

8 years agoMerge branch 'develop' into release-0.2.16
Zhang Xianyi [Tue, 15 Mar 2016 17:56:01 +0000 (13:56 -0400)]
Merge branch 'develop' into release-0.2.16

8 years agoMerge pull request #802 from ashwinyes/develop_20160314_dgemm_optimization
Zhang Xianyi [Tue, 15 Mar 2016 00:31:03 +0000 (20:31 -0400)]
Merge pull request #802 from ashwinyes/develop_20160314_dgemm_optimization

DGEMM Optimizations for Cortex-A57

8 years agoMerge pull request #801 from Keno/patch-3
Zhang Xianyi [Mon, 14 Mar 2016 19:42:31 +0000 (15:42 -0400)]
Merge pull request #801 from Keno/patch-3

Don't pass REALNAME to `.end`

8 years agoUpdate CONTRIBUTORS.md
Ashwin Sekhar T K [Mon, 14 Mar 2016 14:29:41 +0000 (19:59 +0530)]
Update CONTRIBUTORS.md

8 years agoOptimize Dgemm 4x4 for Cortex A57
Ashwin Sekhar T K [Mon, 14 Mar 2016 14:05:23 +0000 (19:35 +0530)]
Optimize Dgemm 4x4 for Cortex A57

8 years agoFunctional Assembly Kernels for CortexA57
Ashwin Sekhar T K [Mon, 14 Mar 2016 14:03:21 +0000 (19:33 +0530)]
Functional Assembly Kernels for CortexA57

Adding functional (non-optimized) kernels for Cortex-A57
with the following layouts.
SGEMM - 16x4, 8x8
CGEMM - 8x4
DGEMM - 8x4, 4x8

8 years agoBUGFIX: KERNEL.POWER8
Werner Saar [Mon, 14 Mar 2016 13:36:59 +0000 (14:36 +0100)]
BUGFIX: KERNEL.POWER8

8 years agoadded sgemm- and strmm-kernel for POWER8
Werner Saar [Mon, 14 Mar 2016 12:52:44 +0000 (13:52 +0100)]
added sgemm- and strmm-kernel for POWER8

8 years agoDon't pass REALNAME to `.end`
Keno Fischer [Sun, 13 Mar 2016 22:56:21 +0000 (18:56 -0400)]
Don't pass REALNAME to `.end`

Putting the procedure there is an MSVC-ism, where it is optional. GCC silently ignores and Clang errors, so it is best to remove this.

8 years agoMerge pull request #800 from jeromerobert/smallscaling
Zhang Xianyi [Thu, 10 Mar 2016 20:45:33 +0000 (15:45 -0500)]
Merge pull request #800 from jeromerobert/smallscaling

Fix smallscaling compilation

8 years agoFix smallscaling compilation
Jerome Robert [Thu, 10 Mar 2016 19:24:41 +0000 (20:24 +0100)]
Fix smallscaling compilation

Also revert 0bbca5e

8 years agoFIX: forgot the add the files cgemv_n_4.c and cgemv_t_4.c
Werner Saar [Thu, 10 Mar 2016 10:10:38 +0000 (11:10 +0100)]
FIX: forgot the add the files cgemv_n_4.c and cgemv_t_4.c

8 years agoMerge pull request #799 from wernsaar/develop
wernsaar [Thu, 10 Mar 2016 09:22:08 +0000 (10:22 +0100)]
Merge pull request #799 from wernsaar/develop

Added optimized cgemv_n and cgemv_t kernels for bulldozer, piledriver…

8 years agoAdded optimized cgemv_n and cgemv_t kernels for bulldozer, piledriver and steamroller
Werner Saar [Thu, 10 Mar 2016 08:42:07 +0000 (09:42 +0100)]
Added optimized cgemv_n and cgemv_t kernels for bulldozer, piledriver and steamroller

8 years agoAdd missing openblas_env makefile.
Zhang Xianyi [Wed, 9 Mar 2016 19:52:47 +0000 (14:52 -0500)]
Add missing openblas_env makefile.

8 years agoRefs #716. Only call getenv at init function.
Zhang Xianyi [Wed, 9 Mar 2016 17:50:07 +0000 (12:50 -0500)]
Refs #716. Only call getenv at init function.

8 years agoMerge pull request #798 from wernsaar/develop
wernsaar [Wed, 9 Mar 2016 14:55:56 +0000 (15:55 +0100)]
Merge pull request #798 from wernsaar/develop

Optimized zgemv_n kernel for bulldozer, piledriver and steamroller

8 years agomodified common.h for piledriver
Werner Saar [Wed, 9 Mar 2016 14:48:29 +0000 (15:48 +0100)]
modified common.h for piledriver