platform/upstream/openblas.git
Zhang Xianyi [Mon, 16 May 2016 15:21:56 +0000 (23:21 +0800)]
Merge pull request #877 from jeromerobert/bug873

Disable multi-threading in swap

Jerome Robert [Mon, 16 May 2016 13:07:55 +0000 (13:07 +0000)]
Disable multi-threading in swap

* Close #873

Werner Saar [Mon, 16 May 2016 12:52:40 +0000 (14:52 +0200)]
Merge pull request #876 from wernsaar/develop

optimized dgemm on power8 for 20 threads

Werner Saar [Mon, 16 May 2016 12:14:25 +0000 (14:14 +0200)]
optimized dgemm for 20 threads

Zhang Xianyi [Mon, 9 May 2016 14:54:55 +0000 (10:54 -0400)]
Merge pull request #869 from ksraste/develop

DTRSM optimization for MIPS P5600 and I6400 using MSA

Zhang Xianyi [Mon, 9 May 2016 14:54:30 +0000 (10:54 -0400)]
Merge pull request #868 from sva-img/develop

build fix for MIPS 32 bit

Kaustubh Raste [Mon, 9 May 2016 09:45:26 +0000 (15:15 +0530)]
DTRSM optimization for MIPS P5600 and I6400 using MSA

Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>

Shivraj Patil [Mon, 9 May 2016 09:15:12 +0000 (14:45 +0530)]
build fix for MIPS 32 bit

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>

Zhang Xianyi [Fri, 6 May 2016 14:53:22 +0000 (10:53 -0400)]
Merge pull request #866 from sva-img/develop

DGEMM optimization for MIPS P5600 and I6400 using MSA

Shivraj Patil [Wed, 4 May 2016 05:37:14 +0000 (11:07 +0530)]
conflict resolved by syncing with 'xianyi:develop'

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>

Zhang Xianyi [Tue, 3 May 2016 21:06:31 +0000 (17:06 -0400)]
Merge pull request #867 from IvanUkhov/space

Wrap CURDIR and DESTDIR in quotes

Ivan Ukhov [Tue, 3 May 2016 19:31:32 +0000 (21:31 +0200)]
Wrap CURDIR and DESTDIR in quotes

Shivraj Patil [Tue, 3 May 2016 09:12:26 +0000 (14:42 +0530)]
DGEMM optimization for MIPS P5600 and I6400 using MSA

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>

Zhang Xianyi [Fri, 29 Apr 2016 15:46:24 +0000 (11:46 -0400)]
Merge pull request #863 from ashwinyes/develop_20160429_update_numa_binding

Update NUMA CPU binding

Zhang Xianyi [Fri, 29 Apr 2016 15:44:36 +0000 (11:44 -0400)]
Merge pull request #847 from sva-img/develop

MIPS P5600(32 bit) and I6400(64 bit) cores support added.

Werner Saar [Fri, 29 Apr 2016 11:33:45 +0000 (13:33 +0200)]
Merge pull request #864 from wernsaar/develop

optimized dgemm for POWER8

Werner Saar [Fri, 29 Apr 2016 10:52:47 +0000 (12:52 +0200)]
optimized dgemm for POWER8

Ashwin Sekhar T K [Fri, 29 Apr 2016 06:28:15 +0000 (11:58 +0530)]
Update NUMA CPU binding

When the number of processes can all be
accommodated within the current node,
then use cores from the current node only.

Zhang Xianyi [Thu, 28 Apr 2016 15:42:18 +0000 (11:42 -0400)]
Merge pull request #858 from buffer51/develop

Fixed cross-suffix detection for path that contains dashes

buffer51 [Thu, 28 Apr 2016 05:23:02 +0000 (22:23 -0700)]
Use CROSS_SUFFIX only if CROSS is set

buffer51 [Wed, 27 Apr 2016 19:09:44 +0000 (12:09 -0700)]
Fixed cross-suffix detection for path that contains dashes when the compiler itself doesn't

Werner Saar [Wed, 27 Apr 2016 14:34:15 +0000 (16:34 +0200)]
Merge pull request #856 from wernsaar/develop

optimized dgemm for POWER8

Werner Saar [Wed, 27 Apr 2016 13:48:09 +0000 (15:48 +0200)]
optimized param.h for POWER8

Werner Saar [Wed, 27 Apr 2016 12:01:08 +0000 (14:01 +0200)]
optimized dgemm for POWER8

Zhang Xianyi [Tue, 26 Apr 2016 14:24:33 +0000 (10:24 -0400)]
Merge pull request #852 from buffer51/develop

Added Android as a community-supported OS

Zhang Xianyi [Tue, 26 Apr 2016 14:24:13 +0000 (10:24 -0400)]
Merge pull request #851 from rndfax/develop

allow building tests when CROSS compiling but don't run them

buffer51 [Tue, 26 Apr 2016 10:14:03 +0000 (03:14 -0700)]
Added Android as a community-supported OS

Aleksey Kuleshov [Fri, 22 Apr 2016 15:21:18 +0000 (18:21 +0300)]
allow building tests when CROSS compiling but don't run them

Werner Saar [Mon, 25 Apr 2016 10:00:43 +0000 (12:00 +0200)]
Merge pull request #850 from wernsaar/develop

Bugfixes and enhancements for EXCAVATOR

Werner Saar [Mon, 25 Apr 2016 08:40:04 +0000 (10:40 +0200)]
updated param.h for EXCAVATOR

Werner Saar [Mon, 25 Apr 2016 08:36:23 +0000 (10:36 +0200)]
updated some kernel files for EXCAVATOR

Werner Saar [Mon, 25 Apr 2016 08:13:30 +0000 (10:13 +0200)]
bugfix for EXCAVATOR and DYNAMIC_ARCH

Werner Saar [Mon, 25 Apr 2016 07:08:38 +0000 (09:08 +0200)]
bugfix in dynamic.c

Werner Saar [Sat, 23 Apr 2016 14:25:27 +0000 (16:25 +0200)]
Merge pull request #849 from wernsaar/develop

optimized gemm for POWER8

Werner Saar [Sat, 23 Apr 2016 12:26:24 +0000 (14:26 +0200)]
updated param.h for POWER8

Werner Saar [Sat, 23 Apr 2016 08:04:41 +0000 (10:04 +0200)]
added sgemm_tcopy_8_power8.S

Werner Saar [Sat, 23 Apr 2016 05:37:18 +0000 (07:37 +0200)]
added cgemm_tcopy_8_power8.S

Werner Saar [Fri, 22 Apr 2016 11:46:22 +0000 (13:46 +0200)]
Merge pull request #848 from wernsaar/develop

Optimized zgemm for POWER8 and tested zgemm again

Werner Saar [Fri, 22 Apr 2016 11:07:12 +0000 (13:07 +0200)]
Optimized zgemm and tested zgemm again

Shivraj Patil [Fri, 22 Apr 2016 08:33:18 +0000 (14:03 +0530)]
MIPS P5600(32 bit) and I6400(64 bit) cores support added.

Separated mips and mips64 files.
Configurations support for mips 32 bit.

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>

Werner Saar [Thu, 21 Apr 2016 11:52:24 +0000 (13:52 +0200)]
Merge pull request #846 from wernsaar/develop

Optimized sgemm and dgemm for POWER8

Werner Saar [Thu, 21 Apr 2016 10:54:32 +0000 (12:54 +0200)]
added bugfixes for some make files and smallscaling.c

Werner Saar [Thu, 21 Apr 2016 09:37:57 +0000 (11:37 +0200)]
Optimized sgemm and dgemm and tested again.

Werner Saar [Wed, 20 Apr 2016 13:28:28 +0000 (15:28 +0200)]
optimized Makefile.power for POWER8

wernsaar [Wed, 20 Apr 2016 11:44:22 +0000 (13:44 +0200)]
Merge pull request #845 from wernsaar/develop

optimized sgemm for power8

Werner Saar [Wed, 20 Apr 2016 11:06:38 +0000 (13:06 +0200)]
optimized sgemm

Werner Saar [Tue, 19 Apr 2016 14:08:54 +0000 (16:08 +0200)]
added optimized sgemm_tcopy for power8

Zhang Xianyi [Tue, 12 Apr 2016 19:32:10 +0000 (15:32 -0400)]
Bump to 0.2.19.dev.

Zhang Xianyi [Tue, 12 Apr 2016 19:28:31 +0000 (15:28 -0400)]
Update doc for 0.2.18 version.

Zhang Xianyi [Tue, 12 Apr 2016 15:49:28 +0000 (11:49 -0400)]
Delete LOCAL_BUFFER_SIZE for other architectures.

Zhang Xianyi [Tue, 12 Apr 2016 14:26:11 +0000 (22:26 +0800)]
Refs #834. Fix zgemv config bug on Steamroller.

Werner Saar [Mon, 11 Apr 2016 09:21:36 +0000 (11:21 +0200)]
bugfix for arm scal.c and zscal.c

Werner Saar [Sun, 10 Apr 2016 09:28:20 +0000 (11:28 +0200)]
added cholesky benchmarks to Makefile for ESSL

wernsaar [Fri, 8 Apr 2016 09:13:27 +0000 (11:13 +0200)]
Merge pull request #837 from wernsaar/develop

updated zgemm- and ztrmm-kernel for POWER8

Werner Saar [Fri, 8 Apr 2016 08:37:59 +0000 (10:37 +0200)]
updated benchmark Makefile for ESSL

Werner Saar [Fri, 8 Apr 2016 07:05:37 +0000 (09:05 +0200)]
updated zgemm- and ztrmm-kernel for POWER8

Werner Saar [Thu, 7 Apr 2016 13:08:15 +0000 (15:08 +0200)]
Updated cgemm- and sgemm-kernel for POWER8 SMP

Zhang Xianyi [Wed, 6 Apr 2016 17:44:18 +0000 (01:44 +0800)]
Refs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test failure on AMD bulldozer and piledriver.

Werner Saar [Wed, 6 Apr 2016 09:15:21 +0000 (11:15 +0200)]
bugfixes for sgemm- and cgemm-kernel

wernsaar [Mon, 4 Apr 2016 10:29:51 +0000 (12:29 +0200)]
Merge pull request #833 from wernsaar/develop

updated optimized cgemm- and ctrmm-kernel for POWER8

Werner Saar [Mon, 4 Apr 2016 07:12:08 +0000 (09:12 +0200)]
updated optimized cgemm- and ctrmm-kernel for POWER8

wernsaar [Sun, 3 Apr 2016 13:05:25 +0000 (15:05 +0200)]
Merge pull request #832 from wernsaar/develop

updated cgemm- and ctrmm-kernel for POWER8

Werner Saar [Sun, 3 Apr 2016 12:30:49 +0000 (14:30 +0200)]
updated cgemm- and ctrmm-kernel for POWER8

Werner Saar [Sun, 3 Apr 2016 05:21:48 +0000 (07:21 +0200)]
added ESSL to Makefile for benchmarks

wernsaar [Sat, 2 Apr 2016 16:05:44 +0000 (18:05 +0200)]
Merge pull request #831 from wernsaar/develop

updated sgemm- and strmm-kernel for POWER8

Werner Saar [Sat, 2 Apr 2016 15:16:36 +0000 (17:16 +0200)]
updated sgemm- and strmm-kernel for POWER8

Zhang Xianyi [Fri, 1 Apr 2016 21:35:22 +0000 (17:35 -0400)]
Merge pull request #830 from eschnett/patch-1

Correct small typo in comment

Erik Schnetter [Fri, 1 Apr 2016 17:49:33 +0000 (13:49 -0400)]
Correct small typo in comment

Zhang Xianyi [Fri, 1 Apr 2016 01:59:40 +0000 (21:59 -0400)]
Merge pull request #829 from jeromerobert/bug828

Allow to force to do not use -j as make argument

Jerome Robert [Thu, 31 Mar 2016 21:03:52 +0000 (23:03 +0200)]
Allow to force to do not use -j as make argument

Close #828 (hopefully)

wernsaar [Wed, 30 Mar 2016 10:04:49 +0000 (12:04 +0200)]
Merge pull request #827 from wernsaar/develop

added optimized dgemv_n kernel for POWER8

Werner Saar [Wed, 30 Mar 2016 09:10:53 +0000 (11:10 +0200)]
added optimized dgemv_n kernel for POWER8

wernsaar [Mon, 28 Mar 2016 13:09:52 +0000 (15:09 +0200)]
Merge pull request #826 from wernsaar/develop

added optimized asum kernels for POWER8

Werner Saar [Mon, 28 Mar 2016 12:12:08 +0000 (14:12 +0200)]
added optimized casum kernel for POWER8

Werner Saar [Mon, 28 Mar 2016 11:37:32 +0000 (13:37 +0200)]
added optimized zasum kernel for POWER8

Werner Saar [Mon, 28 Mar 2016 10:44:25 +0000 (12:44 +0200)]
added optimized sasum kernel for POWER8

Werner Saar [Mon, 28 Mar 2016 10:17:15 +0000 (12:17 +0200)]
added optimized dasum kernel for POWER8

wernsaar [Sun, 27 Mar 2016 17:04:06 +0000 (19:04 +0200)]
Merge pull request #825 from wernsaar/develop

added optimized cswap and zswap kernel for POWER8

Werner Saar [Sun, 27 Mar 2016 16:31:37 +0000 (18:31 +0200)]
added optimized cswap and zswap kernels for POWER8

Werner Saar [Sun, 27 Mar 2016 14:31:50 +0000 (16:31 +0200)]
added optimized zscal kernel for POWER8

Werner Saar [Sun, 27 Mar 2016 09:05:56 +0000 (11:05 +0200)]
added optimized sscal kernel for POWER8

wernsaar [Sun, 27 Mar 2016 08:43:17 +0000 (10:43 +0200)]
Merge pull request #824 from wernsaar/develop

added optimized drot-kernel and srot-kernel for POWER8

Werner Saar [Sun, 27 Mar 2016 06:57:11 +0000 (08:57 +0200)]
added drot- and srot-kernel optimized for POWER8

Zhang Xianyi [Sun, 27 Mar 2016 04:04:20 +0000 (00:04 -0400)]
Merge pull request #819 from ashwinyes/develop_20160324_fixes_optimizations

Cortex-A57: Fixes and Optimizations

Werner Saar [Sat, 26 Mar 2016 06:14:13 +0000 (07:14 +0100)]
added benchmark test for srot and drot

wernsaar [Fri, 25 Mar 2016 17:08:48 +0000 (18:08 +0100)]
Merge pull request #823 from wernsaar/develop

added optimized copy and swap kernels for POWER8

Werner Saar [Fri, 25 Mar 2016 16:34:55 +0000 (17:34 +0100)]
added optimized sswap kernel for POWER8

Werner Saar [Fri, 25 Mar 2016 15:54:25 +0000 (16:54 +0100)]
added optimized ccopy kernel for POWER8

Werner Saar [Fri, 25 Mar 2016 15:06:56 +0000 (16:06 +0100)]
added optimized scopy kernel for POWER8

Werner Saar [Fri, 25 Mar 2016 14:27:34 +0000 (15:27 +0100)]
added optimized zswap kernel for POWER8

Werner Saar [Fri, 25 Mar 2016 13:35:43 +0000 (14:35 +0100)]
added optimized dswap kernel for POWER8

Werner Saar [Fri, 25 Mar 2016 12:03:02 +0000 (13:03 +0100)]
added optimized dcopy kernel for POWER8

wernsaar [Fri, 25 Mar 2016 09:15:51 +0000 (10:15 +0100)]
Merge pull request #822 from wernsaar/develop

added optimized dscal kernel for POWER8

Werner Saar [Fri, 25 Mar 2016 08:42:08 +0000 (09:42 +0100)]
added optimized dscal kernel for POWER8

Ashwin Sekhar T K [Thu, 24 Mar 2016 05:01:28 +0000 (10:31 +0530)]
Cortex-A57: Fix clang compilation errors

Ashwin Sekhar T K [Thu, 17 Mar 2016 04:53:51 +0000 (10:23 +0530)]
Cortex-A57: Improve DGEMM 8x4 Implementation

wernsaar [Wed, 23 Mar 2016 12:37:04 +0000 (13:37 +0100)]
Merge pull request #817 from wernsaar/develop

added optimized zaxpy kernel for POWER8

Werner Saar [Wed, 23 Mar 2016 10:20:23 +0000 (11:20 +0100)]
added optimized zaxpy kernel for POWER8

Zhang Xianyi [Tue, 22 Mar 2016 15:37:35 +0000 (11:37 -0400)]
Update appveyor version.

Zhang Xianyi [Tue, 22 Mar 2016 15:31:37 +0000 (11:31 -0400)]
Merge pull request #813 from theoractice/develop

Fix access violation on Windows while static linking in MSVC