Zhang Xianyi [Mon, 26 Oct 2015 19:52:13 +0000 (14:52 -0500)]
Fix cmake bug on MSVC 32-bit.
Zhang Xianyi [Mon, 26 Oct 2015 18:54:53 +0000 (02:54 +0800)]
Fix cmake bug on x86 32-bit.
e.g. Build 32-bit on 64-bit Linux.
cmake -DBINARY=32
Zhang Xianyi [Mon, 26 Oct 2015 15:42:21 +0000 (23:42 +0800)]
Add CBLAS test for CMAKE.
Zhang Xianyi [Tue, 20 Oct 2015 19:37:22 +0000 (14:37 -0500)]
Fixed cmake bug on Visual Studio.
Zhang Xianyi [Tue, 20 Oct 2015 18:24:54 +0000 (02:24 +0800)]
Fixed cmake bug on haswell.
Zhang Xianyi [Mon, 19 Oct 2015 20:30:55 +0000 (04:30 +0800)]
Fixe cmake config bugs.
Zhang Xianyi [Mon, 19 Oct 2015 19:35:25 +0000 (03:35 +0800)]
Detect cmake test result.
Zhang Xianyi [Mon, 12 Oct 2015 20:46:08 +0000 (04:46 +0800)]
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
Zhang Xianyi [Thu, 8 Oct 2015 15:07:24 +0000 (15:07 +0000)]
Include time.h.
Zhang Xianyi [Tue, 6 Oct 2015 18:31:51 +0000 (02:31 +0800)]
Refs #615. Import bug fixes for LAPACKE dormlq.
Zhang Xianyi [Mon, 5 Oct 2015 19:14:32 +0000 (14:14 -0500)]
Fixed #654. Make sure the gotoblas_init function is run before all other static initializations.
Zhang Xianyi [Mon, 5 Oct 2015 15:25:15 +0000 (10:25 -0500)]
Merge pull request #656 from stevengj/libname
default to lib$(SYMBOLPREFIX)openblas$(SYMBOLSUFFIX)
Zhang Xianyi [Mon, 5 Oct 2015 15:23:52 +0000 (10:23 -0500)]
Merge pull request #659 from Keno/patch-2
Fix cross compilation suffix detection
Keno Fischer [Mon, 5 Oct 2015 04:58:07 +0000 (00:58 -0400)]
Fix cross compilation suffix detection
If the path involves `-`, this would have otherwise detected this as a cross compile suffix.
Steven G. Johnson [Thu, 1 Oct 2015 19:07:04 +0000 (15:07 -0400)]
default to lib$(SYMBOLPREFIX)openblas$(SYMBOLSUFFIX), as discussed in #646: if you rename the symbols, it is best to rename the library
Zhang Xianyi [Sat, 26 Sep 2015 14:42:44 +0000 (14:42 +0000)]
Fixed make TARGET=CORTEXA9 and CORTEXA15 bug.
Zhang Xianyi [Tue, 22 Sep 2015 15:01:59 +0000 (10:01 -0500)]
Merge pull request #652 from larsmans/fixes
Tiny fixes
Lars Buitinck [Tue, 22 Sep 2015 10:01:09 +0000 (12:01 +0200)]
git ignore versioned .so files
Lars Buitinck [Tue, 22 Sep 2015 10:00:30 +0000 (12:00 +0200)]
actually remove cblas_noconst.h
This file hasn't been used since
212463dce961827421a9c54f109a430c1599732c.
Zhang Xianyi [Thu, 10 Sep 2015 15:36:57 +0000 (10:36 -0500)]
Merge pull request #640 from kortschak/dlansy-fix
Fix LAPACK_*lansy routines
Zhang Xianyi [Thu, 10 Sep 2015 15:32:07 +0000 (10:32 -0500)]
Refs #638. Fixed compiling bug with clang on Mac OS X.
kortschak [Thu, 10 Sep 2015 06:02:50 +0000 (15:32 +0930)]
Fix LAPACK_*lansy routines
Fixes #639.
Zhang Xianyi [Wed, 9 Sep 2015 15:48:15 +0000 (10:48 -0500)]
Merge branch 'yuyichao-skylake-id' into develop
Zhang Xianyi [Wed, 9 Sep 2015 15:47:17 +0000 (10:47 -0500)]
Detect other Intel Skylake cores.
http://users.atw.hu/instlatx64/
Yichao Yu [Wed, 9 Sep 2015 15:00:23 +0000 (11:00 -0400)]
Ref #632. Support Intel Skylake by Haswell kernels.
Zhang Xianyi [Wed, 9 Sep 2015 14:56:07 +0000 (09:56 -0500)]
Merge pull request #634 from kortschak/lantr-trans-prep
Fix lantr preparation for row major matrices
kortschak [Tue, 8 Sep 2015 23:55:48 +0000 (09:25 +0930)]
Fix lantr preparation for row major matrices
Zhang Xianyi [Tue, 8 Sep 2015 18:59:08 +0000 (13:59 -0500)]
Merge pull request #633 from grisuthedragon/tune_imatcopy
Improved Ximatcopy when lda==ldb.
Martin Koehler [Mon, 7 Sep 2015 12:33:26 +0000 (14:33 +0200)]
Improved Ximatcopy when lda==ldb.
The Ximatcopy functions create a copy of the input matrix
although they seem to work inplace. The new routines
XIMATCOPY_K_YY perform the operations inplace if the leading
dimension does not change.
Zhang Xianyi [Fri, 4 Sep 2015 18:01:01 +0000 (13:01 -0500)]
Merge pull request #630 from buffer51/develop
Fixed error in common.h for Android compilation introduced by e12cf11
buffer51 [Fri, 4 Sep 2015 00:54:21 +0000 (20:54 -0400)]
Fixed error in common.h for Android compilation introduced by
e12cf1123e8784ce6fe9d2ac14526331fbe2c555
Zhang Xianyi [Thu, 20 Aug 2015 03:50:25 +0000 (22:50 -0500)]
Add notification.
Zhang Xianyi [Thu, 20 Aug 2015 03:48:55 +0000 (22:48 -0500)]
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
Zhang Xianyi [Thu, 20 Aug 2015 03:26:20 +0000 (22:26 -0500)]
Merge pull request #619 from gitter-badger/gitter-badge
Add a Gitter chat badge to README.md
The Gitter Badger [Thu, 20 Aug 2015 03:21:09 +0000 (03:21 +0000)]
Added Gitter badge
Zhang Xianyi [Wed, 19 Aug 2015 13:07:47 +0000 (08:07 -0500)]
Use C kernels for s/dgemv on x86.
Zhang Xianyi [Wed, 19 Aug 2015 03:43:42 +0000 (22:43 -0500)]
Fixed cmake bug with NO_LAPACK=1
Zhang Xianyi [Mon, 17 Aug 2015 20:22:37 +0000 (15:22 -0500)]
Merge pull request #617 from notaz/arm_fixes
really fix ARM64 locking
Grazvydas Ignotas [Sun, 16 Aug 2015 23:27:45 +0000 (01:27 +0200)]
really fix ARM64 locking
Zhang Xianyi [Sun, 16 Aug 2015 22:16:18 +0000 (17:16 -0500)]
Merge pull request #616 from notaz/arm_fixes
ARM fixes
Grazvydas Ignotas [Sun, 16 Aug 2015 18:11:13 +0000 (20:11 +0200)]
correct a minor mistake
Grazvydas Ignotas [Sun, 16 Aug 2015 16:13:30 +0000 (18:13 +0200)]
use real armv5 support
there is no more requirement for ARMv6 instructions,
and VFP on ARMv5 is uncommon
Grazvydas Ignotas [Sun, 16 Aug 2015 16:10:34 +0000 (18:10 +0200)]
add fallback blas_lock implementation
to be used on armv5 and new platforms
Grazvydas Ignotas [Sun, 16 Aug 2015 16:08:45 +0000 (18:08 +0200)]
set ARMV7 for Cortex-A9 and Cortex-A15
otherwise some macros like YIELDING are not defined correctly
Grazvydas Ignotas [Sun, 16 Aug 2015 15:27:25 +0000 (17:27 +0200)]
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
Grazvydas Ignotas [Sun, 16 Aug 2015 13:37:02 +0000 (15:37 +0200)]
add missing barriers
should fix issue #597
Grazvydas Ignotas [Sun, 16 Aug 2015 13:18:42 +0000 (15:18 +0200)]
really fix ARM locking
- was writing 0 to lock variable, so was ineffective
- only exit loop if both lock was 0 and strex was successful
Zhang Xianyi [Tue, 11 Aug 2015 21:25:16 +0000 (16:25 -0500)]
Fixed cmake build bugs on Linux.
Zhang Xianyi [Mon, 10 Aug 2015 19:31:55 +0000 (03:31 +0800)]
Merge branch 'hpanderson_cmake' into cmake
Zhang Xianyi [Mon, 10 Aug 2015 19:31:07 +0000 (03:31 +0800)]
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
Zhang Xianyi [Mon, 10 Aug 2015 19:10:44 +0000 (14:10 -0500)]
Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.
Disable CBLAS and LAPACK.
Zhang Xianyi [Thu, 6 Aug 2015 18:15:51 +0000 (13:15 -0500)]
Merge pull request #614 from xantares/cmake_version
install OpenBLASConfigVersion.cmake
xantares [Thu, 6 Aug 2015 18:03:50 +0000 (20:03 +0200)]
install OpenBLASConfigVersion.cmake
Hank Anderson [Thu, 6 Aug 2015 12:40:06 +0000 (07:40 -0500)]
Added missing lapacke.cmake file.
Zhang Xianyi [Wed, 5 Aug 2015 14:19:17 +0000 (09:19 -0500)]
Merge pull request #613 from fabioperez/develop
Add POWER7/POWER8 as targets
Fábio Perez [Wed, 5 Aug 2015 14:02:39 +0000 (11:02 -0300)]
Add POWER7/POWER8 as targets
Zhang Xianyi [Tue, 4 Aug 2015 21:58:24 +0000 (16:58 -0500)]
Merge pull request #612 from ibmsoe/ppc64le
ppc64le platform support (ELF ABI v2)
Zhang Xianyi [Tue, 4 Aug 2015 04:59:01 +0000 (23:59 -0500)]
Merge branch 'develop' into cmake
Zhang Xianyi [Tue, 4 Aug 2015 04:55:56 +0000 (23:55 -0500)]
Use pure C generic target on x86 and x86_64.
make TARGET=GENERIC
?gemm3m is unimplemented on generic target.
Zhang Xianyi [Mon, 3 Aug 2015 20:37:05 +0000 (04:37 +0800)]
Fixed cmake building bugs on Linux. Disable LAPACK by default.
Matthew Brandyberry [Tue, 21 Jul 2015 17:45:12 +0000 (12:45 -0500)]
ppc64le platform support (ELF ABI v2)
Zhang Xianyi [Tue, 21 Jul 2015 20:07:27 +0000 (04:07 +0800)]
Merge branch 'hpanderson_cmake' into cmake
Zhang Xianyi [Tue, 21 Jul 2015 20:06:39 +0000 (04:06 +0800)]
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
Zhang Xianyi [Fri, 26 Jun 2015 03:54:41 +0000 (11:54 +0800)]
Fix blas lock bug on AArch64.
Zhang Xianyi [Tue, 23 Jun 2015 02:54:51 +0000 (21:54 -0500)]
Merge pull request #595 from tanderson92/fixTests
Fix test execution when USE_OPENMP=0
wernsaar [Sat, 13 Jun 2015 14:44:48 +0000 (16:44 +0200)]
Merge pull request #596 from wernsaar/develop
optimizations for haswell
Werner Saar [Sat, 13 Jun 2015 14:16:29 +0000 (16:16 +0200)]
added optimized dtrmm_kernel for haswell
Werner Saar [Sat, 13 Jun 2015 08:28:27 +0000 (10:28 +0200)]
modified haswell parameter dgemm_unroll_n
Thomas Anderson [Sat, 13 Jun 2015 06:52:07 +0000 (23:52 -0700)]
Fix test execution when USE_OPENMP=0
The standard way to disable OpenMP support is to set USE_OPENMP=0,
as indicated by other checks to see if USE_OPENMP equals 1. The
problem is obviously then that `ifdef USE_OPENMP` is very much not
what we want to test for. This causes tests to fail when no OpenMP
library is installed.
Zhang Xianyi [Mon, 8 Jun 2015 15:53:50 +0000 (10:53 -0500)]
Fix #593. Change MACOSX_DEPLOYMENT_TARGET to 10.6.
wernsaar [Mon, 8 Jun 2015 12:22:02 +0000 (14:22 +0200)]
Merge pull request #592 from wernsaar/develop
added benchmark scripts
Werner Saar [Mon, 8 Jun 2015 12:06:38 +0000 (14:06 +0200)]
added benchmark scripts for numpy, octave and R
Werner Saar [Mon, 8 Jun 2015 10:58:38 +0000 (12:58 +0200)]
updated geev benchmark
wernsaar [Wed, 3 Jun 2015 10:14:09 +0000 (12:14 +0200)]
Merge pull request #589 from wernsaar/develop
small modification of gemm.c
Werner Saar [Wed, 3 Jun 2015 07:11:51 +0000 (09:11 +0200)]
small modification of gemm.c
wernsaar [Tue, 2 Jun 2015 13:29:49 +0000 (15:29 +0200)]
Merge pull request #587 from wernsaar/develop
added gesv benchmark
Werner Saar [Tue, 2 Jun 2015 11:35:49 +0000 (13:35 +0200)]
added gesv benchmark
wernsaar [Sun, 31 May 2015 13:01:54 +0000 (15:01 +0200)]
Merge pull request #585 from wernsaar/develop
bugfix for benchmark Makefile on MAC
Werner Saar [Sun, 31 May 2015 12:16:51 +0000 (14:16 +0200)]
bugfix for Makefile on mac
wernsaar [Fri, 29 May 2015 11:27:20 +0000 (13:27 +0200)]
Merge pull request #584 from wernsaar/develop
bugfixes, to build benchmarks with mingw on Windows OS
Werner Saar [Fri, 29 May 2015 10:56:22 +0000 (12:56 +0200)]
bugfixes, to build benchmarks with mingw on Windows OS
wernsaar [Sat, 23 May 2015 10:58:15 +0000 (12:58 +0200)]
Merge pull request #581 from wernsaar/develop
bugfix for arm locking
Werner Saar [Sat, 23 May 2015 09:40:40 +0000 (11:40 +0200)]
bugfix for arm locking
Werner Saar [Sat, 23 May 2015 08:58:38 +0000 (10:58 +0200)]
smp lock bugfix
wernsaar [Sat, 23 May 2015 07:46:39 +0000 (09:46 +0200)]
Merge pull request #580 from wernsaar/develop
added blas level1 swap benchmark
Werner Saar [Thu, 21 May 2015 06:51:42 +0000 (08:51 +0200)]
added blas level1 swap benchmark
Zhang Xianyi [Thu, 21 May 2015 02:57:27 +0000 (21:57 -0500)]
Support Android NDK armeabi-v7a-hard ABI. (-mfloat-abi=hard)
e.g.
make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7
In Android NDK, it uses armeabi-v7a-hard ABI.
TARGET_CFLAGS += -mhard-float -D_NDK_MATH_NO_SOFTFP=1
TARGET_LDFLAGS += -Wl,--no-warn-mismatch -lm_hard
For more information, please check hard-float example at
android_ndk/tests/device/hard-float/jni/.
wernsaar [Wed, 20 May 2015 09:56:02 +0000 (11:56 +0200)]
Merge pull request #578 from wernsaar/develop
added blas level1 copy benchmark
Werner Saar [Wed, 20 May 2015 09:05:00 +0000 (11:05 +0200)]
added blas level1 copy benchmark
Zhang Xianyi [Tue, 19 May 2015 17:04:45 +0000 (12:04 -0500)]
Fix f_check bug.
wernsaar [Tue, 19 May 2015 08:59:24 +0000 (10:59 +0200)]
Merge pull request #577 from wernsaar/develop
Bugfix for armv6 memory barrier
Werner Saar [Tue, 19 May 2015 08:43:12 +0000 (10:43 +0200)]
Ref #574: Bugfix for armv6 memory barrier
Zhang Xianyi [Tue, 19 May 2015 05:01:04 +0000 (00:01 -0500)]
1) Refs #575. Remove g77 from compiler list.
2) If OpenBLAS cannot find Fortran compiler, it will only build BLAS
(without LAPACK).
wernsaar [Mon, 18 May 2015 11:47:38 +0000 (13:47 +0200)]
Merge pull request #572 from wernsaar/develop
added optimized cscal and zscal functions for steamroller
Werner Saar [Mon, 18 May 2015 10:40:07 +0000 (12:40 +0200)]
added optimized cscal and zscal kernels for steamroller
Werner Saar [Mon, 18 May 2015 08:50:57 +0000 (10:50 +0200)]
added optimized cscal and zscal kernels for steamroller and piledriver
Werner Saar [Mon, 18 May 2015 06:46:06 +0000 (08:46 +0200)]
added optimized cscal kernel for sandybridge
Werner Saar [Mon, 18 May 2015 05:33:52 +0000 (07:33 +0200)]
added optimized cscal kernel for bulldozer
wernsaar [Sun, 17 May 2015 12:09:14 +0000 (14:09 +0200)]
Merge pull request #571 from wernsaar/develop
added optimized cscal and zscal functions
Werner Saar [Sun, 17 May 2015 11:44:09 +0000 (13:44 +0200)]
added optimized cscal kernel for haswell