Zhang Xianyi [Wed, 19 Feb 2014 22:58:27 +0000 (06:58 +0800)]
Merge pull request #343 from ogrisel/fix-294-fork-safe-pthread
FIX #294: fork-safe pthread mode
Olivier Grisel [Wed, 19 Feb 2014 18:25:48 +0000 (19:25 +0100)]
Do not reference pthread_atfork under windows
Olivier Grisel [Wed, 19 Feb 2014 18:14:13 +0000 (19:14 +0100)]
Make sure that fork_test.c is not built under windows
Olivier Grisel [Tue, 18 Feb 2014 09:57:01 +0000 (10:57 +0100)]
FIX #294: make OpenBLAS thread-pool resilient to fork via pthread_atfork
Olivier Grisel [Wed, 19 Feb 2014 17:32:54 +0000 (18:32 +0100)]
Revert "Refs #294. Used pthread_atfork to avoid hang after a Unix fork."
This reverts commit
3617c22a5679c52b1e760d739d76a7bde9e84d35.
Zhang Xianyi [Tue, 18 Feb 2014 07:36:04 +0000 (15:36 +0800)]
Refs #294. Used pthread_atfork to avoid hang after a Unix fork.
The problem is the mutex we used in blas_server. Thus, we must clear
the mutex before the fork and re-init them at parent and child process.
If you used OpenMP, GOMP has the same problem by now. Please try other OpenMP
implemantation.
wernsaar [Sun, 16 Feb 2014 10:45:47 +0000 (11:45 +0100)]
checked in bugfixes for ARM
Zhang Xianyi [Sat, 15 Feb 2014 15:23:15 +0000 (23:23 +0800)]
Refs #340. Fixed SEGFAULT bug of dgemv_n on OSX.
wangqian [Tue, 4 Feb 2014 13:47:51 +0000 (21:47 +0800)]
Fixed computational error of dgemv_n.
Zhang Xianyi [Fri, 24 Jan 2014 07:39:46 +0000 (15:39 +0800)]
Refs #329 #287. Only disable -fopenmp for LAPACK Fortran codes on Windows.
Zhang Xianyi [Thu, 23 Jan 2014 18:05:59 +0000 (02:05 +0800)]
Refs #338. Added OPENBLAS_VERBOSE environment variable on runtime
By default, OpenBLAS doesn't output the warning message. You can set
OPENBLAS_VERBOSE (e.g. export OPENBLAS_VERBOSE=1) to enable the warning
message on runtime.
Zhang Xianyi [Mon, 20 Jan 2014 17:27:51 +0000 (01:27 +0800)]
Refs #333. Detect the wrong parameter for zherk/zher2k.
Zhang Xianyi [Sun, 19 Jan 2014 15:28:11 +0000 (23:28 +0800)]
Fixed #334 a makefile bug in lapacke.
Zhang Xianyi [Wed, 8 Jan 2014 03:16:21 +0000 (11:16 +0800)]
Refs #335. Added the fallback of L2 size detection for some virtual machines.
Zhang Xianyi [Sun, 5 Jan 2014 15:44:29 +0000 (23:44 +0800)]
Refs #332. Added addtional Intel Ivy Bridge and Haswell CPU-id.
Zhang Xianyi [Tue, 31 Dec 2013 07:53:13 +0000 (15:53 +0800)]
Refs #307. Delete debug printf.
Zhang Xianyi [Mon, 16 Dec 2013 12:31:17 +0000 (20:31 +0800)]
Refs #330. Fixed the compatible issue with clang on Mac OSX.
Zhang Xianyi [Fri, 13 Dec 2013 12:48:05 +0000 (20:48 +0800)]
Release 0.2.9 rc1 version.
Zhang Xianyi [Fri, 13 Dec 2013 08:17:41 +0000 (16:17 +0800)]
Refs #325 #324. Imported carkl's patch for building LAPACKE on Windows.
Zhang Xianyi [Thu, 12 Dec 2013 15:07:33 +0000 (23:07 +0800)]
Refs #310. Fixed Segfault bug on nehalem when Julia calling dgeqrt3 on OSX.
Please also check JuliaLang/julia#4099
Julia test script:
A=rand(256, 256)
qrfact(A)
I found this was a bug in kernel/x86_64/dgemm_ncopy_8.S.
However, I cannot use gdb with julia. Thus, this is a walkaround fix.
Zhang Xianyi [Wed, 11 Dec 2013 14:16:40 +0000 (06:16 -0800)]
Merge pull request #327 from lucasb-eyer/develop
Added support for NO_SHARED in make install.
lucasb-eyer [Tue, 10 Dec 2013 16:10:13 +0000 (17:10 +0100)]
Added support for NO_SHARED in make install.
Zhang Xianyi [Mon, 9 Dec 2013 15:05:02 +0000 (07:05 -0800)]
Merge pull request #326 from cicku/develop
Update Makefile.install
Christopher Meng [Mon, 9 Dec 2013 11:56:37 +0000 (19:56 +0800)]
Add DESTDIR support for easier building on RPM based distros.
Use install command instead of cp to install files with permissions control.
Zhang Xianyi [Mon, 9 Dec 2013 08:50:02 +0000 (16:50 +0800)]
Refs #324. Upgrade LAPACK to 3.5.0 version.
Zhang Xianyi [Tue, 3 Dec 2013 14:47:03 +0000 (06:47 -0800)]
Merge pull request #323 from wernsaar/develop
Merge bulldozer, haswell, piledriver and armv7 branches
wernsaar [Mon, 2 Dec 2013 12:17:51 +0000 (13:17 +0100)]
added support for HASWELL
wernsaar [Mon, 2 Dec 2013 09:08:14 +0000 (10:08 +0100)]
modified trsm kernel
wernsaar [Sun, 1 Dec 2013 21:39:39 +0000 (22:39 +0100)]
added trsm kernels from origin
wernsaar [Sun, 1 Dec 2013 20:07:49 +0000 (21:07 +0100)]
modified zsymv
wernsaar [Sun, 1 Dec 2013 19:56:02 +0000 (20:56 +0100)]
merged symv
wernsaar [Sun, 1 Dec 2013 17:43:24 +0000 (18:43 +0100)]
modified param.h
wernsaar [Sun, 1 Dec 2013 17:19:58 +0000 (18:19 +0100)]
Merge remote branch 'origin/armv7' into develop
wernsaar [Sun, 1 Dec 2013 17:18:40 +0000 (18:18 +0100)]
Merge remote branch 'origin/develop' into armv7
wernsaar [Sun, 1 Dec 2013 17:15:56 +0000 (18:15 +0100)]
Merge remote branch 'origin/haswell' into develop
wernsaar [Sun, 1 Dec 2013 17:13:57 +0000 (18:13 +0100)]
Merge remote branch 'origin/develop' into piledriver
wernsaar [Sun, 1 Dec 2013 17:11:09 +0000 (18:11 +0100)]
Merge remote branch 'origin/develop' into haswell
wernsaar [Sun, 1 Dec 2013 17:10:23 +0000 (18:10 +0100)]
modified param.h
wernsaar [Sun, 1 Dec 2013 17:09:12 +0000 (18:09 +0100)]
Merge remote branch 'origin/haswell' into develop
wernsaar [Sun, 1 Dec 2013 17:06:51 +0000 (18:06 +0100)]
Merge remote branch 'origin/piledriver' into develop
wernsaar [Sun, 1 Dec 2013 17:06:14 +0000 (18:06 +0100)]
Merge remote branch 'origin/bulldozer' into develop
wernsaar [Sun, 1 Dec 2013 17:03:34 +0000 (18:03 +0100)]
Merge remote branch 'origin/master' into piledriver
wernsaar [Sun, 1 Dec 2013 17:02:54 +0000 (18:02 +0100)]
modified param.h
wernsaar [Sun, 1 Dec 2013 17:02:11 +0000 (18:02 +0100)]
modified param.h
wernsaar [Sun, 1 Dec 2013 17:00:49 +0000 (18:00 +0100)]
Merge remote branch 'origin/develop' into haswell
wernsaar [Sun, 1 Dec 2013 16:58:46 +0000 (17:58 +0100)]
modified Makefile.L3
wernsaar [Sun, 1 Dec 2013 16:54:53 +0000 (17:54 +0100)]
modified param.h
wernsaar [Sun, 1 Dec 2013 16:51:39 +0000 (17:51 +0100)]
modified param.h
wernsaar [Sun, 1 Dec 2013 16:38:58 +0000 (17:38 +0100)]
modified Makefile.rule
wernsaar [Sun, 1 Dec 2013 16:35:18 +0000 (17:35 +0100)]
modified c_check
wernsaar [Sun, 1 Dec 2013 16:31:22 +0000 (17:31 +0100)]
modified getarch.c
wernsaar [Sun, 1 Dec 2013 15:57:54 +0000 (16:57 +0100)]
Merge remote branch 'origin/develop' into piledriver
wernsaar [Sun, 1 Dec 2013 15:56:21 +0000 (16:56 +0100)]
Merge remote branch 'origin/develop' into bulldozer
wernsaar [Sun, 1 Dec 2013 15:55:24 +0000 (16:55 +0100)]
Merge remote branch 'origin/master' into bulldozer
wernsaar [Sun, 1 Dec 2013 15:53:39 +0000 (16:53 +0100)]
Merge remote branch 'origin/develop' into haswell
wernsaar [Sun, 1 Dec 2013 15:50:18 +0000 (16:50 +0100)]
Merge remote branch 'origin/master' into haswell
wernsaar [Sun, 1 Dec 2013 15:48:08 +0000 (16:48 +0100)]
modified Makefile.system
wernsaar [Sun, 1 Dec 2013 15:46:32 +0000 (16:46 +0100)]
modified Makefile.system
wernsaar [Sun, 1 Dec 2013 15:32:54 +0000 (16:32 +0100)]
modified common.h
wernsaar [Sun, 1 Dec 2013 15:23:50 +0000 (16:23 +0100)]
Merge remote branch 'origin/develop' into bulldozer
wernsaar [Sun, 1 Dec 2013 15:22:49 +0000 (16:22 +0100)]
modified common.h
wernsaar [Sun, 1 Dec 2013 15:21:10 +0000 (16:21 +0100)]
Merge remote branch 'origin/develop' into piledriver
wernsaar [Sun, 1 Dec 2013 15:19:30 +0000 (16:19 +0100)]
Merge remote branch 'origin/develop' into haswell
wernsaar [Sun, 1 Dec 2013 15:15:51 +0000 (16:15 +0100)]
modified common.h
wernsaar [Sun, 1 Dec 2013 15:11:13 +0000 (16:11 +0100)]
modified common.h
wernsaar [Sun, 1 Dec 2013 12:52:48 +0000 (13:52 +0100)]
changed level3.c
wernsaar [Sun, 1 Dec 2013 12:52:08 +0000 (13:52 +0100)]
changed level3.c
wernsaar [Sun, 1 Dec 2013 12:51:04 +0000 (13:51 +0100)]
changed level3.c
wernsaar [Sun, 1 Dec 2013 12:49:59 +0000 (13:49 +0100)]
changed level3.c
wernsaar [Sun, 1 Dec 2013 12:46:30 +0000 (13:46 +0100)]
changed level3.c
wernsaar [Sun, 1 Dec 2013 12:38:11 +0000 (13:38 +0100)]
Merge remote branch 'origin/develop' into haswell
wernsaar [Sun, 1 Dec 2013 12:33:18 +0000 (13:33 +0100)]
Merge remote branch 'origin/develop' into piledriver
wernsaar [Sun, 1 Dec 2013 12:27:35 +0000 (13:27 +0100)]
Merge remote branch 'origin/develop' into bulldozer
wernsaar [Sun, 1 Dec 2013 12:16:41 +0000 (13:16 +0100)]
Merge remote branch 'origin/develop' into armv7
wernsaar [Sun, 1 Dec 2013 10:25:33 +0000 (11:25 +0100)]
Merge remote branch 'origin/develop' into haswell
wernsaar [Sun, 1 Dec 2013 10:23:36 +0000 (11:23 +0100)]
merged common_x86.h and common_x86_64.h from develop
wernsaar [Sun, 1 Dec 2013 10:06:36 +0000 (11:06 +0100)]
Merge remote branch 'origin/develop' into piledriver
wernsaar [Sun, 1 Dec 2013 10:05:10 +0000 (11:05 +0100)]
merge from develop
wernsaar [Sun, 1 Dec 2013 09:58:22 +0000 (10:58 +0100)]
merged form develop
wernsaar [Sun, 1 Dec 2013 09:53:39 +0000 (10:53 +0100)]
Merge remote branch 'origin/develop' into bulldozer
wernsaar [Sun, 1 Dec 2013 09:12:07 +0000 (10:12 +0100)]
Merge branch 'develop' into armv7
wernsaar [Fri, 29 Nov 2013 16:06:33 +0000 (17:06 +0100)]
added complex gemv kernels for ARMV6 and ARMV7
wernsaar [Thu, 28 Nov 2013 16:40:21 +0000 (17:40 +0100)]
optimized gemv_n_vfp.S
wernsaar [Thu, 28 Nov 2013 13:35:07 +0000 (14:35 +0100)]
added optimized ctrmm kernel for ARMV6
wernsaar [Thu, 28 Nov 2013 12:41:06 +0000 (13:41 +0100)]
added optimized ztrmm kernel for ARMV6
wernsaar [Thu, 28 Nov 2013 11:45:38 +0000 (12:45 +0100)]
optimized strmm kernel for ARMV6
wernsaar [Thu, 28 Nov 2013 11:32:12 +0000 (12:32 +0100)]
optimized dtrmm kernel for ARMV7
wernsaar [Thu, 28 Nov 2013 10:54:38 +0000 (11:54 +0100)]
add optimized cgemm kernel for ARMV6
wernsaar [Thu, 28 Nov 2013 09:04:43 +0000 (10:04 +0100)]
optimized zgemm kernel for ARMV6
wernsaar [Thu, 28 Nov 2013 07:33:44 +0000 (08:33 +0100)]
optimized zgemm_kernel_2x2_vfp.S
wernsaar [Thu, 28 Nov 2013 07:08:08 +0000 (08:08 +0100)]
optimized sgemm kernel for ARMV6
wernsaar [Wed, 27 Nov 2013 17:38:32 +0000 (18:38 +0100)]
optimized sgemm kernel for ARMV6
wernsaar [Wed, 27 Nov 2013 16:37:38 +0000 (17:37 +0100)]
optimized dgemm kernel for ARMV6
wernsaar [Wed, 27 Nov 2013 15:15:06 +0000 (16:15 +0100)]
added zgemm_kernel for ARMV6
wernsaar [Mon, 25 Nov 2013 18:31:27 +0000 (19:31 +0100)]
added optimized gemv_t kernel for ARMV6
wernsaar [Mon, 25 Nov 2013 11:25:58 +0000 (12:25 +0100)]
add optimized axpy kernel
wernsaar [Sun, 24 Nov 2013 15:19:01 +0000 (16:19 +0100)]
changed default optimization from -O2 to -O3 for ARMV8
wernsaar [Sun, 24 Nov 2013 14:47:00 +0000 (15:47 +0100)]
added experimental support for ARMV8
wernsaar [Sun, 24 Nov 2013 12:22:49 +0000 (13:22 +0100)]
changed default optimization flag for ARM from -O2 to -O3
wernsaar [Sun, 24 Nov 2013 11:03:28 +0000 (12:03 +0100)]
add single and double precision gemv_n kernel for ARMV6