Martin Kroeker [Thu, 30 May 2019 09:38:11 +0000 (11:38 +0200)]
Use generic kernels for complex (I)AMAX to support softfp
Martin Kroeker [Thu, 30 May 2019 09:25:43 +0000 (11:25 +0200)]
Ensure correct output for DAMAX with softfp
Martin Kroeker [Wed, 29 May 2019 13:02:51 +0000 (15:02 +0200)]
Separate implementations of AMAX and IAMAX on arm
As noted in #1912 and comment on #1942, the combined implementation happens to "do the right thing" on hardfp, but cannot return both value and index on softfp where they would have to share the return register
Martin Kroeker [Thu, 9 May 2019 13:25:09 +0000 (15:25 +0200)]
Merge pull request #2127 from martin-frbg/issue2114_2
Add NO_AFFINITY to available CMAKE options on Linux, and set it to ON
Martin Kroeker [Thu, 9 May 2019 12:42:36 +0000 (14:42 +0200)]
Replace ISMIN and ISAMIN kernels on all x86_64 platforms (#2125)
* Mark iamax_sse.S as unsuitable for MIN due to issue #2116
* Use iamax.S rather than iamax_sse.S for ISMIN/ISAMIN on all x86_64 as workaround for #2116
Martin Kroeker [Thu, 9 May 2019 09:52:02 +0000 (11:52 +0200)]
Add NO_AFFINITY to available options on Linux, and set it to ON
to match the gmake default. Fixes second part of #2114
Martin Kroeker [Thu, 9 May 2019 06:57:37 +0000 (08:57 +0200)]
Merge pull request #2124 from tylerjereddy/manylinux1_azure
TST: Azure manylinux1 & clean-up
Martin Kroeker [Thu, 9 May 2019 06:10:19 +0000 (08:10 +0200)]
Merge pull request #2123 from tylerjereddy/azure_readme_badge
DOC: Add Azure CI status badge to README
Martin Kroeker [Thu, 9 May 2019 06:10:00 +0000 (08:10 +0200)]
Merge pull request #2120 from brada4/getrf-2113
Address redundant code concern #2113
Tyler Reddy [Thu, 9 May 2019 01:51:59 +0000 (18:51 -0700)]
TST: Azure manylinux1 & clean-up
* remove some of the steps & comments
from the original Azure yml template
* modify the trigger section to use
develop since OpenBLAS primarily uses
this branch; use the same batching
behavior as downstream projects NumPy/
SciPy
* remove Travis emulated ARMv6 gcc build
because this now happens in Azure
* use documented Ubuntu vmImage name for Azure
and add in a manylinux1 test run to the matrix
[skip appveyor]
Martin Kroeker [Wed, 8 May 2019 22:47:44 +0000 (00:47 +0200)]
Add ARMV6 build to azure CI setup (#2122)
using aytekinar's Alpine image and docker script from the Travis setup
[skip ci]
Tyler Reddy [Wed, 8 May 2019 22:14:01 +0000 (15:14 -0700)]
DOC: Add Azure CI status badge
Martin Kroeker [Wed, 8 May 2019 17:07:58 +0000 (19:07 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 13:25:43 +0000 (15:25 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 13:23:54 +0000 (15:23 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 12:14:22 +0000 (14:14 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 12:12:02 +0000 (14:12 +0200)]
Update azure-pipelines.yml
take out offending lines (although stolen from https://github.com/conda-forge/opencv-feedstock azure-pipelines fiie)
Martin Kroeker [Wed, 8 May 2019 11:58:59 +0000 (13:58 +0200)]
fix tabbing in azure commands
Martin Kroeker [Wed, 8 May 2019 11:52:22 +0000 (13:52 +0200)]
first try migrating one of the arm builds from travis
Martin Kroeker [Wed, 8 May 2019 11:31:46 +0000 (13:31 +0200)]
Merge pull request #2121 from tylerjereddy/ppc64le-travis
TST: add native POWER8 to CI
Martin Kroeker [Wed, 8 May 2019 10:20:00 +0000 (12:20 +0200)]
Update link to IBM MASS library, update cpu support status
Martin Kroeker [Wed, 8 May 2019 09:41:17 +0000 (11:41 +0200)]
Merge pull request #2118 from Diazonium/develop
Change two http links to https
Tyler Reddy [Tue, 7 May 2019 23:06:42 +0000 (16:06 -0700)]
TST: add native POWER8 to CI
* add native POWER8 testing to
Travis CI matrix with ppc64le
os entry
Zhang Xianyi [Wed, 8 May 2019 01:58:01 +0000 (09:58 +0800)]
Set up CI with Azure Pipelines
[skip ci]
Andrew [Tue, 7 May 2019 20:46:54 +0000 (23:46 +0300)]
remove redundant code #2113
Martin Kroeker [Tue, 7 May 2019 16:18:16 +0000 (18:18 +0200)]
Merge pull request #2117 from martin-frbg/issue2114
Fix errors in cpu affinity setup with glibc 2.6
Diazonium [Tue, 7 May 2019 12:55:20 +0000 (14:55 +0200)]
Change two http links to https
Closes #2109
Martin Kroeker [Tue, 7 May 2019 11:34:52 +0000 (13:34 +0200)]
Fix errors in cpu enumeration with glibc 2.6
for #2114
Andrew [Tue, 7 May 2019 10:15:08 +0000 (13:15 +0300)]
init
Martin Kroeker [Sun, 5 May 2019 16:08:49 +0000 (18:08 +0200)]
Merge pull request #2111 from martin-frbg/issue1955
Disable the SkyLakeX DGEMMIxCOPY kernels as well
Martin Kroeker [Sun, 5 May 2019 13:52:01 +0000 (15:52 +0200)]
Disable DGEMMINCOPY as well for now
#1955
Martin Kroeker [Sat, 4 May 2019 20:54:41 +0000 (22:54 +0200)]
Disable the SkyLakeX DGEMMITCOPY kernel as well
as a stopgap measure for https://github.com/numpy/numpy/issues/13401 as mentioned in #1955
Martin Kroeker [Sat, 4 May 2019 20:28:29 +0000 (22:28 +0200)]
Merge pull request #2101 from luzpaz/misc-typos
Misc. typo fixes in comments and documentation
luz.paz [Sat, 4 May 2019 19:43:17 +0000 (15:43 -0400)]
Revert Changelog.txt typos
luz.paz [Sat, 4 May 2019 19:01:29 +0000 (15:01 -0400)]
Revert reference/ fixes
Martin Kroeker [Thu, 2 May 2019 05:56:57 +0000 (07:56 +0200)]
Merge pull request #2107 from quickwritereader/develop
sgemm/strmm kernel for power9
Martin Kroeker [Thu, 2 May 2019 05:56:37 +0000 (07:56 +0200)]
Merge pull request #2105 from martin-frbg/issue2104
Correct argument of CPU_ISSET for glibc <2.5
AbdelRauf [Wed, 1 May 2019 19:36:22 +0000 (19:36 +0000)]
conflict resolve
Martin Kroeker [Wed, 1 May 2019 08:46:46 +0000 (10:46 +0200)]
Correct argument of CPU_ISSET for glibc <2.5
fixes #2104
luz.paz [Mon, 29 Apr 2019 21:03:56 +0000 (17:03 -0400)]
Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
Martin Kroeker [Mon, 29 Apr 2019 17:25:32 +0000 (19:25 +0200)]
Increment version to 0.3.7.dev
Martin Kroeker [Mon, 29 Apr 2019 17:24:53 +0000 (19:24 +0200)]
Increment version to 0.3.7.dev
Martin Kroeker [Mon, 29 Apr 2019 17:21:54 +0000 (19:21 +0200)]
Merge branch 'release-0.3.0' into develop
Martin Kroeker [Mon, 29 Apr 2019 17:18:26 +0000 (19:18 +0200)]
Update Changelog with changes from 0.3.6
AbdelRauf [Mon, 29 Apr 2019 08:57:44 +0000 (08:57 +0000)]
Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop
AbdelRauf [Sat, 13 Apr 2019 13:56:19 +0000 (13:56 +0000)]
sgemm/strmm
Martin Kroeker [Mon, 29 Apr 2019 07:25:19 +0000 (09:25 +0200)]
Merge pull request #2099 from martin-frbg/rela-gbtrf
Disable repeated recursion on Ab_BR in ReLAPACK xGBTRF
Martin Kroeker [Sun, 28 Apr 2019 22:12:37 +0000 (00:12 +0200)]
Disable repeated recursion on Ab_BR in ReLAPACK xGBTRF
due to crashes in LAPACK tests
Martin Kroeker [Sun, 28 Apr 2019 17:31:01 +0000 (19:31 +0200)]
Merge pull request #2098 from martin-frbg/rela-malloc
Disable reallocation of work array in ReLAPACK xSYTRF
Martin Kroeker [Sun, 28 Apr 2019 17:28:57 +0000 (19:28 +0200)]
Merge pull request #2097 from martin-frbg/rela-getrf
Correct INFO=4 condition in ReLAPACK xGETRF
Martin Kroeker [Sun, 28 Apr 2019 08:02:28 +0000 (10:02 +0200)]
Disable reallocation of work array in xSYTRF
as it appears to cause memory management problems (seen in the LAPACK tests)
Martin Kroeker [Sun, 28 Apr 2019 07:58:56 +0000 (09:58 +0200)]
Correct INFO=4 condition
Martin Kroeker [Sun, 28 Apr 2019 07:55:42 +0000 (09:55 +0200)]
Merge pull request #2096 from martin-frbg/eig-testing
Avoid out-of-bounds accesses in LAPACK EIG tests
Martin Kroeker [Sun, 28 Apr 2019 07:55:25 +0000 (09:55 +0200)]
Merge pull request #2095 from martin-frbg/trsm
Correct length of name string in xerbla call
Martin Kroeker [Sat, 27 Apr 2019 21:01:49 +0000 (23:01 +0200)]
Avoid out-of-bounds accesses in LAPACK EIG tests
see https://github.com/Reference-LAPACK/lapack/issues/333
Martin Kroeker [Sat, 27 Apr 2019 20:49:04 +0000 (22:49 +0200)]
Correct length of name string in xerbla call
Martin Kroeker [Sat, 27 Apr 2019 20:45:47 +0000 (22:45 +0200)]
Merge pull request #2094 from martin-frbg/issue2066
Fix ReLAPACK integration problems
Martin Kroeker [Sat, 27 Apr 2019 17:06:00 +0000 (19:06 +0200)]
Add support for INTERFACE64 and fix XERBLA calls
1. Replaced all instances of "int" with "blasint"
2. Added string length as "hidden" third parameter in calls to fortran XERBLA
Martin Kroeker [Sat, 27 Apr 2019 16:55:47 +0000 (18:55 +0200)]
Support INTERFACE64=1
Martin Kroeker [Tue, 23 Apr 2019 18:12:06 +0000 (20:12 +0200)]
Merge pull request #2092 from jeffbaylor/snprintf_with_MSC_VER
snprintf define consolidated to common.h
Martin Kroeker [Tue, 23 Apr 2019 18:11:36 +0000 (20:11 +0200)]
Merge pull request #2072 from martin-frbg/sum
Add (C)BLAS extension ?sum
Jeff Baylor [Tue, 23 Apr 2019 00:01:34 +0000 (17:01 -0700)]
snprintf define consolidated to common.h
Martin Kroeker [Sun, 14 Apr 2019 19:40:07 +0000 (21:40 +0200)]
Merge pull request #2084 from RashmicaG/develop
Add in runtime CPU detection for POWER.
Rashmica Gupta [Tue, 9 Apr 2019 04:13:24 +0000 (14:13 +1000)]
Add in runtime CPU detection for POWER.
Martin Kroeker [Tue, 2 Apr 2019 19:40:58 +0000 (21:40 +0200)]
Merge pull request #2080 from martin-frbg/issue2075
Add -lm and disable EXPRECISION support on *BSD
Martin Kroeker [Tue, 2 Apr 2019 07:38:18 +0000 (09:38 +0200)]
Add -lm and disable EXPRECISION support on *BSD
fixes #2075
Martin Kroeker [Sun, 31 Mar 2019 20:12:23 +0000 (22:12 +0200)]
Add declarations for ?sum
Martin Kroeker [Sun, 31 Mar 2019 11:56:08 +0000 (13:56 +0200)]
Merge pull request #2073 from martin-frbg/issue2056-2
Detect 32bit environment on 64bit ARM hardware
Martin Kroeker [Sun, 31 Mar 2019 11:55:49 +0000 (13:55 +0200)]
Add ?sum definitions for generic kernel
Martin Kroeker [Sun, 31 Mar 2019 11:55:05 +0000 (13:55 +0200)]
Add ?sum
Martin Kroeker [Sun, 31 Mar 2019 09:57:01 +0000 (11:57 +0200)]
Add cmake defaults for ?sum kernels
Martin Kroeker [Sun, 31 Mar 2019 08:50:43 +0000 (10:50 +0200)]
Detect 32bit environment on 64bit ARM hardware
for #2056, using same approach as #2058
Martin Kroeker [Sat, 30 Mar 2019 21:49:05 +0000 (22:49 +0100)]
Add ZARCH implementation of ?sum
as trivial copies of the respective ?asum kernels with the ABS and vflpsb calls removed
Martin Kroeker [Sat, 30 Mar 2019 21:27:04 +0000 (22:27 +0100)]
Add x86_64 implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
Martin Kroeker [Sat, 30 Mar 2019 21:26:10 +0000 (22:26 +0100)]
Add x86 implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
Martin Kroeker [Sat, 30 Mar 2019 21:25:06 +0000 (22:25 +0100)]
Add SPARC implementation of ?sum
as trivial copy of ?asum with the fabs replaced by fmov to preserve code structure
Martin Kroeker [Sat, 30 Mar 2019 21:23:42 +0000 (22:23 +0100)]
Add POWER implementation of ?sum
as trivial copy of ?asum with the fabs replaced by fmr to preserve code structure
Martin Kroeker [Sat, 30 Mar 2019 21:22:15 +0000 (22:22 +0100)]
Add MIPS64 implementation of ?sum
as trivial copy of ?asum with the fabs replaced by mov to preserve code structure
Martin Kroeker [Sat, 30 Mar 2019 21:20:14 +0000 (22:20 +0100)]
Add MIPS implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
Martin Kroeker [Sat, 30 Mar 2019 21:18:03 +0000 (22:18 +0100)]
Add ia64 implementation of ?sum
as trivial copy of asum with the fabs calls removed
Martin Kroeker [Sat, 30 Mar 2019 21:13:36 +0000 (22:13 +0100)]
Add ARM64 implementations of ?sum
as trivial copies of the respective ?asum kernels with the fabs calls removed
Martin Kroeker [Sat, 30 Mar 2019 21:11:38 +0000 (22:11 +0100)]
Add ARM implementations of ?sum
(trivial copies of the respective ?asum with the fabs calls removed)
Martin Kroeker [Sat, 30 Mar 2019 21:05:11 +0000 (22:05 +0100)]
Add implementations of ssum/dsum and csum/zsum
as trivial copies of asum/zsasum with the fabs calls replaced by fmov to preserve code structure
Martin Kroeker [Sat, 30 Mar 2019 21:01:13 +0000 (22:01 +0100)]
Add ?sum
Martin Kroeker [Sat, 30 Mar 2019 20:59:18 +0000 (21:59 +0100)]
Add interface for ?sum (derived from ?asum)
Martin Kroeker [Sat, 30 Mar 2019 20:58:03 +0000 (21:58 +0100)]
Add declarations for ?sum and cblas_?sum
Martin Kroeker [Sat, 30 Mar 2019 20:21:38 +0000 (21:21 +0100)]
Merge pull request #2061 from martin-frbg/martin-frbg-patch-1
Disable the AVX512 DGEMM kernel (again)
Martin Kroeker [Sat, 30 Mar 2019 13:54:28 +0000 (14:54 +0100)]
Merge pull request #2071 from martin-frbg/issue2068
Provide CBLAS interfaces to I?MIN and I?MAX
Martin Kroeker [Sat, 30 Mar 2019 11:38:41 +0000 (12:38 +0100)]
Build CBLAS interfaces for I?MIN and I?MAX
Martin Kroeker [Sat, 30 Mar 2019 11:37:13 +0000 (12:37 +0100)]
Expose CBLAS interfaces for I?MIN and I?MAX
Martin Kroeker [Fri, 29 Mar 2019 20:46:21 +0000 (21:46 +0100)]
Merge pull request #2070 from quickwritereader/develop
power9 makefile. dgemm based on power8 kernel with following changes …
Martin Kroeker [Fri, 29 Mar 2019 18:36:29 +0000 (19:36 +0100)]
Merge branch 'develop' into develop
AbdelRauf [Thu, 14 Mar 2019 10:42:04 +0000 (10:42 +0000)]
power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself
Martin Kroeker [Mon, 25 Mar 2019 20:34:30 +0000 (21:34 +0100)]
Merge pull request #2069 from aixoss/aix-asm-change
AIX asm syntax changes needed for shared object creation
Ayappan P [Mon, 25 Mar 2019 13:23:25 +0000 (18:53 +0530)]
AIX asm syntax changes needed for shared object creation
Martin Kroeker [Tue, 19 Mar 2019 21:12:51 +0000 (22:12 +0100)]
Merge pull request #2064 from embray/cygwin/use-tls-thread-memory-cleanup
Fix for #2063
Erik M. Bray [Tue, 19 Mar 2019 09:22:02 +0000 (10:22 +0100)]
Also call CloseHandle on each thread, as well as on the event so as to not leak thread handles.
Erik M. Bray [Mon, 18 Mar 2019 19:32:48 +0000 (20:32 +0100)]
Fix for #2063: The DllMain used in Cygwin did not run the thread memory
pool cleanup upon THREAD_DETACH which is needed when compiled with
USE_TLS=1.
Martin Kroeker [Sat, 16 Mar 2019 10:57:23 +0000 (11:57 +0100)]
Merge pull request #2058 from xsacha/patch-3
Change 64-bit detection as explained in #2056
Martin Kroeker [Sat, 16 Mar 2019 10:56:51 +0000 (11:56 +0100)]
Merge pull request #2060 from embray/cygwin/readenv
Use POSIX getenv on Cygwin