pkubaj [Fri, 28 Jun 2019 10:29:44 +0000 (10:29 +0000)]
Fix build for PPC970 on FreeBSD pt. 1
FreeBSD needs DCBT_ARG=0 as well.
Martin Kroeker [Tue, 25 Jun 2019 10:56:33 +0000 (12:56 +0200)]
Merge pull request #2169 from pkubaj/develop
Fix build on FreeBSD/powerpc64.
Piotr Kubaj [Tue, 25 Jun 2019 08:58:56 +0000 (10:58 +0200)]
Fix build on FreeBSD/powerpc64.
Signed-off-by: Piotr Kubaj <pkubaj@anongoth.pl>
Martin Kroeker [Wed, 19 Jun 2019 12:38:01 +0000 (14:38 +0200)]
Merge pull request #2167 from kavanabhat/dtrmm_power8_segfault
Fix DTRMMKERNEL register save for power8 64-bit mode (Fix for #2166)
kavanabhat [Wed, 19 Jun 2019 09:57:14 +0000 (15:27 +0530)]
Update dtrmm_kernel_16x4_power8.S
Martin Kroeker [Mon, 10 Jun 2019 17:12:45 +0000 (19:12 +0200)]
Merge pull request #2159 from martin-frbg/issue2149
Avoid unintentional activation of TLS codepath via USE_TLS=0
Martin Kroeker [Mon, 10 Jun 2019 15:24:15 +0000 (17:24 +0200)]
Avoid unintentional activation of TLS code via USE_TLS=0
fixes #2149
Martin Kroeker [Mon, 10 Jun 2019 12:08:11 +0000 (14:08 +0200)]
Merge pull request #2158 from martin-frbg/issue2143
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
Martin Kroeker [Mon, 10 Jun 2019 07:50:13 +0000 (09:50 +0200)]
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
from #2143, -march=native precludes use of more specific options like -march=skylake-avx512 in individual kernels, and defeats the purpose of dynamic arch anyway.
Martin Kroeker [Sun, 9 Jun 2019 10:19:08 +0000 (12:19 +0200)]
Merge pull request #2157 from martin-frbg/2154-2
Add gfortran workaround for potential ABI violation
Martin Kroeker [Sun, 9 Jun 2019 07:31:13 +0000 (09:31 +0200)]
Update fc.cmake
Martin Kroeker [Sat, 8 Jun 2019 21:17:03 +0000 (23:17 +0200)]
Add gfortran workaround for potential ABI violation
for #2154
Martin Kroeker [Fri, 7 Jun 2019 11:23:07 +0000 (13:23 +0200)]
Merge pull request #2148 from TiborGY/cpp_thread_test_2
Thread safety tester using C++11 threading (cleaned history)
Martin Kroeker [Thu, 6 Jun 2019 11:43:12 +0000 (13:43 +0200)]
Merge pull request #2156 from martin-frbg/issue2154
Add gfortran workaround for C->FORTRAN ABI violation
Martin Kroeker [Thu, 6 Jun 2019 08:24:16 +0000 (10:24 +0200)]
Add gfortran workaround for ABI violations
for #2154 (see gcc bug 90329)
Martin Kroeker [Thu, 6 Jun 2019 08:18:40 +0000 (10:18 +0200)]
Add gfortran workaround for ABI violations in LAPACKE
for #2154 (see gcc bug 90329)
Martin Kroeker [Thu, 6 Jun 2019 05:42:56 +0000 (07:42 +0200)]
Merge pull request #2153 from quickwritereader/develop
improved power9 zgemm,sgemm
AbdelRauf [Wed, 5 Jun 2019 20:50:50 +0000 (20:50 +0000)]
conflict resolve
AbdelRauf [Wed, 5 Jun 2019 10:30:57 +0000 (10:30 +0000)]
power9 zgemm ztrmm optimized
Martin Kroeker [Wed, 5 Jun 2019 18:27:45 +0000 (20:27 +0200)]
Merge pull request #2145 from martin-frbg/1912-3
Separate implementations of AMAX and IAMAX on arm
Martin Kroeker [Wed, 5 Jun 2019 18:27:05 +0000 (20:27 +0200)]
Merge pull request #2110 from pc2/cpu-detection
Fix detection of Skylake processors when using GCC
Michael Lass [Fri, 3 May 2019 19:22:27 +0000 (21:22 +0200)]
c_check: Unlink correct file
Michael Lass [Fri, 3 May 2019 19:07:14 +0000 (21:07 +0200)]
Fix detection of AVX512 capable compilers in getarch
21eda8b5 introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
AbdelRauf [Fri, 31 May 2019 22:48:16 +0000 (22:48 +0000)]
sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52
Martin Kroeker [Mon, 3 Jun 2019 09:01:33 +0000 (11:01 +0200)]
Document NO_AVX512
for #2151
TiborGY [Sat, 1 Jun 2019 19:36:41 +0000 (21:36 +0200)]
add c++ thread test option to Makefile.rule
TiborGY [Sat, 1 Jun 2019 19:32:52 +0000 (21:32 +0200)]
hook up c++ thread safety test (main Makefile)
TiborGY [Sat, 1 Jun 2019 19:30:06 +0000 (21:30 +0200)]
upload thread safety test folder
AbdelRauf [Thu, 23 May 2019 04:23:43 +0000 (04:23 +0000)]
improved zgemm power9 based on power8
Martin Kroeker [Thu, 30 May 2019 09:38:11 +0000 (11:38 +0200)]
Use generic kernels for complex (I)AMAX to support softfp
Martin Kroeker [Thu, 30 May 2019 09:25:43 +0000 (11:25 +0200)]
Ensure correct output for DAMAX with softfp
Martin Kroeker [Wed, 29 May 2019 13:02:51 +0000 (15:02 +0200)]
Separate implementations of AMAX and IAMAX on arm
As noted in #1912 and comment on #1942, the combined implementation happens to "do the right thing" on hardfp, but cannot return both value and index on softfp where they would have to share the return register
Martin Kroeker [Wed, 29 May 2019 12:09:10 +0000 (14:09 +0200)]
Merge pull request #2144 from xianyi/revert-2142-issue1912-2
Revert "Add softfp support in min/max kernels"
Martin Kroeker [Wed, 29 May 2019 12:07:17 +0000 (14:07 +0200)]
Revert "Add softfp support in min/max kernels"
Martin Kroeker [Tue, 28 May 2019 20:56:08 +0000 (22:56 +0200)]
Merge pull request #2142 from martin-frbg/issue1912-2
Add softfp support in min/max kernels
Martin Kroeker [Tue, 28 May 2019 18:50:40 +0000 (20:50 +0200)]
Merge pull request #2141 from martin-frbg/issue1912
Build and run utests independently of fortran
Martin Kroeker [Tue, 28 May 2019 18:34:22 +0000 (20:34 +0200)]
Add softfp support in min/max kernels
fix for #1912
Martin Kroeker [Sun, 26 May 2019 10:39:20 +0000 (12:39 +0200)]
Merge pull request #2140 from martin-frbg/pgi19
Do not try ancient PGI hacks with recent versions of that compiler
Martin Kroeker [Fri, 24 May 2019 11:02:23 +0000 (13:02 +0200)]
Build and run utests in any case, they do their own checks for fortran availability
Martin Kroeker [Wed, 22 May 2019 11:48:27 +0000 (13:48 +0200)]
Do not try ancient PGI hacks with recent versions of that compiler
should fix #2139
Martin Kroeker [Thu, 16 May 2019 10:08:16 +0000 (12:08 +0200)]
Merge pull request #2136 from martin-frbg/issue2126
Add option to allow combining USE_THREAD=0 with thread locking support
Martin Kroeker [Wed, 15 May 2019 21:40:06 +0000 (23:40 +0200)]
Merge pull request #2134 from tylerjereddy/skylake_regress_guard_may14
TST: add SkylakeX AVX512 CI test
Martin Kroeker [Wed, 15 May 2019 21:38:12 +0000 (23:38 +0200)]
Remove unrelated change
Martin Kroeker [Wed, 15 May 2019 21:36:17 +0000 (23:36 +0200)]
Add option USE_LOCKING but keep default settings intact
Martin Kroeker [Wed, 15 May 2019 21:21:20 +0000 (23:21 +0200)]
Add option USE_LOCKING for SMP-like locking in USE_THREAD=0 builds
Martin Kroeker [Wed, 15 May 2019 21:19:30 +0000 (23:19 +0200)]
Add option USE_LOCKING for single-threaded build with locking support
Martin Kroeker [Wed, 15 May 2019 21:18:43 +0000 (23:18 +0200)]
Add option USE_LOCKING for single-threaded build with locking support
for calling from concurrent threads
Tyler Reddy [Tue, 14 May 2019 18:32:23 +0000 (11:32 -0700)]
TST: add SkylakeX AVX512 CI test
* adapt the C-level reproducer code for some
recent SkylakeX AVX512 kernel issues, provided
by Isuru Fernando and modified by Martin Kroeker,
for usage in the utest suite
* add an Intel SDE SkylakeX emulation utest run to
the Azure CI matrix; a custom Docker build was required
because Ubuntu image provided by Azure does not support
AVX512VL instructions
Martin Kroeker [Tue, 14 May 2019 07:37:00 +0000 (09:37 +0200)]
Merge pull request #2130 from isuruf/drone
Drone CI for arm64 native builds
Isuru Fernando [Sun, 12 May 2019 20:25:45 +0000 (15:25 -0500)]
Fix typo
Isuru Fernando [Sun, 12 May 2019 20:14:46 +0000 (15:14 -0500)]
arm32 build
Isuru Fernando [Sun, 12 May 2019 20:09:53 +0000 (15:09 -0500)]
Remove qemu armv8 builds
Isuru Fernando [Sun, 12 May 2019 19:28:48 +0000 (14:28 -0500)]
See if ubuntu 19.04 fixes the ICE
Isuru Fernando [Sun, 12 May 2019 19:22:36 +0000 (14:22 -0500)]
parallel build
Isuru Fernando [Sun, 12 May 2019 19:17:12 +0000 (14:17 -0500)]
build without lapack on cmake
Isuru Fernando [Sun, 12 May 2019 19:09:29 +0000 (14:09 -0500)]
Add cmake builds and print options
Isuru Fernando [Sun, 12 May 2019 19:06:04 +0000 (14:06 -0500)]
Add a cmake build as well
Isuru Fernando [Sun, 12 May 2019 19:02:39 +0000 (14:02 -0500)]
no need of gcc in clang build
Isuru Fernando [Sun, 12 May 2019 18:56:59 +0000 (13:56 -0500)]
update yes
Isuru Fernando [Sun, 12 May 2019 18:55:38 +0000 (13:55 -0500)]
Fix typo
Isuru Fernando [Sun, 12 May 2019 18:55:04 +0000 (13:55 -0500)]
apt update
Isuru Fernando [Sun, 12 May 2019 18:53:58 +0000 (13:53 -0500)]
Switch to ubuntu and parallel jobs
Isuru Fernando [Sun, 12 May 2019 18:50:37 +0000 (13:50 -0500)]
gfortran->gcc-gfortran
Isuru Fernando [Sun, 12 May 2019 18:47:49 +0000 (13:47 -0500)]
Install gfortran and add a clang job
Isuru Fernando [Sun, 12 May 2019 18:44:15 +0000 (13:44 -0500)]
Install perl
Isuru Fernando [Sun, 12 May 2019 18:42:16 +0000 (13:42 -0500)]
Install gcc
Isuru Fernando [Sun, 12 May 2019 18:40:23 +0000 (13:40 -0500)]
remove sudo
Isuru Fernando [Sun, 12 May 2019 18:39:51 +0000 (13:39 -0500)]
install make
Isuru Fernando [Sun, 12 May 2019 18:35:07 +0000 (13:35 -0500)]
Test drone CI
Martin Kroeker [Sun, 12 May 2019 07:55:57 +0000 (09:55 +0200)]
Merge pull request #2129 from martin-frbg/armv8azure
Move ARMv8/gcc CI job from Travis to Azure
Martin Kroeker [Sat, 11 May 2019 20:37:06 +0000 (22:37 +0200)]
Update .travis.yml
Martin Kroeker [Sat, 11 May 2019 14:08:23 +0000 (16:08 +0200)]
Move ARMv8 gcc build from Travis to Azure
Martin Kroeker [Sat, 11 May 2019 14:07:30 +0000 (16:07 +0200)]
Move ARMv8 gcc build from Travis to Azure
Martin Kroeker [Thu, 9 May 2019 13:25:09 +0000 (15:25 +0200)]
Merge pull request #2127 from martin-frbg/issue2114_2
Add NO_AFFINITY to available CMAKE options on Linux, and set it to ON
Martin Kroeker [Thu, 9 May 2019 12:42:36 +0000 (14:42 +0200)]
Replace ISMIN and ISAMIN kernels on all x86_64 platforms (#2125)
* Mark iamax_sse.S as unsuitable for MIN due to issue #2116
* Use iamax.S rather than iamax_sse.S for ISMIN/ISAMIN on all x86_64 as workaround for #2116
Martin Kroeker [Thu, 9 May 2019 09:52:02 +0000 (11:52 +0200)]
Add NO_AFFINITY to available options on Linux, and set it to ON
to match the gmake default. Fixes second part of #2114
Martin Kroeker [Thu, 9 May 2019 06:57:37 +0000 (08:57 +0200)]
Merge pull request #2124 from tylerjereddy/manylinux1_azure
TST: Azure manylinux1 & clean-up
Martin Kroeker [Thu, 9 May 2019 06:10:19 +0000 (08:10 +0200)]
Merge pull request #2123 from tylerjereddy/azure_readme_badge
DOC: Add Azure CI status badge to README
Martin Kroeker [Thu, 9 May 2019 06:10:00 +0000 (08:10 +0200)]
Merge pull request #2120 from brada4/getrf-2113
Address redundant code concern #2113
Tyler Reddy [Thu, 9 May 2019 01:51:59 +0000 (18:51 -0700)]
TST: Azure manylinux1 & clean-up
* remove some of the steps & comments
from the original Azure yml template
* modify the trigger section to use
develop since OpenBLAS primarily uses
this branch; use the same batching
behavior as downstream projects NumPy/
SciPy
* remove Travis emulated ARMv6 gcc build
because this now happens in Azure
* use documented Ubuntu vmImage name for Azure
and add in a manylinux1 test run to the matrix
[skip appveyor]
Martin Kroeker [Wed, 8 May 2019 22:47:44 +0000 (00:47 +0200)]
Add ARMV6 build to azure CI setup (#2122)
using aytekinar's Alpine image and docker script from the Travis setup
[skip ci]
Tyler Reddy [Wed, 8 May 2019 22:14:01 +0000 (15:14 -0700)]
DOC: Add Azure CI status badge
Martin Kroeker [Wed, 8 May 2019 17:07:58 +0000 (19:07 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 13:25:43 +0000 (15:25 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 13:23:54 +0000 (15:23 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 12:14:22 +0000 (14:14 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Wed, 8 May 2019 12:12:02 +0000 (14:12 +0200)]
Update azure-pipelines.yml
take out offending lines (although stolen from https://github.com/conda-forge/opencv-feedstock azure-pipelines fiie)
Martin Kroeker [Wed, 8 May 2019 11:58:59 +0000 (13:58 +0200)]
fix tabbing in azure commands
Martin Kroeker [Wed, 8 May 2019 11:52:22 +0000 (13:52 +0200)]
first try migrating one of the arm builds from travis
Martin Kroeker [Wed, 8 May 2019 11:31:46 +0000 (13:31 +0200)]
Merge pull request #2121 from tylerjereddy/ppc64le-travis
TST: add native POWER8 to CI
Martin Kroeker [Wed, 8 May 2019 10:20:00 +0000 (12:20 +0200)]
Update link to IBM MASS library, update cpu support status
Martin Kroeker [Wed, 8 May 2019 09:41:17 +0000 (11:41 +0200)]
Merge pull request #2118 from Diazonium/develop
Change two http links to https
Tyler Reddy [Tue, 7 May 2019 23:06:42 +0000 (16:06 -0700)]
TST: add native POWER8 to CI
* add native POWER8 testing to
Travis CI matrix with ppc64le
os entry
Zhang Xianyi [Wed, 8 May 2019 01:58:01 +0000 (09:58 +0800)]
Set up CI with Azure Pipelines
[skip ci]
Andrew [Tue, 7 May 2019 20:46:54 +0000 (23:46 +0300)]
remove redundant code #2113
Martin Kroeker [Tue, 7 May 2019 16:18:16 +0000 (18:18 +0200)]
Merge pull request #2117 from martin-frbg/issue2114
Fix errors in cpu affinity setup with glibc 2.6
Diazonium [Tue, 7 May 2019 12:55:20 +0000 (14:55 +0200)]
Change two http links to https
Closes #2109
Martin Kroeker [Tue, 7 May 2019 11:34:52 +0000 (13:34 +0200)]
Fix errors in cpu enumeration with glibc 2.6
for #2114
Andrew [Tue, 7 May 2019 10:15:08 +0000 (13:15 +0300)]
init
Martin Kroeker [Sun, 5 May 2019 16:08:49 +0000 (18:08 +0200)]
Merge pull request #2111 from martin-frbg/issue1955
Disable the SkyLakeX DGEMMIxCOPY kernels as well