platform/upstream/openblas.git
Martin Kroeker [Sat, 6 Jul 2019 12:29:47 +0000 (14:29 +0200)]
Utest needs CBLAS but not necessarily FORTRAN

Martin Kroeker [Wed, 3 Jul 2019 17:16:30 +0000 (19:16 +0200)]
Merge pull request #2162 from martin-frbg/pgi

Fixes for PGI compiler

Martin Kroeker [Mon, 1 Jul 2019 19:06:02 +0000 (21:06 +0200)]
Merge pull request #2172 from quickwritereader/develop

power9 cgemm/ctrmm. new sgemm 8x16

AbdelRauf [Tue, 18 Jun 2019 15:55:56 +0000 (15:55 +0000)]
cgemm/ctrmm power9

Martin Kroeker [Sun, 30 Jun 2019 21:29:02 +0000 (23:29 +0200)]
Merge pull request #2170 from pkubaj/patch-1

Fix build on PPC970 for FreeBSD

pkubaj [Fri, 28 Jun 2019 10:31:45 +0000 (10:31 +0000)]
Fix build for PPC970 on FreeBSD pt.2

FreeBSD needs those macros too.

pkubaj [Fri, 28 Jun 2019 10:29:44 +0000 (10:29 +0000)]
Fix build for PPC970 on FreeBSD pt. 1

FreeBSD needs DCBT_ARG=0 as well.

Martin Kroeker [Tue, 25 Jun 2019 10:56:33 +0000 (12:56 +0200)]
Merge pull request #2169 from pkubaj/develop

Fix build on FreeBSD/powerpc64.

Piotr Kubaj [Tue, 25 Jun 2019 08:58:56 +0000 (10:58 +0200)]
Fix build on FreeBSD/powerpc64.

Signed-off-by: Piotr Kubaj <pkubaj@anongoth.pl>

Martin Kroeker [Thu, 20 Jun 2019 17:56:01 +0000 (19:56 +0200)]
PGI compiler does not like -march=native

Martin Kroeker [Wed, 19 Jun 2019 12:38:01 +0000 (14:38 +0200)]
Merge pull request #2167 from kavanabhat/dtrmm_power8_segfault

Fix DTRMMKERNEL register save for power8 64-bit mode (Fix for #2166)

kavanabhat [Wed, 19 Jun 2019 09:57:14 +0000 (15:27 +0530)]
Update dtrmm_kernel_16x4_power8.S

AbdelRauf [Mon, 17 Jun 2019 15:33:38 +0000 (15:33 +0000)]
new sgemm 8x16

Martin Kroeker [Sun, 16 Jun 2019 16:35:43 +0000 (18:35 +0200)]
Fix mov syntax

Martin Kroeker [Sun, 16 Jun 2019 13:04:10 +0000 (15:04 +0200)]
Zero ecx with a mov instruction

PGI assembler does not like the initialization in the constraints.
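
The idea behind this change can be sketched as follows. This is an illustrative reconstruction, not the code from the commit: the helper name `cpuid_subleaf0` and its signature are hypothetical. Instead of asking the compiler to preload ECX with zero through an input constraint, the zeroing is written as an explicit `mov` inside the asm template, so only plain register constraints remain. The x86_64/GNU-syntax guard and the zero-filled fallback are assumptions added so the sketch compiles elsewhere.

```c
#include <stdint.h>

/* Hypothetical helper, not the OpenBLAS source.
 * Executes CPUID for `leaf` with subleaf 0.
 * Returns 1 if CPUID was executed, 0 on unsupported targets
 * (in which case all outputs are set to 0). */
int cpuid_subleaf0(uint32_t leaf, uint32_t *eax, uint32_t *ebx,
                   uint32_t *ecx, uint32_t *edx)
{
#if defined(__GNUC__) && defined(__x86_64__)
    __asm__ __volatile__(
        "mov $0, %%ecx\n\t"   /* zero ECX in the template itself */
        "cpuid"
        : "=a"(*eax), "=b"(*ebx), "=c"(*ecx), "=d"(*edx)
        : "0"(leaf));         /* leaf goes in via EAX (operand 0) */
    return 1;
#else
    *eax = *ebx = *ecx = *edx = 0;
    return 0;
#endif
}
```

Because ECX appears only as an output, the assembler never sees a constant bound to a register constraint, which is the construct the commit message says PGI rejects.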

Martin Kroeker [Fri, 14 Jun 2019 06:08:11 +0000 (08:08 +0200)]
Update Makefile.x86_64

Martin Kroeker [Thu, 13 Jun 2019 21:01:35 +0000 (23:01 +0200)]
Do not force gcc options on non-gcc compilers

fixes compile failure with pgi 18.10 as reported on OpenBLAS-users

Martin Kroeker [Mon, 10 Jun 2019 17:12:45 +0000 (19:12 +0200)]
Merge pull request #2159 from martin-frbg/issue2149

Avoid unintentional activation of TLS codepath via USE_TLS=0

Martin Kroeker [Mon, 10 Jun 2019 15:24:15 +0000 (17:24 +0200)]
Avoid unintentional activation of TLS code via USE_TLS=0

fixes #2149

Martin Kroeker [Mon, 10 Jun 2019 12:08:11 +0000 (14:08 +0200)]
Merge pull request #2158 from martin-frbg/issue2143

Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds

Martin Kroeker [Mon, 10 Jun 2019 07:50:13 +0000 (09:50 +0200)]
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds

from #2143, -march=native precludes use of more specific options like -march=skylake-avx512 in individual kernels, and defeats the purpose of dynamic arch anyway.
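
To illustrate why a global -march=native defeats the purpose of a dynamic-arch build, here is a minimal sketch of runtime kernel selection. It is not OpenBLAS code: `dot_kernel_fn`, `dot_generic`, `dot_avx512_placeholder` and `select_dot_kernel` are hypothetical names, and both kernels are plain C loops standing in for files that would each be compiled with their own, more specific -march flag. The GCC/Clang builtins `__builtin_cpu_init` and `__builtin_cpu_supports` are used for detection, on the assumption of an x86_64 target.

```c
#include <stddef.h>

typedef double (*dot_kernel_fn)(const double *x, const double *y, size_t n);

/* Placeholder for a baseline kernel built with conservative flags. */
static double dot_generic(const double *x, const double *y, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += x[i] * y[i];
    return sum;
}

/* Placeholder for a kernel that would live in its own file and be
 * compiled with a specific flag such as -march=skylake-avx512. */
static double dot_avx512_placeholder(const double *x, const double *y,
                                     size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += x[i] * y[i];
    return sum;
}

/* Pick a kernel based on the CPU the program is running on,
 * not the CPU it was built on. */
dot_kernel_fn select_dot_kernel(void)
{
#if defined(__GNUC__) && defined(__x86_64__)
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx512f"))
        return dot_avx512_placeholder;
#endif
    return dot_generic;
}
```

If every file, including the baseline kernel and the selector, were built with -march=native, the library could contain build-host instructions outside the guarded kernels and would no longer be safe to run on older CPUs.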

Martin Kroeker [Sun, 9 Jun 2019 10:19:08 +0000 (12:19 +0200)]
Merge pull request #2157 from martin-frbg/2154-2

Add gfortran workaround for potential ABI violation

Martin Kroeker [Sun, 9 Jun 2019 07:31:13 +0000 (09:31 +0200)]
Update fc.cmake

Martin Kroeker [Sat, 8 Jun 2019 21:17:03 +0000 (23:17 +0200)]
Add gfortran workaround for potential ABI violation

for #2154

Martin Kroeker [Fri, 7 Jun 2019 11:23:07 +0000 (13:23 +0200)]
Merge pull request #2148 from TiborGY/cpp_thread_test_2

Thread safety tester using C++11 threading (cleaned history)

Martin Kroeker [Thu, 6 Jun 2019 11:43:12 +0000 (13:43 +0200)]
Merge pull request #2156 from martin-frbg/issue2154

Add gfortran workaround for C->FORTRAN ABI violation

Martin Kroeker [Thu, 6 Jun 2019 08:24:16 +0000 (10:24 +0200)]
Add gfortran workaround for ABI violations

for #2154 (see gcc bug 90329)

Martin Kroeker [Thu, 6 Jun 2019 08:18:40 +0000 (10:18 +0200)]
Add gfortran workaround for ABI violations in LAPACKE

for #2154 (see gcc bug 90329)

Martin Kroeker [Thu, 6 Jun 2019 05:42:56 +0000 (07:42 +0200)]
Merge pull request #2153 from quickwritereader/develop

improved power9 zgemm,sgemm

AbdelRauf [Wed, 5 Jun 2019 20:50:50 +0000 (20:50 +0000)]
conflict resolve

AbdelRauf [Wed, 5 Jun 2019 10:30:57 +0000 (10:30 +0000)]
power9 zgemm ztrmm optimized

Martin Kroeker [Wed, 5 Jun 2019 18:27:45 +0000 (20:27 +0200)]
Merge pull request #2145 from martin-frbg/1912-3

Separate implementations of AMAX and IAMAX on arm

Martin Kroeker [Wed, 5 Jun 2019 18:27:05 +0000 (20:27 +0200)]
Merge pull request #2110 from pc2/cpu-detection

Fix detection of Skylake processors when using GCC

Michael Lass [Fri, 3 May 2019 19:22:27 +0000 (21:22 +0200)]
c_check: Unlink correct file

Michael Lass [Fri, 3 May 2019 19:07:14 +0000 (21:07 +0200)]
Fix detection of AVX512 capable compilers in getarch

21eda8b5 introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
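
The underlying behaviour can be shown with a small sketch. It is not the check from getarch.c: the function names are hypothetical. Macros such as `__AVX2__` and `__AVX512F__` describe the flags the current translation unit was compiled with, so a program built without -mavx2 or -march=native reports 0 even on an AVX512-capable host.

```c
/* Hypothetical helpers, not taken from getarch.c.
 * Each returns 1 only if this file was compiled with the
 * corresponding instruction set enabled on the command line. */
int compiled_with_avx2(void)
{
#ifdef __AVX2__
    return 1;
#else
    return 0;
#endif
}

int compiled_with_avx512f(void)
{
#ifdef __AVX512F__
    return 1;
#else
    return 0;
#endif
}
```

Built with default flags on x86_64, both functions would typically return 0. Adding -march=native on a suitable host changes the result, which is why the commit changes how getarch itself is built.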

AbdelRauf [Fri, 31 May 2019 22:48:16 +0000 (22:48 +0000)]
sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52

Martin Kroeker [Mon, 3 Jun 2019 09:01:33 +0000 (11:01 +0200)]
Document NO_AVX512

for #2151

TiborGY [Sat, 1 Jun 2019 19:36:41 +0000 (21:36 +0200)]
add c++ thread test option to Makefile.rule

TiborGY [Sat, 1 Jun 2019 19:32:52 +0000 (21:32 +0200)]
hook up c++ thread safety test (main Makefile)

TiborGY [Sat, 1 Jun 2019 19:30:06 +0000 (21:30 +0200)]
upload thread safety test folder

AbdelRauf [Thu, 23 May 2019 04:23:43 +0000 (04:23 +0000)]
improved zgemm power9 based on power8

Martin Kroeker [Thu, 30 May 2019 09:38:11 +0000 (11:38 +0200)]
Use generic kernels for complex (I)AMAX to support softfp

Martin Kroeker [Thu, 30 May 2019 09:25:43 +0000 (11:25 +0200)]
Ensure correct output for DAMAX with softfp

Martin Kroeker [Wed, 29 May 2019 13:02:51 +0000 (15:02 +0200)]
Separate implementations of AMAX and IAMAX on arm

As noted in #1912 and a comment on #1942, the combined implementation happens to "do the right thing" on hardfp, but cannot return both value and index on softfp, where they would have to share the return register.
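
The separation can be sketched in portable C. These are illustrative reference loops, not the arm kernels touched by the commit: the names `ref_damax` and `ref_idamax` are hypothetical, and the 1-based index follows the Fortran BLAS convention as an assumption. Each routine has exactly one result, so neither relies on a floating-point value and an integer index arriving in different return registers.

```c
#include <stddef.h>

/* Hypothetical reference routine: largest absolute value in x[0..n-1].
 * Returns 0.0 for an empty vector. */
double ref_damax(size_t n, const double *x)
{
    double best = 0.0;
    for (size_t i = 0; i < n; i++) {
        double a = x[i] < 0.0 ? -x[i] : x[i];
        if (a > best)
            best = a;
    }
    return best;
}

/* Hypothetical reference routine: 1-based position of the first
 * element with the largest absolute value, or 0 for an empty vector. */
size_t ref_idamax(size_t n, const double *x)
{
    if (n == 0)
        return 0;
    size_t best_i = 0;
    double best = x[0] < 0.0 ? -x[0] : x[0];
    for (size_t i = 1; i < n; i++) {
        double a = x[i] < 0.0 ? -x[i] : x[i];
        if (a > best) {
            best = a;
            best_i = i;
        }
    }
    return best_i + 1;
}
```

For the vector {1.0, -5.0, 3.0}, `ref_damax` yields 5.0 and `ref_idamax` yields 2.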

Martin Kroeker [Wed, 29 May 2019 12:09:10 +0000 (14:09 +0200)]
Merge pull request #2144 from xianyi/revert-2142-issue1912-2

Revert "Add softfp support in min/max kernels"

Martin Kroeker [Wed, 29 May 2019 12:07:17 +0000 (14:07 +0200)]
Revert "Add softfp support in min/max kernels"

Martin Kroeker [Tue, 28 May 2019 20:56:08 +0000 (22:56 +0200)]
Merge pull request #2142 from martin-frbg/issue1912-2

Add softfp support in min/max kernels

Martin Kroeker [Tue, 28 May 2019 18:50:40 +0000 (20:50 +0200)]
Merge pull request #2141 from martin-frbg/issue1912

Build and run utests independently of fortran

Martin Kroeker [Tue, 28 May 2019 18:34:22 +0000 (20:34 +0200)]
Add softfp support in min/max kernels

fix for #1912

Martin Kroeker [Sun, 26 May 2019 10:39:20 +0000 (12:39 +0200)]
Merge pull request #2140 from martin-frbg/pgi19

Do not try ancient PGI hacks with recent versions of that compiler

Martin Kroeker [Fri, 24 May 2019 11:02:23 +0000 (13:02 +0200)]
Build and run utests in any case, they do their own checks for fortran availability

Martin Kroeker [Wed, 22 May 2019 11:48:27 +0000 (13:48 +0200)]
Do not try ancient PGI hacks with recent versions of that compiler

should fix #2139

Martin Kroeker [Thu, 16 May 2019 10:08:16 +0000 (12:08 +0200)]
Merge pull request #2136 from martin-frbg/issue2126

Add option to allow combining USE_THREAD=0 with thread locking support

Martin Kroeker [Wed, 15 May 2019 21:40:06 +0000 (23:40 +0200)]
Merge pull request #2134 from tylerjereddy/skylake_regress_guard_may14

TST: add SkylakeX AVX512 CI test

Martin Kroeker [Wed, 15 May 2019 21:38:12 +0000 (23:38 +0200)]
Remove unrelated change

Martin Kroeker [Wed, 15 May 2019 21:36:17 +0000 (23:36 +0200)]
Add option USE_LOCKING but keep default settings intact

Martin Kroeker [Wed, 15 May 2019 21:21:20 +0000 (23:21 +0200)]
Add option USE_LOCKING for SMP-like locking in USE_THREAD=0 builds

Martin Kroeker [Wed, 15 May 2019 21:19:30 +0000 (23:19 +0200)]
Add option USE_LOCKING for single-threaded build with locking support

Martin Kroeker [Wed, 15 May 2019 21:18:43 +0000 (23:18 +0200)]
Add option USE_LOCKING for single-threaded build with locking support

for calling from concurrent threads

Tyler Reddy [Tue, 14 May 2019 18:32:23 +0000 (11:32 -0700)]
TST: add SkylakeX AVX512 CI test

* adapt the C-level reproducer code for some
recent SkylakeX AVX512 kernel issues, provided
by Isuru Fernando and modified by Martin Kroeker,
for usage in the utest suite

* add an Intel SDE SkylakeX emulation utest run to
the Azure CI matrix; a custom Docker build was required
because Ubuntu image provided by Azure does not support
AVX512VL instructions

Martin Kroeker [Tue, 14 May 2019 07:37:00 +0000 (09:37 +0200)]
Merge pull request #2130 from isuruf/drone

Drone CI for arm64 native builds

Isuru Fernando [Sun, 12 May 2019 20:25:45 +0000 (15:25 -0500)]
Fix typo

Isuru Fernando [Sun, 12 May 2019 20:14:46 +0000 (15:14 -0500)]
arm32 build

Isuru Fernando [Sun, 12 May 2019 20:09:53 +0000 (15:09 -0500)]
Remove qemu armv8 builds

Isuru Fernando [Sun, 12 May 2019 19:28:48 +0000 (14:28 -0500)]
See if ubuntu 19.04 fixes the ICE

Isuru Fernando [Sun, 12 May 2019 19:22:36 +0000 (14:22 -0500)]
parallel build

Isuru Fernando [Sun, 12 May 2019 19:17:12 +0000 (14:17 -0500)]
build without lapack on cmake

Isuru Fernando [Sun, 12 May 2019 19:09:29 +0000 (14:09 -0500)]
Add cmake builds and print options

Isuru Fernando [Sun, 12 May 2019 19:06:04 +0000 (14:06 -0500)]
Add a cmake build as well

Isuru Fernando [Sun, 12 May 2019 19:02:39 +0000 (14:02 -0500)]
no need of gcc in clang build

Isuru Fernando [Sun, 12 May 2019 18:56:59 +0000 (13:56 -0500)]
update yes

Isuru Fernando [Sun, 12 May 2019 18:55:38 +0000 (13:55 -0500)]
Fix typo

Isuru Fernando [Sun, 12 May 2019 18:55:04 +0000 (13:55 -0500)]
apt update

Isuru Fernando [Sun, 12 May 2019 18:53:58 +0000 (13:53 -0500)]
Switch to ubuntu and parallel jobs

Isuru Fernando [Sun, 12 May 2019 18:50:37 +0000 (13:50 -0500)]
gfortran->gcc-gfortran

Isuru Fernando [Sun, 12 May 2019 18:47:49 +0000 (13:47 -0500)]
Install gfortran and add a clang job

Isuru Fernando [Sun, 12 May 2019 18:44:15 +0000 (13:44 -0500)]
Install perl

Isuru Fernando [Sun, 12 May 2019 18:42:16 +0000 (13:42 -0500)]
Install gcc

Isuru Fernando [Sun, 12 May 2019 18:40:23 +0000 (13:40 -0500)]
remove sudo

Isuru Fernando [Sun, 12 May 2019 18:39:51 +0000 (13:39 -0500)]
install make

Isuru Fernando [Sun, 12 May 2019 18:35:07 +0000 (13:35 -0500)]
Test drone CI

Martin Kroeker [Sun, 12 May 2019 07:55:57 +0000 (09:55 +0200)]
Merge pull request #2129 from martin-frbg/armv8azure

Move ARMv8/gcc CI job from Travis to Azure

Martin Kroeker [Sat, 11 May 2019 20:37:06 +0000 (22:37 +0200)]
Update .travis.yml

Martin Kroeker [Sat, 11 May 2019 14:08:23 +0000 (16:08 +0200)]
Move ARMv8 gcc build from Travis to Azure

Martin Kroeker [Sat, 11 May 2019 14:07:30 +0000 (16:07 +0200)]
Move ARMv8 gcc build from Travis to Azure

Martin Kroeker [Thu, 9 May 2019 13:25:09 +0000 (15:25 +0200)]
Merge pull request #2127 from martin-frbg/issue2114_2

Add NO_AFFINITY to available CMAKE options on Linux, and set it to ON

Martin Kroeker [Thu, 9 May 2019 12:42:36 +0000 (14:42 +0200)]
Replace ISMIN and ISAMIN kernels on all x86_64 platforms (#2125)

* Mark iamax_sse.S as unsuitable for MIN due to issue #2116
* Use iamax.S rather than iamax_sse.S for ISMIN/ISAMIN on all x86_64 as workaround for #2116

Martin Kroeker [Thu, 9 May 2019 09:52:02 +0000 (11:52 +0200)]
Add NO_AFFINITY to available options on Linux, and set it to ON

to match the gmake default. Fixes second part of #2114

Martin Kroeker [Thu, 9 May 2019 06:57:37 +0000 (08:57 +0200)]
Merge pull request #2124 from tylerjereddy/manylinux1_azure

TST: Azure manylinux1 & clean-up

Martin Kroeker [Thu, 9 May 2019 06:10:19 +0000 (08:10 +0200)]
Merge pull request #2123 from tylerjereddy/azure_readme_badge

DOC: Add Azure CI status badge to README

Martin Kroeker [Thu, 9 May 2019 06:10:00 +0000 (08:10 +0200)]
Merge pull request #2120 from brada4/getrf-2113

Address redundant code concern #2113

Tyler Reddy [Thu, 9 May 2019 01:51:59 +0000 (18:51 -0700)]
TST: Azure manylinux1 & clean-up

* remove some of the steps & comments
from the original Azure yml template

* modify the trigger section to use
develop since OpenBLAS primarily uses
this branch; use the same batching
behavior as downstream projects NumPy/
SciPy

* remove Travis emulated ARMv6 gcc build
because this now happens in Azure

* use documented Ubuntu vmImage name for Azure
and add in a manylinux1 test run to the matrix

[skip appveyor]

Martin Kroeker [Wed, 8 May 2019 22:47:44 +0000 (00:47 +0200)]
Add ARMV6 build to azure CI setup (#2122)

using aytekinar's Alpine image and docker script from the Travis setup

[skip ci]

Tyler Reddy [Wed, 8 May 2019 22:14:01 +0000 (15:14 -0700)]
DOC: Add Azure CI status badge

Martin Kroeker [Wed, 8 May 2019 17:07:58 +0000 (19:07 +0200)]
Update azure-pipelines.yml

Martin Kroeker [Wed, 8 May 2019 13:25:43 +0000 (15:25 +0200)]
Update azure-pipelines.yml

Martin Kroeker [Wed, 8 May 2019 13:23:54 +0000 (15:23 +0200)]
Update azure-pipelines.yml

Martin Kroeker [Wed, 8 May 2019 12:14:22 +0000 (14:14 +0200)]
Update azure-pipelines.yml

Martin Kroeker [Wed, 8 May 2019 12:12:02 +0000 (14:12 +0200)]
Update azure-pipelines.yml

take out offending lines (although stolen from https://github.com/conda-forge/opencv-feedstock azure-pipelines file)

Martin Kroeker [Wed, 8 May 2019 11:58:59 +0000 (13:58 +0200)]
fix tabbing in azure commands