platform/upstream/openblas.git
2 years agoMerge pull request #3348 from guowangy/skylakex-sgemv_t-fix
Martin Kroeker [Wed, 25 Aug 2021 20:43:45 +0000 (22:43 +0200)]
Merge pull request #3348 from guowangy/skylakex-sgemv_t-fix

skylakex sgemv_t kernel fix

2 years agoMerge pull request #3345 from nsait-linaro/windows_on_arm64
Martin Kroeker [Wed, 25 Aug 2021 13:49:55 +0000 (15:49 +0200)]
Merge pull request #3345 from nsait-linaro/windows_on_arm64

Add support for windows/arm64 targets with clang

2 years agosgemv: skylakex: fix build warning
Wangyang Guo [Wed, 25 Aug 2021 07:13:00 +0000 (07:13 +0000)]
sgemv: skylakex: fix build warning

2 years agosgemv: skylakex: bug fix for sgemv_t kernel in corner case
Wangyang Guo [Wed, 25 Aug 2021 07:07:27 +0000 (07:07 +0000)]
sgemv: skylakex: bug fix for sgemv_t kernel in corner case

2 years agoFix ctest.h to build using clang on windows
Niyas Sait [Mon, 16 Aug 2021 10:25:07 +0000 (11:25 +0100)]
Fix ctest.h to build using clang on windows

2 years agoadd support for building on windows/arm64 target
Niyas Sait [Mon, 16 Aug 2021 10:22:51 +0000 (11:22 +0100)]
add support for building on windows/arm64 target

2 years agoAdd more OSX build jobs to Azure CI (#3338)
Martin Kroeker [Sat, 14 Aug 2021 22:17:23 +0000 (00:17 +0200)]
Add more OSX build jobs to Azure CI (#3338)

* Add OSX build job with Homebrew OpenMP in a CMAKE build
* Check install step on OSX/gcc to make sure all include files are generated and installed as intended
* Add mixed clang/gfortran build with cmake on OSX
* move IOS ARMV7/ARMV8 crossbuilds from travis to azure

2 years agoMerge pull request #3344 from gxw-loongson/develop
Martin Kroeker [Thu, 12 Aug 2021 13:16:46 +0000 (15:16 +0200)]
Merge pull request #3344 from gxw-loongson/develop

Delete the macro instruction "li" and use "li.d" instead

2 years agoDelete the macro instruction "li" and use "li.d" instead
gxw [Tue, 10 Aug 2021 08:42:57 +0000 (16:42 +0800)]
Delete the macro instruction "li" and use "li.d" instead

Change-Id: Icff7981e2eb7df29ba5af1f8eb5be8443c67450f

2 years agoMerge pull request #3343 from cianciosa/develop
Martin Kroeker [Wed, 11 Aug 2021 23:28:18 +0000 (01:28 +0200)]
Merge pull request #3343 from cianciosa/develop

Fix undefined behavior checking the size of ARGC

2 years agoFix a small syntax error. A ( was accidently deleted.
cianciosa [Wed, 11 Aug 2021 16:08:34 +0000 (12:08 -0400)]
Fix a small syntax error. A ( was accidently deleted.

2 years agoCheck the total number of arguments passed insead of if the ARGV# is defined. This...
cianciosa [Wed, 11 Aug 2021 16:00:07 +0000 (12:00 -0400)]
Check the total number of arguments passed insead of if the ARGV# is defined. This fixes a problem when compling openblas as a subproject of another code.

2 years agoMerge pull request #3341 from RajalakshmiSR/dasump10
Martin Kroeker [Wed, 11 Aug 2021 07:39:10 +0000 (09:39 +0200)]
Merge pull request #3341 from RajalakshmiSR/dasump10

POWER10: Improving dasum performance

2 years agoPOWER10: Improving dasum performance
Rajalakshmi Srinivasaraghavan [Wed, 11 Aug 2021 03:06:04 +0000 (22:06 -0500)]
POWER10: Improving dasum performance

Unrolling a loop in dasum micro code to help in improving
POWER10 performance.

2 years agoMerge pull request #3336 from martin-frbg/traviscom
Zhang Xianyi [Thu, 5 Aug 2021 11:13:19 +0000 (19:13 +0800)]
Merge pull request #3336 from martin-frbg/traviscom

Disable all x86 jobs on Travis

2 years agoDisable all x86 jobs
Martin Kroeker [Thu, 5 Aug 2021 09:08:18 +0000 (11:08 +0200)]
Disable all x86 jobs

2 years agoMerge pull request #3332 from martin-frbg/travisbadge
Martin Kroeker [Thu, 5 Aug 2021 07:36:59 +0000 (09:36 +0200)]
Merge pull request #3332 from martin-frbg/travisbadge

Update Travis badge in README

2 years agoMerge pull request #3334 from Guobing-Chen/BF16_gemm_full_kernel
Martin Kroeker [Thu, 5 Aug 2021 06:01:13 +0000 (08:01 +0200)]
Merge pull request #3334 from Guobing-Chen/BF16_gemm_full_kernel

Add all SBGEMM kernels for IA AVX512-BF16 based platforms

2 years agoAdd all SBGEMM kernels for IA AVX512-BF16 based platforms
Chen, Guobing [Thu, 5 Aug 2021 03:11:14 +0000 (11:11 +0800)]
Add all SBGEMM kernels for IA AVX512-BF16 based platforms

Added all SBGEMM kernels including NN/NT/TN/TT for both ColMajor and
RowMajor, based on AVX512-BF16 ISA set on IA.

Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
2 years agoUpdate Travis badge in README
Martin Kroeker [Tue, 3 Aug 2021 08:45:45 +0000 (10:45 +0200)]
Update Travis badge in README

2 years agoMerge pull request #3330 from xianyi/issue3321
Martin Kroeker [Mon, 2 Aug 2021 20:36:05 +0000 (22:36 +0200)]
Merge pull request #3330 from xianyi/issue3321

Improve the "tried to allocate too many buffers" error message

2 years agoActually add the message to the TLS section
Martin Kroeker [Mon, 2 Aug 2021 12:50:14 +0000 (14:50 +0200)]
Actually add the message to the TLS section

2 years agoRebase source and edit TLS version of the message as well
Martin Kroeker [Mon, 2 Aug 2021 12:19:16 +0000 (14:19 +0200)]
Rebase source and edit TLS version of the message as well

2 years agoMerge pull request #3331 from gxw-loongson/develop
Martin Kroeker [Mon, 2 Aug 2021 05:21:46 +0000 (07:21 +0200)]
Merge pull request #3331 from gxw-loongson/develop

Fixed typos about LOONGARCH64

2 years agoAdd cmake support for LOONGARCH64
gxw [Mon, 2 Aug 2021 02:00:41 +0000 (10:00 +0800)]
Add cmake support for LOONGARCH64

2 years agoImprove the "tried to allocate too many buffers" error message
Martin Kroeker [Sat, 31 Jul 2021 15:23:40 +0000 (17:23 +0200)]
Improve the "tried to allocate too many buffers" error message

3 years agoMerge pull request #3329 from martin-frbg/issue3272
Martin Kroeker [Fri, 30 Jul 2021 18:39:38 +0000 (20:39 +0200)]
Merge pull request #3329 from martin-frbg/issue3272

Work around gcc11+ miscompiling C/ZBLAS3 tests at -O3

3 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:46:19 +0000 (14:46 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

3 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:44:54 +0000 (14:44 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

3 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:21:08 +0000 (14:21 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

3 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:20:12 +0000 (14:20 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

3 years agoFixed typos about LOONGARCH64
gxw [Fri, 30 Jul 2021 10:11:12 +0000 (18:11 +0800)]
Fixed typos about LOONGARCH64

3 years agoMerge pull request #3327 from h-vetinari/lapack597_redux
Martin Kroeker [Wed, 28 Jul 2021 21:04:02 +0000 (23:04 +0200)]
Merge pull request #3327 from h-vetinari/lapack597_redux

Complete the carry of lapack PR 597

3 years agoMerge pull request #3326 from wattoc/develop
Martin Kroeker [Wed, 28 Jul 2021 21:03:37 +0000 (23:03 +0200)]
Merge pull request #3326 from wattoc/develop

Include Haiku in processor count checks

3 years agoMerge pull request #3325 from gxw-loongson/develop
Martin Kroeker [Wed, 28 Jul 2021 21:03:15 +0000 (23:03 +0200)]
Merge pull request #3325 from gxw-loongson/develop

Add support for LOONGARCH64

3 years agoAlso make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS
Isuru Fernando [Thu, 15 Jul 2021 09:54:33 +0000 (04:54 -0500)]
Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS

3 years agoInclude Haiku in processor count checks
Craig Watson [Tue, 27 Jul 2021 09:00:30 +0000 (09:00 +0000)]
Include Haiku in processor count checks

3 years agoAdd support for LOONGARCH64
gxw [Mon, 26 Jul 2021 07:44:54 +0000 (15:44 +0800)]
Add support for LOONGARCH64

3 years agoMerge pull request #3323 from martin-frbg/issue3322
Martin Kroeker [Fri, 23 Jul 2021 20:46:02 +0000 (22:46 +0200)]
Merge pull request #3323 from martin-frbg/issue3322

GCC did not support -mtune for ARM64 before 5.1

3 years agoMerge pull request #3314 from martin-frbg/lapack597
Martin Kroeker [Fri, 23 Jul 2021 13:30:27 +0000 (15:30 +0200)]
Merge pull request #3314 from martin-frbg/lapack597

Fix LAPACK testsuite compatibility with libomp (Reference-LAPACK PR 597)

3 years agoGCC did not support -mtune for ARM64 before 5.1
Martin Kroeker [Fri, 23 Jul 2021 11:42:52 +0000 (13:42 +0200)]
GCC did not support -mtune for ARM64 before 5.1

3 years agoMerge pull request #3320 from martin-frbg/issue3318
Martin Kroeker [Thu, 22 Jul 2021 19:28:50 +0000 (21:28 +0200)]
Merge pull request #3320 from martin-frbg/issue3318

Empirical workaround for numpy SVD NaN problem from issue 3318

3 years agoFix regex to match kernels suffixed with cpuname too
Martin Kroeker [Thu, 22 Jul 2021 15:24:15 +0000 (17:24 +0200)]
Fix regex to match kernels suffixed with cpuname too

3 years agoRework setting of -mfma to only apply it where necessary
Martin Kroeker [Thu, 22 Jul 2021 10:00:03 +0000 (12:00 +0200)]
Rework setting of -mfma to only apply it where necessary

3 years agoEmpirical workaround for numpy SVD NaN problem from issue 3318
Martin Kroeker [Sun, 18 Jul 2021 20:19:19 +0000 (22:19 +0200)]
Empirical workaround for numpy SVD NaN problem from issue 3318

3 years agoUpdate version to 0.3.17.dev
Martin Kroeker [Thu, 15 Jul 2021 13:00:01 +0000 (15:00 +0200)]
Update version to 0.3.17.dev

3 years agoUpdate version to 0.3.17.dev
Martin Kroeker [Thu, 15 Jul 2021 12:59:15 +0000 (14:59 +0200)]
Update version to 0.3.17.dev

3 years agoMerge pull request #3317 from xianyi/release-0.3.0
Martin Kroeker [Thu, 15 Jul 2021 12:58:20 +0000 (14:58 +0200)]
Merge pull request #3317 from xianyi/release-0.3.0

merge 0.3.17 back into develop to copy tag

3 years agoUpdate version to 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:52:54 +0000 (14:52 +0200)]
Update version to 0.3.17

3 years agoMerge pull request #3316 from xianyi/develop
Martin Kroeker [Thu, 15 Jul 2021 12:51:50 +0000 (14:51 +0200)]
Merge pull request #3316 from xianyi/develop

Merge develop for bugfix release 0.3.17

3 years agoMerge branch 'release-0.3.0' into develop
Martin Kroeker [Thu, 15 Jul 2021 12:50:20 +0000 (14:50 +0200)]
Merge branch 'release-0.3.0' into develop

3 years agoUpdate version to 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:46:24 +0000 (14:46 +0200)]
Update version to 0.3.17

3 years agoMerge pull request #3315 from martin-frbg/changelog0317
Martin Kroeker [Thu, 15 Jul 2021 12:44:59 +0000 (14:44 +0200)]
Merge pull request #3315 from martin-frbg/changelog0317

Update Changelog for 0.3.17

3 years agoUpdate Changelog for 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:44:17 +0000 (14:44 +0200)]
Update Changelog for 0.3.17

3 years agoMerge pull request #3313 from martin-frbg/3266-2
Martin Kroeker [Thu, 15 Jul 2021 06:00:57 +0000 (08:00 +0200)]
Merge pull request #3313 from martin-frbg/3266-2

Remove BLASLONG casts from SPARC parameter entries

3 years agoMerge pull request #3312 from martin-frbg/revert_3260
Martin Kroeker [Thu, 15 Jul 2021 06:00:34 +0000 (08:00 +0200)]
Merge pull request #3312 from martin-frbg/revert_3260

Temporarily disable the SkylakeX sgemv_t microkernel

3 years agoMerge pull request #3311 from martin-frbg/issue3309
Martin Kroeker [Thu, 15 Jul 2021 05:58:47 +0000 (07:58 +0200)]
Merge pull request #3311 from martin-frbg/issue3309

Revert PR #3250 (shortcut without buffer allocation) as it is unsafe …

3 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:42:43 +0000 (22:42 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

3 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:41:45 +0000 (22:41 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

3 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:40:29 +0000 (22:40 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

3 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:39:20 +0000 (22:39 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

3 years agoRemove BLASLONG casts from SPARC entries
Martin Kroeker [Wed, 14 Jul 2021 19:09:36 +0000 (21:09 +0200)]
Remove BLASLONG casts from SPARC entries

in response to https://github.com/xianyi/OpenBLAS/pull/3266#issuecomment-878637675

3 years agoTemporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures
Martin Kroeker [Wed, 14 Jul 2021 18:50:14 +0000 (20:50 +0200)]
Temporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures

3 years agoRevert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64
Martin Kroeker [Wed, 14 Jul 2021 18:32:57 +0000 (20:32 +0200)]
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64

3 years agoMerge pull request #3310 from jeromerobert/develop
Martin Kroeker [Wed, 14 Jul 2021 14:06:13 +0000 (16:06 +0200)]
Merge pull request #3310 from jeromerobert/develop

Avoid redefinition of _GNU_SOURCE

3 years agoAvoid redefinition of _GNU_SOURCE
Jerome Robert [Wed, 14 Jul 2021 10:20:57 +0000 (12:20 +0200)]
Avoid redefinition of _GNU_SOURCE

* _GNU_SOURCE may have been set by the application and redefinition
  trigger warnings or error with -Werror
* Fix for 220f6a1c5

3 years agoMerge pull request #3306 from jonaszhou1/develop
Martin Kroeker [Mon, 12 Jul 2021 14:30:10 +0000 (16:30 +0200)]
Merge pull request #3306 from jonaszhou1/develop

Add cpu detection support for Zhaoxin processors

3 years agoAdd cpu detection support for Zhaoxin processors
JonasZhou [Mon, 12 Jul 2021 05:43:45 +0000 (13:43 +0800)]
Add cpu detection support for Zhaoxin processors

Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
3 years agoUpdate version to 0.3.16.dev
Martin Kroeker [Sun, 11 Jul 2021 22:16:59 +0000 (00:16 +0200)]
Update version to 0.3.16.dev

3 years agoUpdate version to 0.3.16.dev
Martin Kroeker [Sun, 11 Jul 2021 22:16:29 +0000 (00:16 +0200)]
Update version to 0.3.16.dev

3 years agoMerge pull request #3305 from xianyi/release-0.3.0
Martin Kroeker [Sun, 11 Jul 2021 22:15:52 +0000 (00:15 +0200)]
Merge pull request #3305 from xianyi/release-0.3.0

Merge release branch back into develop to copy tag

3 years agoMerge pull request #3304 from xianyi/develop
Martin Kroeker [Sun, 11 Jul 2021 22:12:52 +0000 (00:12 +0200)]
Merge pull request #3304 from xianyi/develop

Merge develop int0 0.3.0 for release 0.3.16

3 years agoMerge branch 'release-0.3.0' into develop
Martin Kroeker [Sun, 11 Jul 2021 22:12:25 +0000 (00:12 +0200)]
Merge branch 'release-0.3.0' into develop

3 years agoUpdate version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:09:35 +0000 (00:09 +0200)]
Update version to 0.3.16

3 years agoUpdate version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:08:55 +0000 (00:08 +0200)]
Update version to 0.3.16

3 years agoMerge pull request #3303 from martin-frbg/changelog16
Martin Kroeker [Sun, 11 Jul 2021 21:50:02 +0000 (23:50 +0200)]
Merge pull request #3303 from martin-frbg/changelog16

Update Changelog for 0.3.16

3 years agoUpdate Changelog for 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 21:48:39 +0000 (23:48 +0200)]
Update Changelog for 0.3.16

3 years agoMerge pull request #3300 from martin-frbg/AzureAlpine
Martin Kroeker [Sun, 11 Jul 2021 20:50:20 +0000 (22:50 +0200)]
Merge pull request #3300 from martin-frbg/AzureAlpine

Move Alpine Linux build job from Travis to Azure

3 years agoMerge pull request #3302 from martin-frbg/small_cleanup
Martin Kroeker [Sun, 11 Jul 2021 20:26:41 +0000 (22:26 +0200)]
Merge pull request #3302 from martin-frbg/small_cleanup

Clean up some warnings

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:29:17 +0000 (18:29 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:08:30 +0000 (18:08 +0200)]
Update azure-pipelines.yml

3 years agoMerge pull request #3297 from outerpassage/develop
Martin Kroeker [Sun, 11 Jul 2021 15:10:20 +0000 (17:10 +0200)]
Merge pull request #3297 from outerpassage/develop

fix compilation with musl libc

3 years agoClean up some warnings
Martin Kroeker [Sun, 11 Jul 2021 14:00:29 +0000 (16:00 +0200)]
Clean up some warnings

3 years agoRevert addition of test_install
Martin Kroeker [Sun, 11 Jul 2021 12:38:49 +0000 (14:38 +0200)]
Revert addition of test_install

3 years agoRevert addition of test_install target
Martin Kroeker [Sun, 11 Jul 2021 12:37:38 +0000 (14:37 +0200)]
Revert addition of test_install target

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:54:02 +0000 (11:54 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:29:52 +0000 (11:29 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 07:38:48 +0000 (09:38 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 06:24:20 +0000 (08:24 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 05:25:07 +0000 (07:25 +0200)]
Update azure-pipelines.yml

3 years agoMerge pull request #3301 from martin-frbg/syr2bench
Martin Kroeker [Sun, 11 Jul 2021 05:20:19 +0000 (07:20 +0200)]
Merge pull request #3301 from martin-frbg/syr2bench

Handle OPENBLAS_LOOPS in SYR2 benchmark

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 21:21:58 +0000 (23:21 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 20:41:49 +0000 (22:41 +0200)]
Update azure-pipelines.yml

3 years agoHandle OPENBLAS_LOOPS in SYR2 benchmark
Martin Kroeker [Sat, 10 Jul 2021 19:27:53 +0000 (21:27 +0200)]
Handle OPENBLAS_LOOPS in SYR2 benchmark

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:57:21 +0000 (20:57 +0200)]
Update azure-pipelines.yml

3 years agoMerge pull request #3299 from martin-frbg/issue3298
Martin Kroeker [Sat, 10 Jul 2021 18:48:53 +0000 (20:48 +0200)]
Merge pull request #3299 from martin-frbg/issue3298

Fix copy-paste error in LIBCORE assignment for Tiger Lake

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:12:33 +0000 (20:12 +0200)]
Update azure-pipelines.yml

3 years agoadd sudo for install in Alpine
Martin Kroeker [Sat, 10 Jul 2021 17:52:04 +0000 (19:52 +0200)]
add sudo for install in Alpine

3 years agofix typo
Martin Kroeker [Sat, 10 Jul 2021 17:03:42 +0000 (19:03 +0200)]
fix typo

3 years agoMove Alpine Linux build job from Travis to Azure
Martin Kroeker [Sat, 10 Jul 2021 16:52:44 +0000 (18:52 +0200)]
Move Alpine Linux build job from Travis to Azure