Martin Kroeker [Wed, 11 Aug 2021 07:39:10 +0000 (09:39 +0200)]
Merge pull request #3341 from RajalakshmiSR/dasump10
POWER10: Improving dasum performance
Rajalakshmi Srinivasaraghavan [Wed, 11 Aug 2021 03:06:04 +0000 (22:06 -0500)]
POWER10: Improving dasum performance
Unrolling a loop in dasum micro code to improve
POWER10 performance.
Zhang Xianyi [Thu, 5 Aug 2021 11:13:19 +0000 (19:13 +0800)]
Merge pull request #3336 from martin-frbg/traviscom
Disable all x86 jobs on Travis
Martin Kroeker [Thu, 5 Aug 2021 09:08:18 +0000 (11:08 +0200)]
Disable all x86 jobs
Martin Kroeker [Thu, 5 Aug 2021 07:36:59 +0000 (09:36 +0200)]
Merge pull request #3332 from martin-frbg/travisbadge
Update Travis badge in README
Martin Kroeker [Thu, 5 Aug 2021 06:01:13 +0000 (08:01 +0200)]
Merge pull request #3334 from Guobing-Chen/BF16_gemm_full_kernel
Add all SBGEMM kernels for IA AVX512-BF16 based platforms
Chen, Guobing [Thu, 5 Aug 2021 03:11:14 +0000 (11:11 +0800)]
Add all SBGEMM kernels for IA AVX512-BF16 based platforms
Added all SBGEMM kernels including NN/NT/TN/TT for both ColMajor and
RowMajor, based on AVX512-BF16 ISA set on IA.
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
Martin Kroeker [Tue, 3 Aug 2021 08:45:45 +0000 (10:45 +0200)]
Update Travis badge in README
Martin Kroeker [Mon, 2 Aug 2021 20:36:05 +0000 (22:36 +0200)]
Merge pull request #3330 from xianyi/issue3321
Improve the "tried to allocate too many buffers" error message
Martin Kroeker [Mon, 2 Aug 2021 12:50:14 +0000 (14:50 +0200)]
Actually add the message to the TLS section
Martin Kroeker [Mon, 2 Aug 2021 12:19:16 +0000 (14:19 +0200)]
Rebase source and edit TLS version of the message as well
Martin Kroeker [Mon, 2 Aug 2021 05:21:46 +0000 (07:21 +0200)]
Merge pull request #3331 from gxw-loongson/develop
Fixed typos about LOONGARCH64
gxw [Mon, 2 Aug 2021 02:00:41 +0000 (10:00 +0800)]
Add cmake support for LOONGARCH64
Martin Kroeker [Sat, 31 Jul 2021 15:23:40 +0000 (17:23 +0200)]
Improve the "tried to allocate too many buffers" error message
Martin Kroeker [Fri, 30 Jul 2021 18:39:38 +0000 (20:39 +0200)]
Merge pull request #3329 from martin-frbg/issue3272
Work around gcc11+ miscompiling C/ZBLAS3 tests at -O3
Martin Kroeker [Fri, 30 Jul 2021 12:46:19 +0000 (14:46 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:44:54 +0000 (14:44 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:21:08 +0000 (14:21 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:20:12 +0000 (14:20 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
gxw [Fri, 30 Jul 2021 10:11:12 +0000 (18:11 +0800)]
Fixed typos about LOONGARCH64
Martin Kroeker [Wed, 28 Jul 2021 21:04:02 +0000 (23:04 +0200)]
Merge pull request #3327 from h-vetinari/lapack597_redux
Complete the carry of lapack PR 597
Martin Kroeker [Wed, 28 Jul 2021 21:03:37 +0000 (23:03 +0200)]
Merge pull request #3326 from wattoc/develop
Include Haiku in processor count checks
Martin Kroeker [Wed, 28 Jul 2021 21:03:15 +0000 (23:03 +0200)]
Merge pull request #3325 from gxw-loongson/develop
Add support for LOONGARCH64
Isuru Fernando [Thu, 15 Jul 2021 09:54:33 +0000 (04:54 -0500)]
Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS
Craig Watson [Tue, 27 Jul 2021 09:00:30 +0000 (09:00 +0000)]
Include Haiku in processor count checks
gxw [Mon, 26 Jul 2021 07:44:54 +0000 (15:44 +0800)]
Add support for LOONGARCH64
Martin Kroeker [Fri, 23 Jul 2021 20:46:02 +0000 (22:46 +0200)]
Merge pull request #3323 from martin-frbg/issue3322
GCC did not support -mtune for ARM64 before 5.1
Martin Kroeker [Fri, 23 Jul 2021 13:30:27 +0000 (15:30 +0200)]
Merge pull request #3314 from martin-frbg/lapack597
Fix LAPACK testsuite compatibility with libomp (Reference-LAPACK PR 597)
Martin Kroeker [Fri, 23 Jul 2021 11:42:52 +0000 (13:42 +0200)]
GCC did not support -mtune for ARM64 before 5.1
Martin Kroeker [Thu, 22 Jul 2021 19:28:50 +0000 (21:28 +0200)]
Merge pull request #3320 from martin-frbg/issue3318
Empirical workaround for numpy SVD NaN problem from issue 3318
Martin Kroeker [Thu, 22 Jul 2021 15:24:15 +0000 (17:24 +0200)]
Fix regex to match kernels suffixed with cpuname too
Martin Kroeker [Thu, 22 Jul 2021 10:00:03 +0000 (12:00 +0200)]
Rework setting of -mfma to only apply it where necessary
Martin Kroeker [Sun, 18 Jul 2021 20:19:19 +0000 (22:19 +0200)]
Empirical workaround for numpy SVD NaN problem from issue 3318
Martin Kroeker [Thu, 15 Jul 2021 13:00:01 +0000 (15:00 +0200)]
Update version to 0.3.17.dev
Martin Kroeker [Thu, 15 Jul 2021 12:59:15 +0000 (14:59 +0200)]
Update version to 0.3.17.dev
Martin Kroeker [Thu, 15 Jul 2021 12:58:20 +0000 (14:58 +0200)]
Merge pull request #3317 from xianyi/release-0.3.0
merge 0.3.17 back into develop to copy tag
Martin Kroeker [Thu, 15 Jul 2021 12:52:54 +0000 (14:52 +0200)]
Update version to 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:51:50 +0000 (14:51 +0200)]
Merge pull request #3316 from xianyi/develop
Merge develop for bugfix release 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:50:20 +0000 (14:50 +0200)]
Merge branch 'release-0.3.0' into develop
Martin Kroeker [Thu, 15 Jul 2021 12:46:24 +0000 (14:46 +0200)]
Update version to 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:44:59 +0000 (14:44 +0200)]
Merge pull request #3315 from martin-frbg/changelog0317
Update Changelog for 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:44:17 +0000 (14:44 +0200)]
Update Changelog for 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 06:00:57 +0000 (08:00 +0200)]
Merge pull request #3313 from martin-frbg/3266-2
Remove BLASLONG casts from SPARC parameter entries
Martin Kroeker [Thu, 15 Jul 2021 06:00:34 +0000 (08:00 +0200)]
Merge pull request #3312 from martin-frbg/revert_3260
Temporarily disable the SkylakeX sgemv_t microkernel
Martin Kroeker [Thu, 15 Jul 2021 05:58:47 +0000 (07:58 +0200)]
Merge pull request #3311 from martin-frbg/issue3309
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe …
Martin Kroeker [Wed, 14 Jul 2021 20:42:43 +0000 (22:42 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:41:45 +0000 (22:41 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:40:29 +0000 (22:40 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:39:20 +0000 (22:39 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 19:09:36 +0000 (21:09 +0200)]
Remove BLASLONG casts from SPARC entries
in response to https://github.com/xianyi/OpenBLAS/pull/3266#issuecomment-878637675
Martin Kroeker [Wed, 14 Jul 2021 18:50:14 +0000 (20:50 +0200)]
Temporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures
Martin Kroeker [Wed, 14 Jul 2021 18:32:57 +0000 (20:32 +0200)]
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64
Martin Kroeker [Wed, 14 Jul 2021 14:06:13 +0000 (16:06 +0200)]
Merge pull request #3310 from jeromerobert/develop
Avoid redefinition of _GNU_SOURCE
Jerome Robert [Wed, 14 Jul 2021 10:20:57 +0000 (12:20 +0200)]
Avoid redefinition of _GNU_SOURCE
* _GNU_SOURCE may have been set by the application, and redefining it
triggers warnings, or errors with -Werror
* Fix for 220f6a1c5
Martin Kroeker [Mon, 12 Jul 2021 14:30:10 +0000 (16:30 +0200)]
Merge pull request #3306 from jonaszhou1/develop
Add cpu detection support for Zhaoxin processors
JonasZhou [Mon, 12 Jul 2021 05:43:45 +0000 (13:43 +0800)]
Add cpu detection support for Zhaoxin processors
Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
Martin Kroeker [Sun, 11 Jul 2021 22:16:59 +0000 (00:16 +0200)]
Update version to 0.3.16.dev
Martin Kroeker [Sun, 11 Jul 2021 22:16:29 +0000 (00:16 +0200)]
Update version to 0.3.16.dev
Martin Kroeker [Sun, 11 Jul 2021 22:15:52 +0000 (00:15 +0200)]
Merge pull request #3305 from xianyi/release-0.3.0
Merge release branch back into develop to copy tag
Martin Kroeker [Sun, 11 Jul 2021 22:12:52 +0000 (00:12 +0200)]
Merge pull request #3304 from xianyi/develop
Merge develop into 0.3.0 for release 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:12:25 +0000 (00:12 +0200)]
Merge branch 'release-0.3.0' into develop
Martin Kroeker [Sun, 11 Jul 2021 22:09:35 +0000 (00:09 +0200)]
Update version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:08:55 +0000 (00:08 +0200)]
Update version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 21:50:02 +0000 (23:50 +0200)]
Merge pull request #3303 from martin-frbg/changelog16
Update Changelog for 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 21:48:39 +0000 (23:48 +0200)]
Update Changelog for 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 20:50:20 +0000 (22:50 +0200)]
Merge pull request #3300 from martin-frbg/AzureAlpine
Move Alpine Linux build job from Travis to Azure
Martin Kroeker [Sun, 11 Jul 2021 20:26:41 +0000 (22:26 +0200)]
Merge pull request #3302 from martin-frbg/small_cleanup
Clean up some warnings
Martin Kroeker [Sun, 11 Jul 2021 16:29:17 +0000 (18:29 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:08:30 +0000 (18:08 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 15:10:20 +0000 (17:10 +0200)]
Merge pull request #3297 from outerpassage/develop
fix compilation with musl libc
Martin Kroeker [Sun, 11 Jul 2021 14:00:29 +0000 (16:00 +0200)]
Clean up some warnings
Martin Kroeker [Sun, 11 Jul 2021 12:38:49 +0000 (14:38 +0200)]
Revert addition of test_install
Martin Kroeker [Sun, 11 Jul 2021 12:37:38 +0000 (14:37 +0200)]
Revert addition of test_install target
Martin Kroeker [Sun, 11 Jul 2021 09:54:02 +0000 (11:54 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:29:52 +0000 (11:29 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 07:38:48 +0000 (09:38 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 06:24:20 +0000 (08:24 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 05:25:07 +0000 (07:25 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 05:20:19 +0000 (07:20 +0200)]
Merge pull request #3301 from martin-frbg/syr2bench
Handle OPENBLAS_LOOPS in SYR2 benchmark
Martin Kroeker [Sat, 10 Jul 2021 21:21:58 +0000 (23:21 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 20:41:49 +0000 (22:41 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 19:27:53 +0000 (21:27 +0200)]
Handle OPENBLAS_LOOPS in SYR2 benchmark
Martin Kroeker [Sat, 10 Jul 2021 18:57:21 +0000 (20:57 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:48:53 +0000 (20:48 +0200)]
Merge pull request #3299 from martin-frbg/issue3298
Fix copy-paste error in LIBCORE assignment for Tiger Lake
Martin Kroeker [Sat, 10 Jul 2021 18:12:33 +0000 (20:12 +0200)]
Update azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 17:52:04 +0000 (19:52 +0200)]
add sudo for install in Alpine
Martin Kroeker [Sat, 10 Jul 2021 17:03:42 +0000 (19:03 +0200)]
fix typo
Martin Kroeker [Sat, 10 Jul 2021 16:52:44 +0000 (18:52 +0200)]
Move Alpine Linux build job from Travis to Azure
Martin Kroeker [Sat, 10 Jul 2021 16:20:40 +0000 (18:20 +0200)]
Fix copy-paste error in LIBCORE assignment for Tiger Lake
River Dillon [Sat, 10 Jul 2021 08:34:47 +0000 (01:34 -0700)]
Contribution note
River Dillon [Sat, 10 Jul 2021 07:39:52 +0000 (00:39 -0700)]
Add test of installed <openblas_config.h>
River Dillon [Sat, 10 Jul 2021 07:38:02 +0000 (00:38 -0700)]
Add feature test macro for proper inclusion of <sched.h>
River Dillon [Sat, 10 Jul 2021 07:36:07 +0000 (00:36 -0700)]
Remove <linux/unistd.h>
Martin Kroeker [Thu, 8 Jul 2021 19:24:15 +0000 (21:24 +0200)]
Merge pull request #3296 from martin-frbg/issue3295
Support Zhaoxin/Centaur family 7 processors as Nehalem
Martin Kroeker [Thu, 8 Jul 2021 16:28:49 +0000 (18:28 +0200)]
Add vendor string Shanghai as successor to Centaur
Martin Kroeker [Thu, 8 Jul 2021 16:26:23 +0000 (18:26 +0200)]
Add vendor string Shanghai as the successor to Centaur
Martin Kroeker [Thu, 8 Jul 2021 10:23:15 +0000 (12:23 +0200)]
Recognize newer Zhaoxin/Centaur processors as Nehalem
Martin Kroeker [Thu, 8 Jul 2021 10:20:19 +0000 (12:20 +0200)]
Recognize newer Zhaoxin/Centaur cpus as Nehalem
Martin Kroeker [Wed, 7 Jul 2021 18:58:29 +0000 (20:58 +0200)]
Merge pull request #3265 from TAAPArthur/improve_portability
Removed use of non-portable '-p' arg to install
Martin Kroeker [Wed, 7 Jul 2021 18:46:54 +0000 (20:46 +0200)]
Merge pull request #3293 from martin-frbg/issue3290
Enable (C)EXTRALIB as for any other platform when building the tests on RISCV C910V