platform/upstream/openblas.git
2 years agoChange a1b0 gemm to b0 gemm.
Xianyi Zhang [Thu, 27 Aug 2020 23:55:27 +0000 (07:55 +0800)]
Change a1b0 gemm to b0 gemm.

2 years agoRefs #2587 Fix typos.
Xianyi Zhang [Tue, 28 Apr 2020 16:19:19 +0000 (00:19 +0800)]
Refs #2587 Fix typos.

2 years agoFix gemm interface bug for small matrix.
Xianyi Zhang [Tue, 28 Apr 2020 15:15:20 +0000 (23:15 +0800)]
Fix gemm interface bug for small matrix.

2 years agoAdd alpha=1.0 beta=0.0 for small gemm.
Xianyi Zhang [Tue, 28 Apr 2020 14:35:36 +0000 (22:35 +0800)]
Add alpha=1.0 beta=0.0 for small gemm.

2 years agoAdd small marix optimization kernel interface.
Xianyi Zhang [Tue, 28 Apr 2020 11:01:36 +0000 (19:01 +0800)]
Add small marix optimization kernel interface.

make SMALL_MATRIX_OPT=1

2 years agoMerge pull request #3331 from gxw-loongson/develop
Martin Kroeker [Mon, 2 Aug 2021 05:21:46 +0000 (07:21 +0200)]
Merge pull request #3331 from gxw-loongson/develop

Fixed typos about LOONGARCH64

2 years agoAdd cmake support for LOONGARCH64
gxw [Mon, 2 Aug 2021 02:00:41 +0000 (10:00 +0800)]
Add cmake support for LOONGARCH64

2 years agoMerge pull request #3329 from martin-frbg/issue3272
Martin Kroeker [Fri, 30 Jul 2021 18:39:38 +0000 (20:39 +0200)]
Merge pull request #3329 from martin-frbg/issue3272

Work around gcc11+ miscompiling C/ZBLAS3 tests at -O3

2 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:46:19 +0000 (14:46 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

2 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:44:54 +0000 (14:44 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

2 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:21:08 +0000 (14:21 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

2 years agoDisable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
Martin Kroeker [Fri, 30 Jul 2021 12:20:12 +0000 (14:20 +0200)]
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3

2 years agoFixed typos about LOONGARCH64
gxw [Fri, 30 Jul 2021 10:11:12 +0000 (18:11 +0800)]
Fixed typos about LOONGARCH64

2 years agoMerge pull request #3327 from h-vetinari/lapack597_redux
Martin Kroeker [Wed, 28 Jul 2021 21:04:02 +0000 (23:04 +0200)]
Merge pull request #3327 from h-vetinari/lapack597_redux

Complete the carry of lapack PR 597

2 years agoMerge pull request #3326 from wattoc/develop
Martin Kroeker [Wed, 28 Jul 2021 21:03:37 +0000 (23:03 +0200)]
Merge pull request #3326 from wattoc/develop

Include Haiku in processor count checks

2 years agoMerge pull request #3325 from gxw-loongson/develop
Martin Kroeker [Wed, 28 Jul 2021 21:03:15 +0000 (23:03 +0200)]
Merge pull request #3325 from gxw-loongson/develop

Add support for LOONGARCH64

2 years agoAlso make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS
Isuru Fernando [Thu, 15 Jul 2021 09:54:33 +0000 (04:54 -0500)]
Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS

2 years agoInclude Haiku in processor count checks
Craig Watson [Tue, 27 Jul 2021 09:00:30 +0000 (09:00 +0000)]
Include Haiku in processor count checks

2 years agoAdd support for LOONGARCH64
gxw [Mon, 26 Jul 2021 07:44:54 +0000 (15:44 +0800)]
Add support for LOONGARCH64

2 years agoMerge pull request #3323 from martin-frbg/issue3322
Martin Kroeker [Fri, 23 Jul 2021 20:46:02 +0000 (22:46 +0200)]
Merge pull request #3323 from martin-frbg/issue3322

GCC did not support -mtune for ARM64 before 5.1

2 years agoMerge pull request #3314 from martin-frbg/lapack597
Martin Kroeker [Fri, 23 Jul 2021 13:30:27 +0000 (15:30 +0200)]
Merge pull request #3314 from martin-frbg/lapack597

Fix LAPACK testsuite compatibility with libomp (Reference-LAPACK PR 597)

2 years agoGCC did not support -mtune for ARM64 before 5.1
Martin Kroeker [Fri, 23 Jul 2021 11:42:52 +0000 (13:42 +0200)]
GCC did not support -mtune for ARM64 before 5.1

2 years agoMerge pull request #3320 from martin-frbg/issue3318
Martin Kroeker [Thu, 22 Jul 2021 19:28:50 +0000 (21:28 +0200)]
Merge pull request #3320 from martin-frbg/issue3318

Empirical workaround for numpy SVD NaN problem from issue 3318

2 years agoFix regex to match kernels suffixed with cpuname too
Martin Kroeker [Thu, 22 Jul 2021 15:24:15 +0000 (17:24 +0200)]
Fix regex to match kernels suffixed with cpuname too

2 years agoRework setting of -mfma to only apply it where necessary
Martin Kroeker [Thu, 22 Jul 2021 10:00:03 +0000 (12:00 +0200)]
Rework setting of -mfma to only apply it where necessary

2 years agoEmpirical workaround for numpy SVD NaN problem from issue 3318
Martin Kroeker [Sun, 18 Jul 2021 20:19:19 +0000 (22:19 +0200)]
Empirical workaround for numpy SVD NaN problem from issue 3318

2 years agoUpdate version to 0.3.17.dev
Martin Kroeker [Thu, 15 Jul 2021 13:00:01 +0000 (15:00 +0200)]
Update version to 0.3.17.dev

2 years agoUpdate version to 0.3.17.dev
Martin Kroeker [Thu, 15 Jul 2021 12:59:15 +0000 (14:59 +0200)]
Update version to 0.3.17.dev

2 years agoMerge pull request #3317 from xianyi/release-0.3.0
Martin Kroeker [Thu, 15 Jul 2021 12:58:20 +0000 (14:58 +0200)]
Merge pull request #3317 from xianyi/release-0.3.0

merge 0.3.17 back into develop to copy tag

2 years agoUpdate version to 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:52:54 +0000 (14:52 +0200)]
Update version to 0.3.17

2 years agoMerge pull request #3316 from xianyi/develop
Martin Kroeker [Thu, 15 Jul 2021 12:51:50 +0000 (14:51 +0200)]
Merge pull request #3316 from xianyi/develop

Merge develop for bugfix release 0.3.17

2 years agoMerge branch 'release-0.3.0' into develop
Martin Kroeker [Thu, 15 Jul 2021 12:50:20 +0000 (14:50 +0200)]
Merge branch 'release-0.3.0' into develop

2 years agoUpdate version to 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:46:24 +0000 (14:46 +0200)]
Update version to 0.3.17

2 years agoMerge pull request #3315 from martin-frbg/changelog0317
Martin Kroeker [Thu, 15 Jul 2021 12:44:59 +0000 (14:44 +0200)]
Merge pull request #3315 from martin-frbg/changelog0317

Update Changelog for 0.3.17

2 years agoUpdate Changelog for 0.3.17
Martin Kroeker [Thu, 15 Jul 2021 12:44:17 +0000 (14:44 +0200)]
Update Changelog for 0.3.17

2 years agoMerge pull request #3313 from martin-frbg/3266-2
Martin Kroeker [Thu, 15 Jul 2021 06:00:57 +0000 (08:00 +0200)]
Merge pull request #3313 from martin-frbg/3266-2

Remove BLASLONG casts from SPARC parameter entries

2 years agoMerge pull request #3312 from martin-frbg/revert_3260
Martin Kroeker [Thu, 15 Jul 2021 06:00:34 +0000 (08:00 +0200)]
Merge pull request #3312 from martin-frbg/revert_3260

Temporarily disable the SkylakeX sgemv_t microkernel

2 years agoMerge pull request #3311 from martin-frbg/issue3309
Martin Kroeker [Thu, 15 Jul 2021 05:58:47 +0000 (07:58 +0200)]
Merge pull request #3311 from martin-frbg/issue3309

Revert PR #3250 (shortcut without buffer allocation) as it is unsafe …

2 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:42:43 +0000 (22:42 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

2 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:41:45 +0000 (22:41 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

2 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:40:29 +0000 (22:40 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

2 years agoDeclare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp
Martin Kroeker [Wed, 14 Jul 2021 20:39:20 +0000 (22:39 +0200)]
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp

2 years agoRemove BLASLONG casts from SPARC entries
Martin Kroeker [Wed, 14 Jul 2021 19:09:36 +0000 (21:09 +0200)]
Remove BLASLONG casts from SPARC entries

in response to https://github.com/xianyi/OpenBLAS/pull/3266#issuecomment-878637675

2 years agoTemporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures
Martin Kroeker [Wed, 14 Jul 2021 18:50:14 +0000 (20:50 +0200)]
Temporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures

2 years agoRevert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64
Martin Kroeker [Wed, 14 Jul 2021 18:32:57 +0000 (20:32 +0200)]
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64

2 years agoMerge pull request #3310 from jeromerobert/develop
Martin Kroeker [Wed, 14 Jul 2021 14:06:13 +0000 (16:06 +0200)]
Merge pull request #3310 from jeromerobert/develop

Avoid redefinition of _GNU_SOURCE

2 years agoAvoid redefinition of _GNU_SOURCE
Jerome Robert [Wed, 14 Jul 2021 10:20:57 +0000 (12:20 +0200)]
Avoid redefinition of _GNU_SOURCE

* _GNU_SOURCE may have been set by the application and redefinition
  trigger warnings or error with -Werror
* Fix for 220f6a1c5

2 years agoMerge pull request #3306 from jonaszhou1/develop
Martin Kroeker [Mon, 12 Jul 2021 14:30:10 +0000 (16:30 +0200)]
Merge pull request #3306 from jonaszhou1/develop

Add cpu detection support for Zhaoxin processors

2 years agoAdd cpu detection support for Zhaoxin processors
JonasZhou [Mon, 12 Jul 2021 05:43:45 +0000 (13:43 +0800)]
Add cpu detection support for Zhaoxin processors

Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
2 years agoUpdate version to 0.3.16.dev
Martin Kroeker [Sun, 11 Jul 2021 22:16:59 +0000 (00:16 +0200)]
Update version to 0.3.16.dev

2 years agoUpdate version to 0.3.16.dev
Martin Kroeker [Sun, 11 Jul 2021 22:16:29 +0000 (00:16 +0200)]
Update version to 0.3.16.dev

2 years agoMerge pull request #3305 from xianyi/release-0.3.0
Martin Kroeker [Sun, 11 Jul 2021 22:15:52 +0000 (00:15 +0200)]
Merge pull request #3305 from xianyi/release-0.3.0

Merge release branch back into develop to copy tag

2 years agoMerge pull request #3304 from xianyi/develop
Martin Kroeker [Sun, 11 Jul 2021 22:12:52 +0000 (00:12 +0200)]
Merge pull request #3304 from xianyi/develop

Merge develop int0 0.3.0 for release 0.3.16

2 years agoMerge branch 'release-0.3.0' into develop
Martin Kroeker [Sun, 11 Jul 2021 22:12:25 +0000 (00:12 +0200)]
Merge branch 'release-0.3.0' into develop

2 years agoUpdate version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:09:35 +0000 (00:09 +0200)]
Update version to 0.3.16

2 years agoUpdate version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:08:55 +0000 (00:08 +0200)]
Update version to 0.3.16

2 years agoMerge pull request #3303 from martin-frbg/changelog16
Martin Kroeker [Sun, 11 Jul 2021 21:50:02 +0000 (23:50 +0200)]
Merge pull request #3303 from martin-frbg/changelog16

Update Changelog for 0.3.16

2 years agoUpdate Changelog for 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 21:48:39 +0000 (23:48 +0200)]
Update Changelog for 0.3.16

2 years agoMerge pull request #3300 from martin-frbg/AzureAlpine
Martin Kroeker [Sun, 11 Jul 2021 20:50:20 +0000 (22:50 +0200)]
Merge pull request #3300 from martin-frbg/AzureAlpine

Move Alpine Linux build job from Travis to Azure

2 years agoMerge pull request #3302 from martin-frbg/small_cleanup
Martin Kroeker [Sun, 11 Jul 2021 20:26:41 +0000 (22:26 +0200)]
Merge pull request #3302 from martin-frbg/small_cleanup

Clean up some warnings

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:29:17 +0000 (18:29 +0200)]
Update azure-pipelines.yml

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:08:30 +0000 (18:08 +0200)]
Update azure-pipelines.yml

2 years agoMerge pull request #3297 from outerpassage/develop
Martin Kroeker [Sun, 11 Jul 2021 15:10:20 +0000 (17:10 +0200)]
Merge pull request #3297 from outerpassage/develop

fix compilation with musl libc

2 years agoClean up some warnings
Martin Kroeker [Sun, 11 Jul 2021 14:00:29 +0000 (16:00 +0200)]
Clean up some warnings

2 years agoRevert addition of test_install
Martin Kroeker [Sun, 11 Jul 2021 12:38:49 +0000 (14:38 +0200)]
Revert addition of test_install

2 years agoRevert addition of test_install target
Martin Kroeker [Sun, 11 Jul 2021 12:37:38 +0000 (14:37 +0200)]
Revert addition of test_install target

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:54:02 +0000 (11:54 +0200)]
Update azure-pipelines.yml

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:29:52 +0000 (11:29 +0200)]
Update azure-pipelines.yml

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 07:38:48 +0000 (09:38 +0200)]
Update azure-pipelines.yml

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 06:24:20 +0000 (08:24 +0200)]
Update azure-pipelines.yml

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 05:25:07 +0000 (07:25 +0200)]
Update azure-pipelines.yml

2 years agoMerge pull request #3301 from martin-frbg/syr2bench
Martin Kroeker [Sun, 11 Jul 2021 05:20:19 +0000 (07:20 +0200)]
Merge pull request #3301 from martin-frbg/syr2bench

Handle OPENBLAS_LOOPS in SYR2 benchmark

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 21:21:58 +0000 (23:21 +0200)]
Update azure-pipelines.yml

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 20:41:49 +0000 (22:41 +0200)]
Update azure-pipelines.yml

2 years agoHandle OPENBLAS_LOOPS in SYR2 benchmark
Martin Kroeker [Sat, 10 Jul 2021 19:27:53 +0000 (21:27 +0200)]
Handle OPENBLAS_LOOPS in SYR2 benchmark

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:57:21 +0000 (20:57 +0200)]
Update azure-pipelines.yml

2 years agoMerge pull request #3299 from martin-frbg/issue3298
Martin Kroeker [Sat, 10 Jul 2021 18:48:53 +0000 (20:48 +0200)]
Merge pull request #3299 from martin-frbg/issue3298

Fix copy-paste error in LIBCORE assignment for Tiger Lake

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:12:33 +0000 (20:12 +0200)]
Update azure-pipelines.yml

2 years agoadd sudo for install in Alpine
Martin Kroeker [Sat, 10 Jul 2021 17:52:04 +0000 (19:52 +0200)]
add sudo for install in Alpine

2 years agofix typo
Martin Kroeker [Sat, 10 Jul 2021 17:03:42 +0000 (19:03 +0200)]
fix typo

2 years agoMove Alpine Linux build job from Travis to Azure
Martin Kroeker [Sat, 10 Jul 2021 16:52:44 +0000 (18:52 +0200)]
Move Alpine Linux build job from Travis to Azure

2 years agoFix copy-paste error in LIBCORE assignment for Tiger Lake
Martin Kroeker [Sat, 10 Jul 2021 16:20:40 +0000 (18:20 +0200)]
Fix copy-paste error in LIBCORE assignment for Tiger Lake

2 years agoContribution note
River Dillon [Sat, 10 Jul 2021 08:34:47 +0000 (01:34 -0700)]
Contribution note

2 years agoAdd test of installed <openblas_config.h>
River Dillon [Sat, 10 Jul 2021 07:39:52 +0000 (00:39 -0700)]
Add test of installed <openblas_config.h>

2 years agoAdd feature test macro for proper inclusion of <sched.h>
River Dillon [Sat, 10 Jul 2021 07:38:02 +0000 (00:38 -0700)]
Add feature test macro for proper inclusion of <sched.h>

2 years agoRemove <linux/unistd.h>
River Dillon [Sat, 10 Jul 2021 07:36:07 +0000 (00:36 -0700)]
Remove <linux/unistd.h>

2 years agoMerge pull request #3296 from martin-frbg/issue3295
Martin Kroeker [Thu, 8 Jul 2021 19:24:15 +0000 (21:24 +0200)]
Merge pull request #3296 from martin-frbg/issue3295

Support Zhaoxin/Centaur family 7 processors as Nehalem

2 years agoAdd vendor string Shanghai as successor to Centaur
Martin Kroeker [Thu, 8 Jul 2021 16:28:49 +0000 (18:28 +0200)]
Add vendor string Shanghai as successor to Centaur

2 years agoAdd vendor string Shanghai as the successor to Centaur
Martin Kroeker [Thu, 8 Jul 2021 16:26:23 +0000 (18:26 +0200)]
Add vendor string Shanghai as the successor to Centaur

2 years agoRecognize newer Zhaoxin/Centaur processors as Nehalem
Martin Kroeker [Thu, 8 Jul 2021 10:23:15 +0000 (12:23 +0200)]
Recognize newer Zhaoxin/Centaur processors as Nehalem

2 years agoRecognize newer Zhaoxin/Centaur cpus as Nehalem
Martin Kroeker [Thu, 8 Jul 2021 10:20:19 +0000 (12:20 +0200)]
Recognize newer Zhaoxin/Centaur cpus as Nehalem

2 years agoMerge pull request #3265 from TAAPArthur/improve_portability
Martin Kroeker [Wed, 7 Jul 2021 18:58:29 +0000 (20:58 +0200)]
Merge pull request #3265 from TAAPArthur/improve_portability

Removed use of non portable '-p' arg to install

2 years agoMerge pull request #3293 from martin-frbg/issue3290
Martin Kroeker [Wed, 7 Jul 2021 18:46:54 +0000 (20:46 +0200)]
Merge pull request #3293 from martin-frbg/issue3290

Enable (C)EXTRALIB as for any other platform when building the tests on RISCV C910V

2 years agoMerge pull request #3292 from martin-frbg/syrk_limit
Martin Kroeker [Wed, 7 Jul 2021 18:46:28 +0000 (20:46 +0200)]
Merge pull request #3292 from martin-frbg/syrk_limit

Add lower limit for multithreading in xSYRK

2 years agoMerge pull request #3288 from martin-frbg/getrf-2
Martin Kroeker [Wed, 7 Jul 2021 18:45:57 +0000 (20:45 +0200)]
Merge pull request #3288 from martin-frbg/getrf-2

Add lower threshold for multithreading in ?GETRF

2 years agoRemove code that disabled EXTRALIB on RISCV C910V
Martin Kroeker [Tue, 6 Jul 2021 18:21:07 +0000 (20:21 +0200)]
Remove code that disabled EXTRALIB on RISCV C910V

2 years agoAdd lower limit for multithreading
Martin Kroeker [Sun, 4 Jul 2021 15:00:26 +0000 (17:00 +0200)]
Add lower limit for multithreading

2 years agoHandle OPENBLAS_LOOP
Martin Kroeker [Sun, 4 Jul 2021 14:59:43 +0000 (16:59 +0200)]
Handle OPENBLAS_LOOP

2 years agoMerge pull request #3289 from martin-frbg/issue3283
Martin Kroeker [Thu, 1 Jul 2021 22:19:06 +0000 (00:19 +0200)]
Merge pull request #3289 from martin-frbg/issue3283

Update README to mention availability of the Windows binaries in the Releases section

2 years agoMention availability of the Windows binaries in the Releases section
Martin Kroeker [Thu, 1 Jul 2021 17:24:35 +0000 (19:24 +0200)]
Mention availability of the Windows binaries in the Releases section