platform/upstream/openblas.git
3 years agoMerge pull request #3305 from xianyi/release-0.3.0
Martin Kroeker [Sun, 11 Jul 2021 22:15:52 +0000 (00:15 +0200)]
Merge pull request #3305 from xianyi/release-0.3.0

Merge release branch back into develop to copy tag

3 years agoMerge pull request #3304 from xianyi/develop
Martin Kroeker [Sun, 11 Jul 2021 22:12:52 +0000 (00:12 +0200)]
Merge pull request #3304 from xianyi/develop

Merge develop int0 0.3.0 for release 0.3.16

3 years agoMerge branch 'release-0.3.0' into develop
Martin Kroeker [Sun, 11 Jul 2021 22:12:25 +0000 (00:12 +0200)]
Merge branch 'release-0.3.0' into develop

3 years agoUpdate version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:09:35 +0000 (00:09 +0200)]
Update version to 0.3.16

3 years agoUpdate version to 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 22:08:55 +0000 (00:08 +0200)]
Update version to 0.3.16

3 years agoMerge pull request #3303 from martin-frbg/changelog16
Martin Kroeker [Sun, 11 Jul 2021 21:50:02 +0000 (23:50 +0200)]
Merge pull request #3303 from martin-frbg/changelog16

Update Changelog for 0.3.16

3 years agoUpdate Changelog for 0.3.16
Martin Kroeker [Sun, 11 Jul 2021 21:48:39 +0000 (23:48 +0200)]
Update Changelog for 0.3.16

3 years agoMerge pull request #3300 from martin-frbg/AzureAlpine
Martin Kroeker [Sun, 11 Jul 2021 20:50:20 +0000 (22:50 +0200)]
Merge pull request #3300 from martin-frbg/AzureAlpine

Move Alpine Linux build job from Travis to Azure

3 years agoMerge pull request #3302 from martin-frbg/small_cleanup
Martin Kroeker [Sun, 11 Jul 2021 20:26:41 +0000 (22:26 +0200)]
Merge pull request #3302 from martin-frbg/small_cleanup

Clean up some warnings

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:29:17 +0000 (18:29 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 16:08:30 +0000 (18:08 +0200)]
Update azure-pipelines.yml

3 years agoMerge pull request #3297 from outerpassage/develop
Martin Kroeker [Sun, 11 Jul 2021 15:10:20 +0000 (17:10 +0200)]
Merge pull request #3297 from outerpassage/develop

fix compilation with musl libc

3 years agoClean up some warnings
Martin Kroeker [Sun, 11 Jul 2021 14:00:29 +0000 (16:00 +0200)]
Clean up some warnings

3 years agoRevert addition of test_install
Martin Kroeker [Sun, 11 Jul 2021 12:38:49 +0000 (14:38 +0200)]
Revert addition of test_install

3 years agoRevert addition of test_install target
Martin Kroeker [Sun, 11 Jul 2021 12:37:38 +0000 (14:37 +0200)]
Revert addition of test_install target

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:54:02 +0000 (11:54 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 09:29:52 +0000 (11:29 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 07:38:48 +0000 (09:38 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 06:24:20 +0000 (08:24 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sun, 11 Jul 2021 05:25:07 +0000 (07:25 +0200)]
Update azure-pipelines.yml

3 years agoMerge pull request #3301 from martin-frbg/syr2bench
Martin Kroeker [Sun, 11 Jul 2021 05:20:19 +0000 (07:20 +0200)]
Merge pull request #3301 from martin-frbg/syr2bench

Handle OPENBLAS_LOOPS in SYR2 benchmark

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 21:21:58 +0000 (23:21 +0200)]
Update azure-pipelines.yml

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 20:41:49 +0000 (22:41 +0200)]
Update azure-pipelines.yml

3 years agoHandle OPENBLAS_LOOPS in SYR2 benchmark
Martin Kroeker [Sat, 10 Jul 2021 19:27:53 +0000 (21:27 +0200)]
Handle OPENBLAS_LOOPS in SYR2 benchmark

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:57:21 +0000 (20:57 +0200)]
Update azure-pipelines.yml

3 years agoMerge pull request #3299 from martin-frbg/issue3298
Martin Kroeker [Sat, 10 Jul 2021 18:48:53 +0000 (20:48 +0200)]
Merge pull request #3299 from martin-frbg/issue3298

Fix copy-paste error in LIBCORE assignment for Tiger Lake

3 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 10 Jul 2021 18:12:33 +0000 (20:12 +0200)]
Update azure-pipelines.yml

3 years agoadd sudo for install in Alpine
Martin Kroeker [Sat, 10 Jul 2021 17:52:04 +0000 (19:52 +0200)]
add sudo for install in Alpine

3 years agofix typo
Martin Kroeker [Sat, 10 Jul 2021 17:03:42 +0000 (19:03 +0200)]
fix typo

3 years agoMove Alpine Linux build job from Travis to Azure
Martin Kroeker [Sat, 10 Jul 2021 16:52:44 +0000 (18:52 +0200)]
Move Alpine Linux build job from Travis to Azure

3 years agoFix copy-paste error in LIBCORE assignment for Tiger Lake
Martin Kroeker [Sat, 10 Jul 2021 16:20:40 +0000 (18:20 +0200)]
Fix copy-paste error in LIBCORE assignment for Tiger Lake

3 years agoContribution note
River Dillon [Sat, 10 Jul 2021 08:34:47 +0000 (01:34 -0700)]
Contribution note

3 years agoAdd test of installed <openblas_config.h>
River Dillon [Sat, 10 Jul 2021 07:39:52 +0000 (00:39 -0700)]
Add test of installed <openblas_config.h>

3 years agoAdd feature test macro for proper inclusion of <sched.h>
River Dillon [Sat, 10 Jul 2021 07:38:02 +0000 (00:38 -0700)]
Add feature test macro for proper inclusion of <sched.h>

3 years agoRemove <linux/unistd.h>
River Dillon [Sat, 10 Jul 2021 07:36:07 +0000 (00:36 -0700)]
Remove <linux/unistd.h>

3 years agoMerge pull request #3296 from martin-frbg/issue3295
Martin Kroeker [Thu, 8 Jul 2021 19:24:15 +0000 (21:24 +0200)]
Merge pull request #3296 from martin-frbg/issue3295

Support Zhaoxin/Centaur family 7 processors as Nehalem

3 years agoAdd vendor string Shanghai as successor to Centaur
Martin Kroeker [Thu, 8 Jul 2021 16:28:49 +0000 (18:28 +0200)]
Add vendor string Shanghai as successor to Centaur

3 years agoAdd vendor string Shanghai as the successor to Centaur
Martin Kroeker [Thu, 8 Jul 2021 16:26:23 +0000 (18:26 +0200)]
Add vendor string Shanghai as the successor to Centaur

3 years agoRecognize newer Zhaoxin/Centaur processors as Nehalem
Martin Kroeker [Thu, 8 Jul 2021 10:23:15 +0000 (12:23 +0200)]
Recognize newer Zhaoxin/Centaur processors as Nehalem

3 years agoRecognize newer Zhaoxin/Centaur cpus as Nehalem
Martin Kroeker [Thu, 8 Jul 2021 10:20:19 +0000 (12:20 +0200)]
Recognize newer Zhaoxin/Centaur cpus as Nehalem

3 years agoMerge pull request #3265 from TAAPArthur/improve_portability
Martin Kroeker [Wed, 7 Jul 2021 18:58:29 +0000 (20:58 +0200)]
Merge pull request #3265 from TAAPArthur/improve_portability

Removed use of non portable '-p' arg to install

3 years agoMerge pull request #3293 from martin-frbg/issue3290
Martin Kroeker [Wed, 7 Jul 2021 18:46:54 +0000 (20:46 +0200)]
Merge pull request #3293 from martin-frbg/issue3290

Enable (C)EXTRALIB as for any other platform when building the tests on RISCV C910V

3 years agoMerge pull request #3292 from martin-frbg/syrk_limit
Martin Kroeker [Wed, 7 Jul 2021 18:46:28 +0000 (20:46 +0200)]
Merge pull request #3292 from martin-frbg/syrk_limit

Add lower limit for multithreading in xSYRK

3 years agoMerge pull request #3288 from martin-frbg/getrf-2
Martin Kroeker [Wed, 7 Jul 2021 18:45:57 +0000 (20:45 +0200)]
Merge pull request #3288 from martin-frbg/getrf-2

Add lower threshold for multithreading in ?GETRF

3 years agoRemove code that disabled EXTRALIB on RISCV C910V
Martin Kroeker [Tue, 6 Jul 2021 18:21:07 +0000 (20:21 +0200)]
Remove code that disabled EXTRALIB on RISCV C910V

3 years agoAdd lower limit for multithreading
Martin Kroeker [Sun, 4 Jul 2021 15:00:26 +0000 (17:00 +0200)]
Add lower limit for multithreading

3 years agoHandle OPENBLAS_LOOP
Martin Kroeker [Sun, 4 Jul 2021 14:59:43 +0000 (16:59 +0200)]
Handle OPENBLAS_LOOP

3 years agoMerge pull request #3289 from martin-frbg/issue3283
Martin Kroeker [Thu, 1 Jul 2021 22:19:06 +0000 (00:19 +0200)]
Merge pull request #3289 from martin-frbg/issue3283

Update README to mention availability of the Windows binaries in the Releases section

3 years agoMention availability of the Windows binaries in the Releases section
Martin Kroeker [Thu, 1 Jul 2021 17:24:35 +0000 (19:24 +0200)]
Mention availability of the Windows binaries in the Releases section

3 years agoAdd lower threshold for multithreading
Martin Kroeker [Thu, 1 Jul 2021 15:41:05 +0000 (17:41 +0200)]
Add lower threshold for multithreading

3 years agoHandle OPENBLAS_LOOPS for more stable results
Martin Kroeker [Thu, 1 Jul 2021 15:39:37 +0000 (17:39 +0200)]
Handle OPENBLAS_LOOPS for more stable results

3 years agoHandle OPENBLAS_LOOPS and OPENBLAS_TEST options
Martin Kroeker [Thu, 1 Jul 2021 15:38:45 +0000 (17:38 +0200)]
Handle OPENBLAS_LOOPS and OPENBLAS_TEST options

3 years agoMerge pull request #3284 from martin-frbg/potrf_potri
Martin Kroeker [Wed, 30 Jun 2021 05:42:45 +0000 (07:42 +0200)]
Merge pull request #3284 from martin-frbg/potrf_potri

Add lower thresholds for multithreading in POTRF/POTRI and improve the related benchmark

3 years agoMerge pull request #3287 from martin-frbg/appveyor-conda
Martin Kroeker [Tue, 29 Jun 2021 18:09:26 +0000 (20:09 +0200)]
Merge pull request #3287 from martin-frbg/appveyor-conda

Work around current conda/tqdm auto-update problem on Appveyor

3 years agoWork around current conda/tqdm auto-update problem
Martin Kroeker [Tue, 29 Jun 2021 15:17:34 +0000 (17:17 +0200)]
Work around current conda/tqdm auto-update problem

3 years agoAdd lower threshold for multithreading in ?potrf and ?potri
Martin Kroeker [Sat, 26 Jun 2021 21:47:41 +0000 (23:47 +0200)]
Add lower threshold for multithreading in ?potrf and ?potri

3 years agoAdd OPENBLAS_LOOPS support to potrf/potrs/potri benchmark
Martin Kroeker [Sat, 26 Jun 2021 21:46:00 +0000 (23:46 +0200)]
Add OPENBLAS_LOOPS support to potrf/potrs/potri benchmark

3 years agoAdd interface declarations for ?potri
Martin Kroeker [Sat, 26 Jun 2021 21:44:56 +0000 (23:44 +0200)]
Add interface declarations for ?potri

3 years agoMerge pull request #3278 from brada4/A55
Martin Kroeker [Wed, 23 Jun 2021 11:05:17 +0000 (13:05 +0200)]
Merge pull request #3278 from brada4/A55

Add CORTEXA55 cpuid 0xd05 support

3 years agoUpdate cpuid_arm64.c
Martin Kroeker [Wed, 23 Jun 2021 08:11:01 +0000 (10:11 +0200)]
Update cpuid_arm64.c

3 years agoOK
User User-User [Tue, 22 Jun 2021 21:58:59 +0000 (23:58 +0200)]
OK

3 years agobugz
User User-User [Sun, 20 Jun 2021 14:40:43 +0000 (16:40 +0200)]
bugz

3 years agoadd to cmake listings too
User User-User [Sun, 20 Jun 2021 13:32:42 +0000 (15:32 +0200)]
add to cmake listings too

3 years agoremove misplaced file
User User-User [Sun, 20 Jun 2021 13:29:25 +0000 (15:29 +0200)]
remove misplaced file

3 years agodynamic_arch
User User-User [Sun, 20 Jun 2021 12:19:41 +0000 (14:19 +0200)]
dynamic_arch

3 years agoadd gcc8 version matching
User User-User [Sat, 19 Jun 2021 20:21:39 +0000 (22:21 +0200)]
add gcc8 version matching

3 years agocopy conf
User User-User [Sat, 19 Jun 2021 19:49:58 +0000 (21:49 +0200)]
copy conf

3 years agoWiP CORTEX A55 support
User User-User [Sat, 19 Jun 2021 19:37:51 +0000 (21:37 +0200)]
WiP CORTEX A55 support

3 years agoMerge pull request #3276 from martin-frbg/issue3274
Martin Kroeker [Wed, 16 Jun 2021 14:37:30 +0000 (16:37 +0200)]
Merge pull request #3276 from martin-frbg/issue3274

Add workaround for another macro name collision with Windows 10 SDK winnt.h

3 years agoMerge pull request #3275 from martin-frbg/lapack580
Martin Kroeker [Wed, 16 Jun 2021 11:41:38 +0000 (13:41 +0200)]
Merge pull request #3275 from martin-frbg/lapack580

Fix missing EXTERNAL declarations in LAPACK TESTING (LAPACK PR 580)

3 years agoFix typo
Martin Kroeker [Wed, 16 Jun 2021 11:41:16 +0000 (13:41 +0200)]
Fix typo

3 years agoWork around another recent macro name collision with winnt.h
Martin Kroeker [Wed, 16 Jun 2021 10:32:34 +0000 (12:32 +0200)]
Work around another recent macro name collision with winnt.h

3 years agoModify defines for CR and RC to work around name collision on Windows
Martin Kroeker [Wed, 16 Jun 2021 10:17:25 +0000 (12:17 +0200)]
Modify defines for CR and RC to work around name collision on Windows

3 years agoDeclare DZASUM as EXTERNAL
Martin Kroeker [Wed, 16 Jun 2021 07:43:39 +0000 (09:43 +0200)]
Declare DZASUM as EXTERNAL

3 years agoDeclare ZDROT as EXTERNAL
Martin Kroeker [Wed, 16 Jun 2021 07:41:18 +0000 (09:41 +0200)]
Declare ZDROT as EXTERNAL

3 years agoDeclare SROT as EXTERNAL
Martin Kroeker [Wed, 16 Jun 2021 07:40:15 +0000 (09:40 +0200)]
Declare SROT as EXTERNAL

3 years agoDeclare DROT as EXTERNAL
Martin Kroeker [Wed, 16 Jun 2021 07:39:28 +0000 (09:39 +0200)]
Declare DROT as EXTERNAL

3 years agoDeclare CSROT as EXTERNAL
Martin Kroeker [Wed, 16 Jun 2021 07:35:12 +0000 (09:35 +0200)]
Declare CSROT as EXTERNAL

3 years agoDeclare SCASUM as EXTERNAL
Martin Kroeker [Wed, 16 Jun 2021 07:33:23 +0000 (09:33 +0200)]
Declare SCASUM as EXTERNAL

3 years agoMerge pull request #3273 from austinpagan/sbgemm_gcc10_fix
Martin Kroeker [Tue, 15 Jun 2021 20:58:48 +0000 (22:58 +0200)]
Merge pull request #3273 from austinpagan/sbgemm_gcc10_fix

Power10: Fix for SBGEMM

3 years agoPower10: Fix for SBGEMM
Gordon Fossum [Tue, 15 Jun 2021 18:07:47 +0000 (13:07 -0500)]
Power10: Fix for SBGEMM

While testing bfloat16 sbgemm kernel, there are some failures for odd value inputs due to updating result for
additional bytes.

3 years agoMerge pull request #3252 from martin-frbg/more_shortcuts
Martin Kroeker [Tue, 15 Jun 2021 14:14:20 +0000 (16:14 +0200)]
Merge pull request #3252 from martin-frbg/more_shortcuts

Further shortcuts for (small) cases that do not need buffer allocation

3 years agoMerge pull request #3250 from martin-frbg/gemv-shortcut
Martin Kroeker [Tue, 15 Jun 2021 12:50:14 +0000 (14:50 +0200)]
Merge pull request #3250 from martin-frbg/gemv-shortcut

Add shortcut for small-size S/D GEMV_N with increments of one

3 years agoMerge pull request #3270 from ggouaillardet/topic/dznrm2_tx2
Martin Kroeker [Mon, 14 Jun 2021 11:00:33 +0000 (13:00 +0200)]
Merge pull request #3270 from ggouaillardet/topic/dznrm2_tx2

arm64: add the missing d9 register to the clobber list

3 years agoarm64: add the missing d9 register to the clobber list
Gilles Gouaillardet [Mon, 14 Jun 2021 08:01:28 +0000 (17:01 +0900)]
arm64: add the missing d9 register to the clobber list

Refs. numpy/numpy#18422

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
3 years agoMerge pull request #3266 from martin-frbg/powerparam
Martin Kroeker [Thu, 10 Jun 2021 16:05:47 +0000 (18:05 +0200)]
Merge pull request #3266 from martin-frbg/powerparam

Remove spurious casts from PPC parameters and fix compilation for older targets

3 years agoMerge pull request #3260 from intelmy/sgemv_t_opt
Martin Kroeker [Thu, 10 Jun 2021 14:08:24 +0000 (16:08 +0200)]
Merge pull request #3260 from intelmy/sgemv_t_opt

Optimized sgemv_t for small N based on AVX512

3 years agoMerge pull request #3264 from RajalakshmiSR/sbgemmp10
Martin Kroeker [Thu, 10 Jun 2021 14:07:47 +0000 (16:07 +0200)]
Merge pull request #3264 from RajalakshmiSR/sbgemmp10

POWER10: Fixes for sbgemm kernel

3 years agoAdd prefetch values for power3
Martin Kroeker [Thu, 10 Jun 2021 09:20:29 +0000 (11:20 +0200)]
Add prefetch values for power3

3 years agoAdd prefetch values for power3
Martin Kroeker [Thu, 10 Jun 2021 09:19:40 +0000 (11:19 +0200)]
Add prefetch values for power3

3 years agoAdd prefetch values for power3
Martin Kroeker [Thu, 10 Jun 2021 09:18:22 +0000 (11:18 +0200)]
Add prefetch values for power3

3 years agoAdd prefetch values for power3
Martin Kroeker [Thu, 10 Jun 2021 09:17:33 +0000 (11:17 +0200)]
Add prefetch values for power3

3 years agoFix caxpy/zaxpy for big-endian
Martin Kroeker [Thu, 10 Jun 2021 09:15:48 +0000 (11:15 +0200)]
Fix caxpy/zaxpy for big-endian

3 years agoFix inverted conditional for caxpy/zaxpy
Martin Kroeker [Thu, 10 Jun 2021 09:14:03 +0000 (11:14 +0200)]
Fix inverted conditional for caxpy/zaxpy

3 years agofix c/zrot and sgemv for POWER5
Martin Kroeker [Thu, 10 Jun 2021 09:11:56 +0000 (11:11 +0200)]
fix c/zrot and sgemv for POWER5

3 years agoRemove casts for PPC/POWER and complete parameters for POWER3/4
Martin Kroeker [Thu, 10 Jun 2021 09:09:50 +0000 (11:09 +0200)]
Remove casts for PPC/POWER and complete parameters for POWER3/4

3 years agoRemoved use of non portable '-p' arg to install
Arthur Williams [Thu, 10 Jun 2021 01:50:11 +0000 (20:50 -0500)]
Removed use of non portable '-p' arg to install

Not all versions of install support '-p' flag and it isn't worth failing
the build in the installed files' timestamps get updated.

3 years agoPOWER10: Fixes for sbgemm kernel
Rajalakshmi Srinivasaraghavan [Wed, 9 Jun 2021 17:20:09 +0000 (12:20 -0500)]
POWER10: Fixes for sbgemm kernel

While testing bfloat16 sbgemm kernel, there are some failures
for odd value inputs due to array access beyond the boundary.

3 years agoOptimized sgemv_t for small N based on AVX512
Ma, Yu [Tue, 8 Jun 2021 19:08:28 +0000 (15:08 -0400)]
Optimized sgemv_t for small N based on AVX512

3 years agoMerge pull request #3259 from zhaofengli/riscv64-fixes
Zhang Xianyi [Tue, 8 Jun 2021 08:26:56 +0000 (16:26 +0800)]
Merge pull request #3259 from zhaofengli/riscv64-fixes

riscv64 fixes