platform/upstream/openblas.git
2 years agoPrevent compiler attempts to use k0 as mask register
Martin Kroeker [Wed, 23 Feb 2022 19:12:20 +0000 (20:12 +0100)]
Prevent compiler attempts to use k0 as mask register

2 years agoFix non-portable u_int64_t
Martin Kroeker [Wed, 23 Feb 2022 19:10:59 +0000 (20:10 +0100)]
Fix non-portable u_int64_t

2 years agoMerge pull request #3537 from xianyi/release-0.3.0
Martin Kroeker [Mon, 21 Feb 2022 05:57:27 +0000 (06:57 +0100)]
Merge pull request #3537 from xianyi/release-0.3.0

Merge back from 0.3.20 release to copy tag

2 years agoUpdate version to 0.3.20
Martin Kroeker [Sun, 20 Feb 2022 21:35:05 +0000 (22:35 +0100)]
Update version to 0.3.20

2 years agoMerge pull request #3536 from xianyi/develop
Martin Kroeker [Sun, 20 Feb 2022 21:33:59 +0000 (22:33 +0100)]
Merge pull request #3536 from xianyi/develop

Update from develop for release 0.3.20

2 years agoMerge branch 'release-0.3.0' into develop
Martin Kroeker [Sun, 20 Feb 2022 21:33:45 +0000 (22:33 +0100)]
Merge branch 'release-0.3.0' into develop

2 years agoUpdate version to 0.3.20
Martin Kroeker [Sun, 20 Feb 2022 21:30:50 +0000 (22:30 +0100)]
Update version to 0.3.20

2 years agoMerge pull request #3535 from martin-frbg/0320changes
Martin Kroeker [Sun, 20 Feb 2022 21:21:02 +0000 (22:21 +0100)]
Merge pull request #3535 from martin-frbg/0320changes

Update with 0.3.20 changes

2 years agoUpdate with 0.3.20 changes
Martin Kroeker [Sun, 20 Feb 2022 21:16:04 +0000 (22:16 +0100)]
Update with 0.3.20 changes

2 years agoMerge pull request #3532 from martin-frbg/issue3528-2
Martin Kroeker [Fri, 11 Feb 2022 10:44:32 +0000 (11:44 +0100)]
Merge pull request #3532 from martin-frbg/issue3528-2

Fix building a shared library on Mac with flang-classic

2 years agokeep flang-classic on MacOS from trying to create an executable instead of a library
Martin Kroeker [Thu, 10 Feb 2022 22:04:45 +0000 (23:04 +0100)]
keep flang-classic on MacOS from trying to create an executable instead of a library

2 years agofilter out libflangmain as well
Martin Kroeker [Thu, 10 Feb 2022 22:03:05 +0000 (23:03 +0100)]
filter out libflangmain as well

2 years agoMerge pull request #3531 from martin-frbg/issue2973
Martin Kroeker [Thu, 10 Feb 2022 13:16:08 +0000 (14:16 +0100)]
Merge pull request #3531 from martin-frbg/issue2973

Add .NOTPARALLEL: to MATGEN Makefile as a workaround for builds on DFS

2 years agoAdd .NOTPARALLEL: as a workaround for builds on DFS
Martin Kroeker [Wed, 9 Feb 2022 21:09:25 +0000 (22:09 +0100)]
Add .NOTPARALLEL: as a workaround for builds on DFS

2 years agoMerge pull request #3527 from martin-frbg/issue3490
Martin Kroeker [Mon, 7 Feb 2022 07:14:11 +0000 (08:14 +0100)]
Merge pull request #3527 from martin-frbg/issue3490

Treat AVX512-enabled Alder Lake like Cooper Lake/Sapphire Rapids

2 years agoSupport AVX512-enabled Alder Lake
Martin Kroeker [Sun, 6 Feb 2022 23:00:56 +0000 (00:00 +0100)]
Support AVX512-enabled Alder Lake

2 years agoSupport AVX512-enabled AlderLake
Martin Kroeker [Sun, 6 Feb 2022 23:00:15 +0000 (00:00 +0100)]
Support AVX512-enabled AlderLake

2 years agoMerge pull request #3493 from martin-frbg/casts+cleanup
Martin Kroeker [Sun, 6 Feb 2022 22:55:06 +0000 (23:55 +0100)]
Merge pull request #3493 from martin-frbg/casts+cleanup

WIP casts and cleanups

2 years agoUpdate azure-pipelines.yml
Martin Kroeker [Sat, 5 Feb 2022 21:39:03 +0000 (22:39 +0100)]
Update azure-pipelines.yml

2 years agoMerge pull request #3524 from martin-frbg/lapack646
Martin Kroeker [Thu, 3 Feb 2022 21:31:23 +0000 (22:31 +0100)]
Merge pull request #3524 from martin-frbg/lapack646

Fix input argument check in ?GEQRT2 (from Reference-LAPACK PR 646)

2 years agoFix input argument check (LAPACK PR 646)
Martin Kroeker [Thu, 3 Feb 2022 10:43:17 +0000 (11:43 +0100)]
Fix input argument check (LAPACK PR 646)

2 years agoMerge pull request #3521 from martin-frbg/issue3520
Martin Kroeker [Fri, 28 Jan 2022 12:39:36 +0000 (13:39 +0100)]
Merge pull request #3521 from martin-frbg/issue3520

Add proper defaults for Sparc IMIN/IMAX

2 years agoMerge pull request #3522 from martin-frbg/issue3517
Martin Kroeker [Fri, 28 Jan 2022 09:36:57 +0000 (10:36 +0100)]
Merge pull request #3522 from martin-frbg/issue3517

Disable building C/Z SPMV,SPR,SYMV,SYR when NO_LAPACK=1

2 years agoExclude some complex (LAPACK) functions when NO_LAPACK is set
Martin Kroeker [Thu, 27 Jan 2022 21:02:08 +0000 (22:02 +0100)]
Exclude some complex (LAPACK) functions when NO_LAPACK is set

2 years agoExclude some complex drivers when NO_LAPACK is set
Martin Kroeker [Thu, 27 Jan 2022 21:00:39 +0000 (22:00 +0100)]
Exclude some complex drivers when NO_LAPACK is set

2 years agoAdd proper defaults for IMIN/IMAX
Martin Kroeker [Thu, 27 Jan 2022 18:56:32 +0000 (19:56 +0100)]
Add proper defaults for IMIN/IMAX

2 years agoMerge pull request #3518 from martin-frbg/elbrus
Martin Kroeker [Tue, 25 Jan 2022 19:57:59 +0000 (20:57 +0100)]
Merge pull request #3518 from martin-frbg/elbrus

Add basic support for the (mostly x86_64 compatible) Elbrus E2000 architecture

2 years agoMerge pull request #3516 from mmuetzel/no-fortran
Martin Kroeker [Tue, 25 Jan 2022 19:57:38 +0000 (20:57 +0100)]
Merge pull request #3516 from mmuetzel/no-fortran

cmake: Check if Fortran compiler is usable before enabling it.

2 years agoUpdate CONTRIBUTORS.md
Martin Kroeker [Sat, 22 Jan 2022 18:09:00 +0000 (19:09 +0100)]
Update CONTRIBUTORS.md

2 years agoUpdate CONTRIBUTORS.md
Martin Kroeker [Sat, 22 Jan 2022 18:02:57 +0000 (19:02 +0100)]
Update CONTRIBUTORS.md

2 years agoAdd default KERNEL file for Elbrus E2K arch
Martin Kroeker [Sat, 22 Jan 2022 17:59:36 +0000 (18:59 +0100)]
Add default KERNEL file for Elbrus E2K arch

2 years agoCreate Makefile
Martin Kroeker [Sat, 22 Jan 2022 17:57:28 +0000 (18:57 +0100)]
Create Makefile

2 years agoAdd Elbrus e2k architecture support
Martin Kroeker [Sat, 22 Jan 2022 17:55:10 +0000 (18:55 +0100)]
Add Elbrus e2k architecture support

2 years agoAdd Elbrus E2000 architecture as generic x86_64 compatible
Martin Kroeker [Sat, 22 Jan 2022 17:53:38 +0000 (18:53 +0100)]
Add Elbrus E2000 architecture as generic x86_64 compatible

2 years agoAdd Elbrus e2k architecture detection
Martin Kroeker [Sat, 22 Jan 2022 17:27:38 +0000 (18:27 +0100)]
Add Elbrus e2k architecture detection

2 years agocmake: Check if Fortran compiler is usable before enabling it.
Markus Mützel [Fri, 21 Jan 2022 12:27:17 +0000 (13:27 +0100)]
cmake: Check if Fortran compiler is usable before enabling it.

2 years agoMerge pull request #3492 from binebrank/arm_sve_zgemm
Martin Kroeker [Tue, 18 Jan 2022 20:36:33 +0000 (21:36 +0100)]
Merge pull request #3492 from binebrank/arm_sve_zgemm

SVE zgemm&cgemm (and other BLAS 3 complex)

2 years agoupdate armv8sve + contributors
Bine Brank [Tue, 18 Jan 2022 07:28:31 +0000 (08:28 +0100)]
update armv8sve + contributors

2 years agoadapt CMake
Bine Brank [Mon, 17 Jan 2022 21:36:48 +0000 (22:36 +0100)]
adapt CMake

2 years agoMerge pull request #3514 from martin-frbg/issue3513
Martin Kroeker [Mon, 17 Jan 2022 18:22:18 +0000 (19:22 +0100)]
Merge pull request #3514 from martin-frbg/issue3513

Fix ?LASWP pivot index calculation for negative increments other than -1

2 years agoFix pivot offset calculation for negative incx
Martin Kroeker [Sun, 16 Jan 2022 23:11:18 +0000 (00:11 +0100)]
Fix pivot offset calculation for negative incx

2 years agoFix pivot offset calculation for negative incx
Martin Kroeker [Sun, 16 Jan 2022 23:10:21 +0000 (00:10 +0100)]
Fix pivot offset calculation for negative incx

2 years agoFix offset calculation for negative incx
Martin Kroeker [Sun, 16 Jan 2022 23:09:14 +0000 (00:09 +0100)]
Fix offset calculation for negative incx

2 years agoFix pivot offset calculation for negative incx
Martin Kroeker [Sun, 16 Jan 2022 23:08:20 +0000 (00:08 +0100)]
Fix pivot offset calculation for negative incx

2 years agoFix pivot offset calculation for negative incx
Martin Kroeker [Sun, 16 Jan 2022 23:07:33 +0000 (00:07 +0100)]
Fix pivot offset calculation for negative incx

2 years agoFix pivot offset calculation for negative incx
Martin Kroeker [Sun, 16 Jan 2022 23:06:41 +0000 (00:06 +0100)]
Fix pivot offset calculation for negative incx

2 years agoFix pivot index for negative increments
Martin Kroeker [Sun, 16 Jan 2022 23:05:33 +0000 (00:05 +0100)]
Fix pivot index for negative increments

2 years agoadapt Makefile for SVE trsm
Bine Brank [Sun, 16 Jan 2022 20:40:56 +0000 (21:40 +0100)]
adapt Makefile for SVE trsm

2 years agofix ztrsm lt/ut copy
Bine Brank [Sun, 16 Jan 2022 20:39:57 +0000 (21:39 +0100)]
fix ztrsm lt/ut copy

2 years agoadd sve ztrsm
Bine Brank [Sat, 15 Jan 2022 21:27:25 +0000 (22:27 +0100)]
add sve ztrsm

2 years agofix sve dtrsm kernels
Bine Brank [Sat, 15 Jan 2022 20:02:14 +0000 (21:02 +0100)]
fix sve dtrsm kernels

2 years agoadd remaining sve trsm copy kernels
Bine Brank [Tue, 11 Jan 2022 20:16:38 +0000 (21:16 +0100)]
add remaining sve trsm copy kernels

2 years agotrsm_lncopy_sve
Bine Brank [Mon, 10 Jan 2022 20:45:37 +0000 (21:45 +0100)]
trsm_lncopy_sve

2 years agosve trsmRN and trsmRT
Bine Brank [Mon, 10 Jan 2022 19:42:20 +0000 (20:42 +0100)]
sve trsmRN and trsmRT

2 years agoMerge pull request #3511 from martin-frbg/cmakeutils
Martin Kroeker [Mon, 10 Jan 2022 08:12:52 +0000 (09:12 +0100)]
Merge pull request #3511 from martin-frbg/cmakeutils

Fix handling of ifdef/ifndef in CMAKE

2 years agoFix handling of ifdef/ifndef
Martin Kroeker [Sun, 9 Jan 2022 22:31:59 +0000 (23:31 +0100)]
Fix handling of ifdef/ifndef

2 years agoadd trsm_kernel_LT_sve
Bine Brank [Sun, 9 Jan 2022 19:11:47 +0000 (20:11 +0100)]
add trsm_kernel_LT_sve

2 years agosve trsm_kernel_LN
Bine Brank [Sun, 9 Jan 2022 18:40:04 +0000 (19:40 +0100)]
sve trsm_kernel_LN

2 years agoMerge pull request #3510 from martin-frbg/issue3505
Martin Kroeker [Sun, 9 Jan 2022 13:50:51 +0000 (14:50 +0100)]
Merge pull request #3510 from martin-frbg/issue3505

Fix recent SkylakeX/DYNAMIC_ARCH DGEMM breakage

2 years agoMerge pull request #3508 from snadampal/v1_n2
Martin Kroeker [Sun, 9 Jan 2022 13:50:26 +0000 (14:50 +0100)]
Merge pull request #3508 from snadampal/v1_n2

OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics

2 years agomake DYNAMIC_ARCH option available to getarch_2nd/param.h
Martin Kroeker [Sat, 8 Jan 2022 22:50:34 +0000 (23:50 +0100)]
make DYNAMIC_ARCH option available to getarch_2nd/param.h

2 years agoForward DYNAMIC_ARCH option to Makefile.prebuild
Martin Kroeker [Sat, 8 Jan 2022 22:48:58 +0000 (23:48 +0100)]
Forward DYNAMIC_ARCH option to Makefile.prebuild

2 years agoSkylakeX: match parameters to dgemm kernels for dyn/non-dyn
Martin Kroeker [Sat, 8 Jan 2022 22:48:13 +0000 (23:48 +0100)]
SkylakeX: match parameters to dgemm kernels for dyn/non-dyn

2 years agoOpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics
Sunita Nadampalli [Fri, 7 Jan 2022 00:28:17 +0000 (00:28 +0000)]
OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics

2 years agoupdate contributors
Bine Brank [Thu, 6 Jan 2022 09:29:35 +0000 (10:29 +0100)]
update contributors

2 years agofix makefile.L3
Bine Brank [Thu, 6 Jan 2022 09:26:11 +0000 (10:26 +0100)]
fix makefile.L3

2 years agocombine zchemm into single file
Bine Brank [Wed, 5 Jan 2022 13:42:37 +0000 (14:42 +0100)]
combine zchemm into single file

2 years agoadapt CMake for SVE
Bine Brank [Wed, 5 Jan 2022 13:36:39 +0000 (14:36 +0100)]
adapt CMake for SVE

2 years agosve copy functions for cgemm chemm zsymm
Bine Brank [Wed, 5 Jan 2022 08:12:22 +0000 (09:12 +0100)]
sve copy functions for cgemm chemm zsymm

2 years agoadd cgemm ctrmm sve kernels
Bine Brank [Wed, 5 Jan 2022 08:09:18 +0000 (09:09 +0100)]
add cgemm ctrmm sve kernels

2 years agomodify sve zgemmcopy kernels
Bine Brank [Wed, 5 Jan 2022 08:07:28 +0000 (09:07 +0100)]
modify sve zgemmcopy kernels

2 years agoupdate configuration of kernels for A64FX and ARMV8SVE
Bine Brank [Wed, 5 Jan 2022 08:00:54 +0000 (09:00 +0100)]
update configuration of kernels for A64FX and ARMV8SVE

2 years agoconfigure Makefile for sve
Bine Brank [Wed, 5 Jan 2022 07:57:51 +0000 (08:57 +0100)]
configure Makefile for sve

2 years agofix sve ztrmm kernel
Bine Brank [Tue, 4 Jan 2022 13:42:07 +0000 (14:42 +0100)]
fix sve ztrmm kernel

2 years agoztrmm sve copy functions
Bine Brank [Tue, 4 Jan 2022 13:40:59 +0000 (14:40 +0100)]
ztrmm sve copy functions

2 years agoadd sve zhemm copy routines
Bine Brank [Mon, 3 Jan 2022 14:56:05 +0000 (15:56 +0100)]
add sve zhemm copy routines

2 years agoadd sve ztrmm
Bine Brank [Sun, 2 Jan 2022 18:15:33 +0000 (19:15 +0100)]
add sve ztrmm

2 years agoMerge pull request #3502 from jgillis/develop
Martin Kroeker [Sat, 1 Jan 2022 11:12:32 +0000 (12:12 +0100)]
Merge pull request #3502 from jgillis/develop

Fix cmake crosscompilation for core2 target

2 years agoMerge pull request #3504 from martin-frbg/issue3503
Martin Kroeker [Sat, 1 Jan 2022 10:43:17 +0000 (11:43 +0100)]
Merge pull request #3504 from martin-frbg/issue3503

Guard against omp_get_num_places returning zero

2 years agoGuard against omp_get_num_places returning zero
Martin Kroeker [Fri, 31 Dec 2021 23:46:23 +0000 (00:46 +0100)]
Guard against omp_get_num_places returning zero

2 years agoztrmm sve copy kernels
Bine Brank [Thu, 30 Dec 2021 20:00:16 +0000 (21:00 +0100)]
ztrmm sve copy kernels

2 years agoFix cmake crosscompilation for core2 target
jgillis [Wed, 29 Dec 2021 21:50:20 +0000 (22:50 +0100)]
Fix cmake crosscompilation for core2 target

Missing HAVE_SSE* cmake variables cause cc.cmake to forget about `-msse*` flags

2 years agofix zgemm kernel
Bine Brank [Wed, 29 Dec 2021 10:42:04 +0000 (11:42 +0100)]
fix zgemm kernel

2 years agoMerge pull request #3500 from martin-frbg/osx_dyn_xerbla
Martin Kroeker [Tue, 28 Dec 2021 21:54:27 +0000 (22:54 +0100)]
Merge pull request #3500 from martin-frbg/osx_dyn_xerbla

Ensure that the right xerbla gets included in OSX DYNAMIC_ARCH builds

2 years agoEnsure that the right xerbla gets included in OSX DYNAMIC_ARCH builds
Martin Kroeker [Tue, 28 Dec 2021 18:06:55 +0000 (19:06 +0100)]
Ensure that the right xerbla gets included in OSX DYNAMIC_ARCH builds

2 years agoMerge pull request #3496 from yuanhec/develop
Martin Kroeker [Tue, 28 Dec 2021 17:51:56 +0000 (18:51 +0100)]
Merge pull request #3496 from yuanhec/develop

Fixed MSA enabled optimization on Loongson-3A4000

2 years agoMerge remote-tracking branch 'upstream/develop' into develop
yuanhecai [Mon, 27 Dec 2021 01:50:57 +0000 (09:50 +0800)]
Merge remote-tracking branch 'upstream/develop' into develop

2 years agozgemm sve copy routines
Bine Brank [Sun, 26 Dec 2021 16:05:46 +0000 (17:05 +0100)]
zgemm sve copy routines

2 years agosve zgemm kernel
Bine Brank [Sun, 26 Dec 2021 07:44:05 +0000 (08:44 +0100)]
sve zgemm kernel

2 years agoadded macros for sve zgemm kernels
Bine Brank [Sat, 25 Dec 2021 10:46:41 +0000 (11:46 +0100)]
added macros for sve zgemm kernels

2 years agofix function typecast
Martin Kroeker [Fri, 24 Dec 2021 19:01:52 +0000 (20:01 +0100)]
fix function typecast

2 years agofix function typecast
Martin Kroeker [Fri, 24 Dec 2021 19:00:50 +0000 (20:00 +0100)]
fix function typecast

2 years agoFixed MSA enabled optimization on Loongson-3A4000
yuanhecai [Thu, 23 Dec 2021 12:04:27 +0000 (20:04 +0800)]
Fixed MSA enabled optimization on Loongson-3A4000

2 years agoMerge pull request #3491 from gxw-loongson/develop
Martin Kroeker [Wed, 22 Dec 2021 07:34:12 +0000 (08:34 +0100)]
Merge pull request #3491 from gxw-loongson/develop

loongarch64: Optimize dgemm_kernel

2 years agoclean up unused variables and unreachable statements
Martin Kroeker [Tue, 21 Dec 2021 17:53:55 +0000 (18:53 +0100)]
clean up unused variables and unreachable statements

2 years agofix function typecasts
Martin Kroeker [Tue, 21 Dec 2021 17:51:28 +0000 (18:51 +0100)]
fix function typecasts

2 years agofix function typecasts
Martin Kroeker [Tue, 21 Dec 2021 17:50:22 +0000 (18:50 +0100)]
fix function typecasts

2 years agofix function typecast
Martin Kroeker [Tue, 21 Dec 2021 17:49:18 +0000 (18:49 +0100)]
fix function typecast

2 years agofix function typecasts
Martin Kroeker [Tue, 21 Dec 2021 17:47:35 +0000 (18:47 +0100)]
fix function typecasts

2 years agofix function typecasts
Martin Kroeker [Tue, 21 Dec 2021 17:45:28 +0000 (18:45 +0100)]
fix function typecasts