projects
/
platform
/
upstream
/
openblas.git
/ shortlog
commit
grep
author
committer
pickaxe
?
search:
re
summary
| shortlog |
log
|
commit
|
commitdiff
|
tree
first ⋅ prev ⋅
next
platform/upstream/openblas.git
2021-12-05
Bine Brank
sgemm v2x8 SVE kernel
commit
|
commitdiff
|
tree
|
snapshot
2021-12-05
Bine Brank
strmm sve v1x8 kernel
commit
|
commitdiff
|
tree
|
snapshot
2021-11-29
Bine Brank
trmm sve copy fucntions for single precision
commit
|
commitdiff
|
tree
|
snapshot
2021-11-28
Bine Brank
add sgemm kernel and copy functions for sgemm and ssymm
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
Merge pull request #3425 from binebrank/arm_sve_dgemm
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
Merge pull request #3459 from rafaelcfsousa/fix_cmake
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
Merge pull request #3462 from martin-frbg/azure-alpine2
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
Update alpine-chroot-install again
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Bine Brank
update CONTRIBUTORS.md
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Bine Brank
Adapt CMake for SVE
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
Merge pull request #3457 from wjc404/optimize-A53-dgemm
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
Merge pull request #3456 from martin-frbg/issue3444
commit
|
commitdiff
|
tree
|
snapshot
2021-11-26
Martin Kroeker
AzureCI: Fetch alpine-chroot-install from master to...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-25
Jia-Chen
MOD: add comments to a53 zgemm kernel
commit
|
commitdiff
|
tree
|
snapshot
2021-11-25
Rafael Cardoso...
Modify the order that cmake set the KERNEL variables...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-24
Rafael Cardoso...
Fix the cmake parser to identify more patterns
commit
|
commitdiff
|
tree
|
snapshot
2021-11-24
Jia-Chen
MOD: optimize zgemm on cortex-A53/cortex-A55
commit
|
commitdiff
|
tree
|
snapshot
2021-11-23
Bine Brank
reduced dgemm_unroll_m to work with 128-bit sve
commit
|
commitdiff
|
tree
|
snapshot
2021-11-22
Bine Brank
removed unused code (compiler warnings)
commit
|
commitdiff
|
tree
|
snapshot
2021-11-22
Bine Brank
modify Makefile for SVE copy
commit
|
commitdiff
|
tree
|
snapshot
2021-11-21
Bine Brank
configure SVE Makefile
commit
|
commitdiff
|
tree
|
snapshot
2021-11-21
Bine Brank
some clean-up & commentary
commit
|
commitdiff
|
tree
|
snapshot
2021-11-20
Martin Kroeker
Fix unintended reversion of recent CortexA53 changes
commit
|
commitdiff
|
tree
|
snapshot
2021-11-20
Martin Kroeker
Add CMAKE support for cross-compiling to MIPS32
commit
|
commitdiff
|
tree
|
snapshot
2021-11-20
Martin Kroeker
Add generic mips32 target
commit
|
commitdiff
|
tree
|
snapshot
2021-11-20
Martin Kroeker
Add generic MIPS32 target
commit
|
commitdiff
|
tree
|
snapshot
2021-11-20
Bine Brank
symm SVE copy rutines
commit
|
commitdiff
|
tree
|
snapshot
2021-11-18
Martin Kroeker
Merge pull request #3451 from wjc404/optimize-A53-dgemm
commit
|
commitdiff
|
tree
|
snapshot
2021-11-18
Jia-Chen
MOD: optimize normal DGEMM on ARMV8 cortex-A53 & cortex-A55
commit
|
commitdiff
|
tree
|
snapshot
2021-11-16
Martin Kroeker
Merge pull request #3450 from mmuetzel/suffix-nofortran
commit
|
commitdiff
|
tree
|
snapshot
2021-11-15
Markus Mützel
cmake: Set SUFFIX64 also for NOFORTRAN
commit
|
commitdiff
|
tree
|
snapshot
2021-11-14
Bine Brank
add remaining trmm copy rutines for SVE
commit
|
commitdiff
|
tree
|
snapshot
2021-11-14
Martin Kroeker
Merge pull request #3449 from martin-frbg/mips_msa
commit
|
commitdiff
|
tree
|
snapshot
2021-11-13
Martin Kroeker
Ignore compiler support for MIPS MSA if the cpu lacks...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-13
Martin Kroeker
MIPS P5600 and 24KC,1004K cpus do not support MSA
commit
|
commitdiff
|
tree
|
snapshot
2021-11-13
Martin Kroeker
get MSA capability from feature flags
commit
|
commitdiff
|
tree
|
snapshot
2021-11-13
Bine Brank
dtrmm_utcopy sve function
commit
|
commitdiff
|
tree
|
snapshot
2021-11-11
Martin Kroeker
Merge pull request #3447 from martin-frbg/issue3446
commit
|
commitdiff
|
tree
|
snapshot
2021-11-10
Martin Kroeker
Fix potentially wrong HOSTARCH definition in cross...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-07
Bine Brank
add v2x8 kernel + fix sve dtrmm
commit
|
commitdiff
|
tree
|
snapshot
2021-11-05
Martin Kroeker
Merge pull request #3443 from martin-frbg/issue3441
commit
|
commitdiff
|
tree
|
snapshot
2021-11-05
Martin Kroeker
Fix NULL pointer checks in blas_memory_alloc
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Merge pull request #3431 from MehdiChinoune/export...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Merge pull request #3442 from martin-frbg/cpuid_x86
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Add CPUIDs for Alder Lake and other recent Intel cpus
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Add CPUIDs for Alder Lake and some other recent Intel...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Merge pull request #3429 from martin-frbg/issue3428
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Merge pull request #3440 from mhillenbrand/fix_gemv_indices
commit
|
commitdiff
|
tree
|
snapshot
2021-11-04
Martin Kroeker
Fix miscounting of threadpool size on Linux with OMP_PR...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-03
Marius Hillenbrand
Fix flipped indices in benchmark for gemv
commit
|
commitdiff
|
tree
|
snapshot
2021-11-01
Bine Brank
add ARMV8SVE target
commit
|
commitdiff
|
tree
|
snapshot
2021-11-01
Martin Kroeker
Merge pull request #3427 from mhillenbrand/zarch-detect...
commit
|
commitdiff
|
tree
|
snapshot
2021-11-01
Martin Kroeker
Merge pull request #3434 from gxw-loongson/develop
commit
|
commitdiff
|
tree
|
snapshot
2021-11-01
gxw
Add cblas_{c/z}srot cblas_{c/z}rotg support
commit
|
commitdiff
|
tree
|
snapshot
2021-10-31
Bine Brank
fix sve dgemm kernel + sve dtrmm
commit
|
commitdiff
|
tree
|
snapshot
2021-10-30
Martin Kroeker
Fix nvidia HPC version checks
commit
|
commitdiff
|
tree
|
snapshot
2021-10-30
Bine Brank
added SVE ncopy and tcopy
commit
|
commitdiff
|
tree
|
snapshot
2021-10-30
Mehdi Chinoune
Fix exported OpenBLASTargets.cmake
commit
|
commitdiff
|
tree
|
snapshot
2021-10-29
Martin Kroeker
Adjust compiler options for nvidia hpc 21.9 (and fix...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-28
Marius Hillenbrand
cpuid_zarch/hwcaps: add documentation and dump hwcaps...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-28
Martin Kroeker
Merge pull request #3426 from martin-frbg/pr3424
commit
|
commitdiff
|
tree
|
snapshot
2021-10-27
Martin Kroeker
Add model number for Tiger Lake H (mobile variant)
commit
|
commitdiff
|
tree
|
snapshot
2021-10-27
Bine Brank
add sve dgemm prototype
commit
|
commitdiff
|
tree
|
snapshot
2021-10-27
Martin Kroeker
Merge pull request #3424 from Neutron3529/patch-1
commit
|
commitdiff
|
tree
|
snapshot
2021-10-27
Martin Kroeker
Merge pull request #3423 from mhillenbrand/fix-static...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-27
Neutron3529
auto-detect for Intel i7-11800H
commit
|
commitdiff
|
tree
|
snapshot
2021-10-26
Marius Hillenbrand
s390x: use DYNAMIC_ARCH's cpu detection for compile...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-25
Martin Kroeker
Merge pull request #3422 from martin-frbg/issue3421
commit
|
commitdiff
|
tree
|
snapshot
2021-10-24
Martin Kroeker
Revert #3252
commit
|
commitdiff
|
tree
|
snapshot
2021-10-20
Martin Kroeker
Merge pull request #3420 from martin-frbg/issue3419
commit
|
commitdiff
|
tree
|
snapshot
2021-10-20
Martin Kroeker
Remove dangerous optimization from previous #3252 ...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-20
Martin Kroeker
Merge pull request #3418 from martin-frbg/issue2927-2
commit
|
commitdiff
|
tree
|
snapshot
2021-10-19
Martin Kroeker
Enable SVE for A64FX
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Martin Kroeker
Add basic support for the Fujitsu A64FX (#3415)
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Martin Kroeker
Merge pull request #3416 from guowangy/spr-bf16
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: disable small matrix path by default
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: implement otcopy_16
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: reuse ncopy_16 from cooperlake as incopy
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: optimization for tmp_c buffer
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: kernel handle alpha != 1.0
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: oncopy: use tile load/store instead
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: only load A once in tail_k handling
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: process k2 and odd k at the same time
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: enlarge P to 256 for performance
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: oncopy: avoid handling too much pointer...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: reduce tile conf loading by seperate tail...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: tuning for blocking params
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: kernel works for NN case when alpha is 1.0
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: kernel works for m32 in NN case
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: implement oncopy_16
commit
|
commitdiff
|
tree
|
snapshot
2021-10-18
Wangyang Guo
sbgemm: spr: add dummy source files
commit
|
commitdiff
|
tree
|
snapshot
2021-10-17
Martin Kroeker
Add march/mtune flags for clang builds on ARM64 as...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-17
Martin Kroeker
Merge pull request #3404 from guowangy/spr-build
commit
|
commitdiff
|
tree
|
snapshot
2021-10-17
Martin Kroeker
Merge pull request #3413 from MehdiChinoune/cmake-readi...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-17
Mehdi Chinoune
[NFC] Improve CMakeLists.txt file readibility
commit
|
commitdiff
|
tree
|
snapshot
2021-10-17
Martin Kroeker
Merge pull request #3411 from MehdiChinoune/both_shared...
commit
|
commitdiff
|
tree
|
snapshot
2021-10-16
Mehdi Chinoune
Support building both static and shared libraries
commit
|
commitdiff
|
tree
|
snapshot
2021-10-16
Martin Kroeker
Merge pull request #3410 from MehdiChinoune/mingw-clang-64
commit
|
commitdiff
|
tree
|
snapshot
2021-10-16
مهدي شينون...
Fix MinGW/Clang 64 bits detection.
commit
|
commitdiff
|
tree
|
snapshot
2021-10-12
Wangyang Guo
Fix build error in legacy gcc
commit
|
commitdiff
|
tree
|
snapshot
next