Optimize cdot function for POWER10
[platform/upstream/openblas.git] / kernel /
2021-01-15 | Rajalakshmi Sriniv... | Optimize cdot function for POWER10
2021-01-14 | Martin Kroeker | Merge pull request #3067 from albertziegenhagel/fix...
2021-01-14 | Martin Kroeker | Merge pull request #3064 from martin-frbg/issue3063
2021-01-14 | Martin Kroeker | Merge pull request #3066 from martin-frbg/buffsizefix
2021-01-14 | Martin Kroeker | Merge pull request #3062 from austinpagan/GemmPreferedSize3
2021-01-14 | Martin Kroeker | Merge pull request #3061 from martin-frbg/arm64-pgi
2021-01-14 | Martin Kroeker | Merge pull request #3051 from martin-frbg/rocketlake
2021-01-14 | Albert Ziegenhagel | Fix building "generic" TRMM kernel with CMake
2021-01-12 | Martin Kroeker | Add workaround for NVIDIA HPC
2021-01-12 | Martin Kroeker | Add workaround for NVIDIA HPC
2021-01-12 | Martin Kroeker | Add workaround for NVIDIA HPC
2021-01-12 | Martin Kroeker | Add workaround for NVIDIA HPC mishandling of the asm...
2021-01-12 | Martin Kroeker | Add workaround for NVIDIA HPC mishandling of the asm...
2021-01-12 | Martin Kroeker | Support NVIDIA HPC compiler
2021-01-10 | Martin Kroeker | Merge pull request #7 from xianyi/develop
2021-01-08 | Martin Kroeker | Merge pull request #3055 from RajalakshmiSR/swapp10
2021-01-08 | Rajalakshmi Sriniv... | Optimize swap function for POWER10
2021-01-01 | Martin Kroeker | Merge pull request #3052 from ashwinyes/arm64_fix_nrm2
2021-01-01 | Ashwin Sekhar T K | arm64: Fix nrm2 for input vectors with Inf
2020-12-27 | Martin Kroeker | Merge pull request #6 from xianyi/develop
2020-12-27 | Martin Kroeker | Merge pull request #3035 from Joshua-Ashton/patch-1
2020-12-21 | Martin Kroeker | Merge pull request #3048 from martin-frbg/issue2998
2020-12-21 | Martin Kroeker | Temporarily revert to the old nrm2 kernels
2020-12-21 | Martin Kroeker | Temporarily revert to the old nrm2 kernels
2020-12-21 | Martin Kroeker | Temporarily revert to the old nrm2 kernel
2020-12-19 | Martin Kroeker | Merge pull request #3045 from martin-frbg/nvidiasdk
2020-12-19 | Martin Kroeker | Disable FMA intrinsics in the srot kernel when the...
2020-12-19 | Martin Kroeker | Amend SkylakeX options to support the NVIDIA compiler
2020-12-19 | Martin Kroeker | Merge pull request #3042 from martin-frbg/develop
2020-12-17 | Martin Kroeker | Conditionally add -mfma to compiler options where needed
2020-12-13 | Martin Kroeker | Merge pull request #4 from xianyi/develop
2020-12-13 | Martin Kroeker | Merge pull request #3036 from RajalakshmiSR/p10copyalign
2020-12-13 | Rajalakshmi Sriniv... | POWER10: Improve copy performance
2020-12-12 | Martin Kroeker | Merge pull request #3033 from xianyi/develop
2020-12-11 | Martin Kroeker | Merge pull request #3 from xianyi/develop
2020-12-11 | Martin Kroeker | Merge pull request #2994 from antonblanchard/power10...
2020-12-10 | Martin Kroeker | Merge pull request #3029 from RajalakshmiSR/axpyp10
2020-12-10 | Martin Kroeker | Merge pull request #3021 from austinpagan/trsm_p10
2020-12-10 | Rajalakshmi Sriniv... | POWER10: Improve axpy performance
2020-12-10 | Martin Kroeker | Merge pull request #3026 from martin-frbg/revert747
2020-12-10 | Martin Kroeker | Merge pull request #3027 from gxw-loongson/develop
2020-12-09 | gxw | Add msa support for loongson
2020-12-08 | Martin Kroeker | Merge pull request #2 from xianyi/develop
2020-12-08 | Martin Kroeker | Merge pull request #3025 from TiredNotTear/develop
2020-12-07 | Martin Kroeker | Merge pull request #3022 from jinboson/develop
2020-12-07 | Hao Chen | Fix failed cgemv and zgemv test case after using msa...
2020-12-07 | Hao Chen | Fix failed sswap and dswap case by using msa optimization
2020-12-06 | Martin Kroeker | Merge pull request #3024 from martin-frbg/sparc
2020-12-06 | Martin Kroeker | Work around DOT and SWAP test failures
2020-12-06 | Martin Kroeker | Fix compilation with SolarisStudio
2020-12-06 | Martin Kroeker | Merge pull request #1 from xianyi/develop
2020-12-05 | Jin Bo | Fix test errors reported by cblas_cgemm & cblas_ctrmm
2020-12-04 | Gordon Fossum | Added special unrolled vectorized versions of "Solve...
2020-12-04 | Martin Kroeker | Merge pull request #3018 from martin-frbg/issue3015
2020-12-04 | Martin Kroeker | Merge pull request #3016 from xiegengxin/complex-asum
2020-12-04 | Martin Kroeker | Merge pull request #3013 from martin-frbg/gcc46
2020-12-04 | Martin Kroeker | Merge pull request #3011 from cyyever/fix_link
2020-12-02 | Gengxin Xie | fix error declare function blas_level1_thread_with_retu...
2020-12-01 | Gengxin Xie | Improve the performance of zasum and casum with AVX512...
2020-11-30 | Martin Kroeker | Merge pull request #3014 from RajalakshmiSR/dgemvnp10
2020-11-29 | Rajalakshmi Sriniv... | POWER10: Optimize dgemv_n
2020-11-22 | Martin Kroeker | Merge pull request #112 from xianyi/develop
2020-11-22 | Martin Kroeker | Merge pull request #2965 from epsilon-0/develop
2020-11-22 | Martin Kroeker | Merge pull request #2988 from xiegengxin/smp-asum
2020-11-22 | Martin Kroeker | Merge pull request #2997 from Flamefire/reproduce_crash
2020-11-22 | Xianyi Zhang | Merge branch 'risc-v' into develop
2020-11-22 | Xianyi Zhang | Merge branch 'develop' into risc-v
2020-11-16 | Martin Kroeker | Merge pull request #2981 from Qiyu8/fix-sum
2020-11-16 | Martin Kroeker | Merge pull request #2983 from Qiyu8/optimize-srot
2020-11-13 | Martin Kroeker | Merge pull request #111 from xianyi/develop
2020-11-13 | Gengxin Xie | Improve the performance of dasum and sasum when SMP...
2020-11-13 | Qiyu8 | modify system.cmake to enable fma flag
2020-11-12 | Qiyu8 | fix the CI failure of target specific option mismatch
2020-11-12 | Qiyu8 | fix the CI failure of lack the head
2020-11-11 | Qiyu8 | modify macro
2020-11-11 | Qiyu8 | only FMA3 and vector larger than 128 have positive...
2020-11-11 | Qiyu8 | Optimize the performance of rot by using universal...
2020-11-10 | Qiyu8 | fix sum optimize issues
2020-11-10 | Xianyi Zhang | Refs #2899. Merge branch 'damonyu1989-openblas-open...
2020-11-10 | Xianyi Zhang | Refs #2899
2020-11-10 | Xianyi Zhang | Merge branch 'develop' into risc-v
2020-11-08 | Martin Kroeker | Merge pull request #2972 from xiegengxin/rot-intrinsic
2020-11-08 | Martin Kroeker | Merge pull request #2980 from martin-frbg/fixgetarch
2020-11-08 | Martin Kroeker | Merge pull request #2979 from RajalakshmiSR/dot_power10
2020-11-08 | Martin Kroeker | Merge pull request #2978 from martin-frbg/fixdynfeatures
2020-11-07 | Rajalakshmi Sriniv... | Optimize sdot/ddot for POWER10
2020-11-07 | Martin Kroeker | Remove previous workaround for compiler flags related...
2020-11-07 | Martin Kroeker | Merge pull request #110 from xianyi/develop
2020-11-07 | Martin Kroeker | Merge pull request #2977 from martin-frbg/issue2976
2020-11-07 | Martin Kroeker | Fix macro name used in ifdef
2020-11-05 | Gengxin Xie | fix typo
2020-11-05 | Gengxin Xie | Improve the performance of rot by using AVX512 and...
2020-11-04 | Martin Kroeker | Merge pull request #2966 from martin-frbg/issue2964
2020-11-02 | Martin Kroeker | Merge pull request #2967 from RajalakshmiSR/dgemm88
2020-11-01 | Martin Kroeker | Merge pull request #2962 from brada4/develop
2020-10-31 | Rajalakshmi Sriniv... | POWER10: Change dgemm unroll factors
2020-10-31 | Martin Kroeker | Merge pull request #109 from xianyi/develop
2020-10-31 | Martin Kroeker | Merge pull request #2960 from thrasibule/avx2_detection
2020-10-30 | Martin Kroeker | Merge pull request #2956 from RajalakshmiSR/caxpy_p10
2020-10-29 | Rajalakshmi Sriniv... | Optimize caxpy for POWER10