Optimize cdot function for POWER10
[platform/upstream/openblas.git] / kernel /
2021-01-15 Rajalakshmi Sriniv... Optimize cdot function for POWER10
2021-01-14 Martin KroekerMerge pull request #3067 from albertziegenhagel/fix...
2021-01-14 Martin KroekerMerge pull request #3064 from martin-frbg/issue3063
2021-01-14 Martin KroekerMerge pull request #3066 from martin-frbg/buffsizefix
2021-01-14 Martin KroekerMerge pull request #3062 from austinpagan/GemmPreferedSize3
2021-01-14 Martin KroekerMerge pull request #3061 from martin-frbg/arm64-pgi
2021-01-14 Martin KroekerMerge pull request #3051 from martin-frbg/rocketlake
2021-01-14 Albert ZiegenhagelFix building "generic" TRMM kernel with CMake
2021-01-12 Martin KroekerAdd workaround for NVIDIA HPC
2021-01-12 Martin KroekerAdd workaround for NVIDIA HPC
2021-01-12 Martin KroekerAdd workaround for NVIDIA HPC
2021-01-12 Martin KroekerAdd workaround for NVIDIA HPC mishandling of the asm...
2021-01-12 Martin KroekerAdd workaround for NVIDIA HPC mishandling of the asm...
2021-01-12 Martin KroekerSupport NVIDIA HPC compiler
2021-01-10 Martin KroekerMerge pull request #7 from xianyi/develop
2021-01-08 Martin KroekerMerge pull request #3055 from RajalakshmiSR/swapp10
2021-01-08 Rajalakshmi Sriniv... Optimize swap function for POWER10
2021-01-01 Martin KroekerMerge pull request #3052 from ashwinyes/arm64_fix_nrm2
2021-01-01 Ashwin Sekhar T Karm64: Fix nrm2 for input vectors with Inf
2020-12-27 Martin KroekerMerge pull request #6 from xianyi/develop
2020-12-27 Martin KroekerMerge pull request #3035 from Joshua-Ashton/patch-1
2020-12-21 Martin KroekerMerge pull request #3048 from martin-frbg/issue2998
2020-12-21 Martin KroekerTemporarily revert to the old nrm2 kernels
2020-12-21 Martin KroekerTemporarily revert to the old nrm2 kernels
2020-12-21 Martin KroekerTemporarily revert to the old nrm2 kernel
2020-12-19 Martin KroekerMerge pull request #3045 from martin-frbg/nvidiasdk
2020-12-19 Martin KroekerDisable FMA intrinsics in the srot kernel when the...
2020-12-19 Martin KroekerAmend SkylakeX options to support the NVIDIA compiler
2020-12-19 Martin KroekerMerge pull request #3042 from martin-frbg/develop
2020-12-17 Martin KroekerConditionally add -mfma to compiler options where needed
2020-12-13 Martin KroekerMerge pull request #4 from xianyi/develop
2020-12-13 Martin KroekerMerge pull request #3036 from RajalakshmiSR/p10copyalign
2020-12-13 Rajalakshmi Sriniv... POWER10: Improve copy performance
2020-12-12 Martin KroekerMerge pull request #3033 from xianyi/develop
2020-12-11 Martin KroekerMerge pull request #3 from xianyi/develop
2020-12-11 Martin KroekerMerge pull request #2994 from antonblanchard/power10...
2020-12-10 Martin KroekerMerge pull request #3029 from RajalakshmiSR/axpyp10
2020-12-10 Martin KroekerMerge pull request #3021 from austinpagan/trsm_p10
2020-12-10 Rajalakshmi Sriniv... POWER10: Improve axpy performance
2020-12-10 Martin KroekerMerge pull request #3026 from martin-frbg/revert747
2020-12-10 Martin KroekerMerge pull request #3027 from gxw-loongson/develop
2020-12-09 gxwAdd msa support for loongson
2020-12-08 Martin KroekerMerge pull request #2 from xianyi/develop
2020-12-08 Martin KroekerMerge pull request #3025 from TiredNotTear/develop
2020-12-07 Martin KroekerMerge pull request #3022 from jinboson/develop
2020-12-07 Hao ChenFix failed cgemv and zgemv test case after using msa...
2020-12-07 Hao ChenFix failed sswap and dswap case by using msa optimization
2020-12-06 Martin KroekerMerge pull request #3024 from martin-frbg/sparc
2020-12-06 Martin KroekerWork around DOT and SWAP test failures
2020-12-06 Martin KroekerFix compilation with SolarisStudio
2020-12-06 Martin KroekerMerge pull request #1 from xianyi/develop
2020-12-05 Jin BoFix test errors reported by cblas_cgemm & cblas_ctrmm
2020-12-04 Gordon FossumAdded special unrolled vectorized versions of "Solve...
2020-12-04 Martin KroekerMerge pull request #3018 from martin-frbg/issue3015
2020-12-04 Martin KroekerMerge pull request #3016 from xiegengxin/complex-asum
2020-12-04 Martin KroekerMerge pull request #3013 from martin-frbg/gcc46
2020-12-04 Martin KroekerMerge pull request #3011 from cyyever/fix_link
2020-12-02 Gengxin Xiefix error declare function blas_level1_thread_with_retu...
2020-12-01 Gengxin XieImprove the performance of zasum and casum with AVX512...
2020-11-30 Martin KroekerMerge pull request #3014 from RajalakshmiSR/dgemvnp10
2020-11-29 Rajalakshmi Sriniv... POWER10: Optimize dgemv_n
2020-11-22 Martin KroekerMerge pull request #112 from xianyi/develop
2020-11-22 Martin KroekerMerge pull request #2965 from epsilon-0/develop
2020-11-22 Martin KroekerMerge pull request #2988 from xiegengxin/smp-asum
2020-11-22 Martin KroekerMerge pull request #2997 from Flamefire/reproduce_crash
2020-11-22 Xianyi ZhangMerge branch 'risc-v' into develop
2020-11-22 Xianyi ZhangMerge branch 'develop' into risc-v
2020-11-16 Martin KroekerMerge pull request #2981 from Qiyu8/fix-sum
2020-11-16 Martin KroekerMerge pull request #2983 from Qiyu8/optimize-srot
2020-11-13 Martin KroekerMerge pull request #111 from xianyi/develop
2020-11-13 Gengxin XieImprove the performance of dasum and sasum when SMP...
2020-11-13 Qiyu8modify system.cmake to enable fma flag
2020-11-12 Qiyu8fix the CI failure of target specific option mismatch
2020-11-12 Qiyu8fix the CI failure of lack the head
2020-11-11 Qiyu8modify macro
2020-11-11 Qiyu8only FMA3 and vector larger than 128 have positive...
2020-11-11 Qiyu8Optimize the performance of rot by using universal...
2020-11-10 Qiyu8fix sum optimize issues
2020-11-10 Xianyi ZhangRefs #2899. Merge branch 'damonyu1989-openblas-open...
2020-11-10 Xianyi ZhangRefs #2899
2020-11-10 Xianyi ZhangMerge branch 'develop' into risc-v
2020-11-08 Martin KroekerMerge pull request #2972 from xiegengxin/rot-intrinsic
2020-11-08 Martin KroekerMerge pull request #2980 from martin-frbg/fixgetarch
2020-11-08 Martin KroekerMerge pull request #2979 from RajalakshmiSR/dot_power10
2020-11-08 Martin KroekerMerge pull request #2978 from martin-frbg/fixdynfeatures
2020-11-07 Rajalakshmi Sriniv... Optimize sdot/ddot for POWER10
2020-11-07 Martin KroekerRemove previous workaround for compiler flags related...
2020-11-07 Martin KroekerMerge pull request #110 from xianyi/develop
2020-11-07 Martin KroekerMerge pull request #2977 from martin-frbg/issue2976
2020-11-07 Martin KroekerFix macro name used in ifdef
2020-11-05 Gengxin Xiefix typo
2020-11-05 Gengxin XieImprove the performance of rot by using AVX512 and...
2020-11-04 Martin KroekerMerge pull request #2966 from martin-frbg/issue2964
2020-11-02 Martin KroekerMerge pull request #2967 from RajalakshmiSR/dgemm88
2020-11-01 Martin KroekerMerge pull request #2962 from brada4/develop
2020-10-31 Rajalakshmi Sriniv... POWER10: Change dgemm unroll factors
2020-10-31 Martin KroekerMerge pull request #109 from xianyi/develop
2020-10-31 Martin KroekerMerge pull request #2960 from thrasibule/avx2_detection
2020-10-30 Martin KroekerMerge pull request #2956 from RajalakshmiSR/caxpy_p10
2020-10-29 Rajalakshmi Sriniv... Optimize caxpy for POWER10
next