platform/upstream/openblas.git
9 years agoadded optimized ddot kernel for bulldozer
wernsaar [Fri, 22 Aug 2014 19:19:29 +0000 (21:19 +0200)]
added optimized ddot kernel for bulldozer

9 years agoadded optimized ddot kernel for nehalem
wernsaar [Fri, 22 Aug 2014 18:34:41 +0000 (20:34 +0200)]
added optimized ddot kernel for nehalem

9 years agobugfix for Makefile
wernsaar [Fri, 22 Aug 2014 15:02:55 +0000 (17:02 +0200)]
bugfix for Makefile

9 years agoupdate of KERNEL.BULLDOZER
wernsaar [Fri, 22 Aug 2014 15:01:27 +0000 (17:01 +0200)]
update of KERNEL.BULLDOZER

9 years agoadded optimized sdot kernel for nehalem
wernsaar [Fri, 22 Aug 2014 15:00:26 +0000 (17:00 +0200)]
added optimized sdot kernel for nehalem

9 years agoadded optimized sdot for bulldozer
wernsaar [Fri, 22 Aug 2014 12:29:17 +0000 (14:29 +0200)]
added optimized sdot for bulldozer

9 years agobugfix in Makefile
wernsaar [Fri, 22 Aug 2014 09:51:30 +0000 (11:51 +0200)]
bugfix in Makefile

9 years agoadded sdot and ddot benchmarks
wernsaar [Fri, 22 Aug 2014 09:42:07 +0000 (11:42 +0200)]
added sdot and ddot benchmarks

9 years agoadded hemv benchmark
wernsaar [Fri, 22 Aug 2014 08:00:09 +0000 (10:00 +0200)]
added hemv benchmark

9 years agoadded benchmarks for csymv and zsymv
wernsaar [Thu, 21 Aug 2014 17:33:57 +0000 (19:33 +0200)]
added benchmarks for csymv and zsymv

9 years agoadded optimized symv_L kernels for nehalem
wernsaar [Thu, 21 Aug 2014 12:27:00 +0000 (14:27 +0200)]
added optimized symv_L kernels for nehalem

9 years agoadded optimized ssymv_L kernel for bulldozer
wernsaar [Thu, 21 Aug 2014 11:32:06 +0000 (13:32 +0200)]
added optimized ssymv_L kernel for bulldozer

9 years agoadded optimized dsymv_L kernel for bulldozer
wernsaar [Thu, 21 Aug 2014 11:02:53 +0000 (13:02 +0200)]
added optimized dsymv_L kernel for bulldozer

9 years agoadded optimized dsymv_U kernel for nehalem
wernsaar [Wed, 20 Aug 2014 07:58:04 +0000 (09:58 +0200)]
added optimized dsymv_U kernel for nehalem

9 years agoupdated optimized dsymv_U kernel for bulldozer
wernsaar [Wed, 20 Aug 2014 07:00:56 +0000 (09:00 +0200)]
updated optimized dsymv_U kernel for bulldozer

9 years agoupdated optimized ssymv_U for bulldozer
wernsaar [Tue, 19 Aug 2014 17:25:03 +0000 (19:25 +0200)]
updated optimized ssymv_U for bulldozer

9 years agoadded optimized ssymv_U kernel for nehalem
wernsaar [Tue, 19 Aug 2014 15:09:45 +0000 (17:09 +0200)]
added optimized ssymv_U kernel for nehalem

9 years agoadded optimized ssymv_U kernel for bulldozer
wernsaar [Mon, 18 Aug 2014 11:52:24 +0000 (13:52 +0200)]
added optimized ssymv_U kernel for bulldozer

9 years agoadded optimized dsymv_U kernel for bulldozer
wernsaar [Mon, 18 Aug 2014 10:18:10 +0000 (12:18 +0200)]
added optimized dsymv_U kernel for bulldozer

9 years agoadd reference in C for symv_U
wernsaar [Sat, 16 Aug 2014 11:52:50 +0000 (13:52 +0200)]
add reference in C for symv_U

9 years agoadded reference in C for symv_L
wernsaar [Sat, 16 Aug 2014 09:36:48 +0000 (11:36 +0200)]
added reference in C for symv_L

9 years agoRef #433: removed obsolete lapack entries from common_interface.h
wernsaar [Fri, 15 Aug 2014 10:40:10 +0000 (12:40 +0200)]
Ref #433: removed obsolete lapack entries from common_interface.h

9 years agoMerge pull request #434 from wernsaar/develop
Zhang Xianyi [Fri, 15 Aug 2014 00:07:27 +0000 (08:07 +0800)]
Merge pull request #434 from wernsaar/develop

A lot of performance enhancements

9 years agoadded optimized cgemv_n for haswell
wernsaar [Thu, 14 Aug 2014 17:00:30 +0000 (19:00 +0200)]
added optimized cgemv_n for haswell

9 years agoadded optimized cgemv_t kernel for haswell
wernsaar [Thu, 14 Aug 2014 12:10:29 +0000 (14:10 +0200)]
added optimized cgemv_t kernel for haswell

9 years agooptimized zgemv_n kernel for sandybridge
wernsaar [Wed, 13 Aug 2014 14:10:03 +0000 (16:10 +0200)]
optimized zgemv_n kernel for sandybridge

9 years agoadded additional test values
wernsaar [Wed, 13 Aug 2014 12:54:50 +0000 (14:54 +0200)]
added additional test values

9 years agoadded fast return, if m or n < 1
wernsaar [Wed, 13 Aug 2014 11:54:19 +0000 (13:54 +0200)]
added fast return, if m or n < 1

9 years agooptimized zgemv_t_microk_haswell-2.c
wernsaar [Wed, 13 Aug 2014 11:42:22 +0000 (13:42 +0200)]
optimized zgemv_t_microk_haswell-2.c

9 years agobugfix for zgemv_n_microk_haswell-2.c
wernsaar [Wed, 13 Aug 2014 10:54:18 +0000 (12:54 +0200)]
bugfix for zgemv_n_microk_haswell-2.c

9 years agobugfix in zgemv_n_microk_sandy-2.c
wernsaar [Wed, 13 Aug 2014 10:18:03 +0000 (12:18 +0200)]
bugfix in zgemv_n_microk_sandy-2.c

9 years agoadded optimized cgemv_t c-kernel
wernsaar [Tue, 12 Aug 2014 10:15:41 +0000 (12:15 +0200)]
added optimized cgemv_t c-kernel

9 years agobugfix in zgemv_n_microk_haswell-2.c
wernsaar [Tue, 12 Aug 2014 08:02:25 +0000 (10:02 +0200)]
bugfix in zgemv_n_microk_haswell-2.c

9 years agomodified algorithm for better numerical stability
wernsaar [Tue, 12 Aug 2014 06:35:42 +0000 (08:35 +0200)]
modified algorithm for better numerical stability

9 years agoadded optimized zgemv_t kernel for haswell
wernsaar [Mon, 11 Aug 2014 14:57:52 +0000 (16:57 +0200)]
added optimized zgemv_t kernel for haswell

9 years agoadd optimized zgemv_t kernel for bulldozer
wernsaar [Mon, 11 Aug 2014 12:19:25 +0000 (14:19 +0200)]
add optimized zgemv_t kernel for bulldozer

9 years agoadded optimized zgemv_t for haswell
wernsaar [Mon, 11 Aug 2014 11:10:12 +0000 (13:10 +0200)]
added optimized zgemv_t for haswell

9 years agoadded optimimized zgemv_t c-kernel
wernsaar [Mon, 11 Aug 2014 07:13:18 +0000 (09:13 +0200)]
added optimimized zgemv_t c-kernel

9 years agodisabled optimized haswell zgemv_n kernel for windows ( bad rounding )
wernsaar [Sun, 10 Aug 2014 09:57:24 +0000 (11:57 +0200)]
disabled optimized haswell zgemv_n kernel for windows ( bad rounding )

9 years agoadded optimized zgemv_n kernel for haswell
wernsaar [Sun, 10 Aug 2014 06:39:17 +0000 (08:39 +0200)]
added optimized zgemv_n kernel for haswell

9 years agoadded zgemv_n c-function
wernsaar [Thu, 7 Aug 2014 20:30:20 +0000 (22:30 +0200)]
added zgemv_n c-function

9 years agoadded optimized dgemv_t kernel for haswell
wernsaar [Thu, 7 Aug 2014 08:08:54 +0000 (10:08 +0200)]
added optimized dgemv_t kernel for haswell

9 years agoadded optimized dgemv_n kernel for haswell
wernsaar [Thu, 7 Aug 2014 07:18:02 +0000 (09:18 +0200)]
added optimized dgemv_n kernel for haswell

9 years agoadded optimized sgemv_t kernel for sandybridge
wernsaar [Thu, 7 Aug 2014 05:49:33 +0000 (07:49 +0200)]
added optimized sgemv_t kernel for sandybridge

9 years agoenabled optimized nehalem sgemv_t kernel for windows
wernsaar [Wed, 6 Aug 2014 14:58:21 +0000 (16:58 +0200)]
enabled optimized nehalem sgemv_t kernel for windows

9 years agoadded optimized sgemv_t kernel for nehalem
wernsaar [Wed, 6 Aug 2014 14:21:48 +0000 (16:21 +0200)]
added optimized sgemv_t kernel for nehalem

9 years agoenabled optimized sgemv kernels for windows
wernsaar [Wed, 6 Aug 2014 12:24:36 +0000 (14:24 +0200)]
enabled optimized sgemv kernels for windows

9 years agoenabled optimized sgemv kernels for windows
wernsaar [Wed, 6 Aug 2014 12:06:30 +0000 (14:06 +0200)]
enabled optimized sgemv kernels for windows

9 years agoadded optimized sgemv_t kernel for haswell
wernsaar [Wed, 6 Aug 2014 11:42:41 +0000 (13:42 +0200)]
added optimized sgemv_t kernel for haswell

9 years agoadded optimized sgemv_t kernel
wernsaar [Wed, 6 Aug 2014 10:12:17 +0000 (12:12 +0200)]
added optimized sgemv_t kernel

9 years agoMerge pull request #430 from wernsaar/develop
Zhang Xianyi [Tue, 5 Aug 2014 18:52:30 +0000 (02:52 +0800)]
Merge pull request #430 from wernsaar/develop

added a better optimized sgemv_n kernel

9 years agobugfix in sgemv_n.c
wernsaar [Tue, 5 Aug 2014 18:14:29 +0000 (20:14 +0200)]
bugfix in sgemv_n.c

9 years agochanged default x86_64 sgemv_n kernel to sgemv_n.c
wernsaar [Tue, 5 Aug 2014 17:42:56 +0000 (19:42 +0200)]
changed default x86_64 sgemv_n kernel to sgemv_n.c

9 years agoenabled optimized sgemv_n kernels for windows
wernsaar [Tue, 5 Aug 2014 12:52:54 +0000 (14:52 +0200)]
enabled optimized sgemv_n kernels for windows

9 years agochanged long to blaslong for windows compatibility
wernsaar [Tue, 5 Aug 2014 11:28:39 +0000 (13:28 +0200)]
changed long to blaslong for windows compatibility

9 years agoadded optimized sgemv_n kernel for nehalem
wernsaar [Tue, 5 Aug 2014 08:50:08 +0000 (10:50 +0200)]
added optimized sgemv_n kernel for nehalem

9 years agooptimized sgemv_n kernel for sandybridge
wernsaar [Tue, 5 Aug 2014 06:53:09 +0000 (08:53 +0200)]
optimized sgemv_n kernel for sandybridge

9 years agoupdated haswell optimized sgmv_n kernel
wernsaar [Tue, 5 Aug 2014 06:04:47 +0000 (08:04 +0200)]
updated haswell optimized sgmv_n kernel

9 years agomodified sgemv_n for haswell
wernsaar [Mon, 4 Aug 2014 14:22:11 +0000 (16:22 +0200)]
modified sgemv_n for haswell

9 years agoadded a better optimized sgemv_n kernel for bulldozer and piledriver
wernsaar [Mon, 4 Aug 2014 12:29:01 +0000 (14:29 +0200)]
added a better optimized sgemv_n kernel for bulldozer and piledriver

9 years agoMerge pull request #429 from idunham/numprocs
Zhang Xianyi [Mon, 4 Aug 2014 00:12:23 +0000 (08:12 +0800)]
Merge pull request #429 from idunham/numprocs

Fix link error on Linux/musl.

9 years agoFix link error on Linux/musl.
Isaac Dunham [Sun, 3 Aug 2014 22:06:30 +0000 (15:06 -0700)]
Fix link error on Linux/musl.

get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.)

9 years agoMerge pull request #427 from wernsaar/develop
Zhang Xianyi [Sat, 2 Aug 2014 16:57:44 +0000 (00:57 +0800)]
Merge pull request #427 from wernsaar/develop

added experimental support for big numa machines

9 years agoadded experimental support for big numa machines
wernsaar [Sat, 2 Aug 2014 11:40:16 +0000 (13:40 +0200)]
added experimental support for big numa machines

10 years agoMerge pull request #426 from wernsaar/develop
Zhang Xianyi [Sat, 2 Aug 2014 07:34:41 +0000 (15:34 +0800)]
Merge pull request #426 from wernsaar/develop

added benchmark program for lapack ?getri functions

10 years agobugfix for linux affinity code
wernsaar [Fri, 1 Aug 2014 21:10:08 +0000 (23:10 +0200)]
bugfix for linux affinity code

10 years agoadded benchmarks for lapack potrf, potrs and potri functions
wernsaar [Fri, 1 Aug 2014 19:08:37 +0000 (21:08 +0200)]
added benchmarks for lapack potrf, potrs and potri functions

10 years agoadded benchmark program for lapack ?getri functions
wernsaar [Fri, 1 Aug 2014 06:55:20 +0000 (08:55 +0200)]
added benchmark program for lapack ?getri functions

10 years agoMerge pull request #425 from wernsaar/develop
Zhang Xianyi [Fri, 1 Aug 2014 00:04:16 +0000 (08:04 +0800)]
Merge pull request #425 from wernsaar/develop

added benchmark for lapack ?geev routines

10 years agobugfix for zgeev
wernsaar [Thu, 31 Jul 2014 10:35:38 +0000 (12:35 +0200)]
bugfix for zgeev

10 years agoadded lapack geev benchmark
wernsaar [Thu, 31 Jul 2014 08:35:25 +0000 (10:35 +0200)]
added lapack geev benchmark

10 years agoMerge pull request #424 from ihnorton/fix_arm_cpuid
Zhang Xianyi [Thu, 31 Jul 2014 05:54:07 +0000 (13:54 +0800)]
Merge pull request #424 from ihnorton/fix_arm_cpuid

cpuid_arm: fix detection when cpuinfo uses "Processor"

10 years agocpuid_arm: fix detection when cpuinfo uses "Processor"
Isaiah Norton [Thu, 31 Jul 2014 04:59:30 +0000 (04:59 +0000)]
cpuid_arm: fix detection when cpuinfo uses "Processor"

instead of "model name"

10 years agobugfix in cholesky.c
wernsaar [Wed, 30 Jul 2014 12:00:19 +0000 (14:00 +0200)]
bugfix in cholesky.c

10 years agoextented plot-filter.sh for linpack and cholesky benchmarks
wernsaar [Wed, 30 Jul 2014 11:03:42 +0000 (13:03 +0200)]
extented plot-filter.sh for linpack and cholesky benchmarks

10 years agoMerge pull request #422 from wernsaar/develop
Zhang Xianyi [Wed, 30 Jul 2014 09:09:58 +0000 (17:09 +0800)]
Merge pull request #422 from wernsaar/develop

optimization of sandybridge cgemm-kernel

10 years agooptimization of sandybridge cgemm-kernel
wernsaar [Tue, 29 Jul 2014 17:07:21 +0000 (19:07 +0200)]
optimization of sandybridge cgemm-kernel

10 years agoMerge pull request #421 from wernsaar/develop
Zhang Xianyi [Tue, 29 Jul 2014 07:50:00 +0000 (15:50 +0800)]
Merge pull request #421 from wernsaar/develop

optimized sgemm- and cgemm-kernel for haswell

10 years agooptimized cgemm kernel for haswell
wernsaar [Tue, 29 Jul 2014 06:53:09 +0000 (08:53 +0200)]
optimized cgemm kernel for haswell

10 years agooptimized sgemm kernel for haswell
wernsaar [Mon, 28 Jul 2014 09:50:01 +0000 (11:50 +0200)]
optimized sgemm kernel for haswell

10 years agoMerge pull request #420 from wernsaar/develop
Zhang Xianyi [Sun, 27 Jul 2014 15:30:14 +0000 (23:30 +0800)]
Merge pull request #420 from wernsaar/develop

Optimizations for HASWELL

10 years agooptimized zgemm kernel for haswell
wernsaar [Sun, 27 Jul 2014 09:51:42 +0000 (11:51 +0200)]
optimized zgemm kernel for haswell

10 years agooptimizations for trsm
wernsaar [Fri, 25 Jul 2014 09:59:17 +0000 (11:59 +0200)]
optimizations for trsm

10 years agooptimizations for trmm
wernsaar [Fri, 25 Jul 2014 08:00:23 +0000 (10:00 +0200)]
optimizations for trmm

10 years agoallow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk
wernsaar [Thu, 24 Jul 2014 16:43:31 +0000 (18:43 +0200)]
allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk

10 years agoadded a sample plot-filter scripts and a header file for gnuplot
wernsaar [Mon, 21 Jul 2014 12:50:24 +0000 (14:50 +0200)]
added a sample plot-filter scripts and a header file for gnuplot

10 years agoadded symv benchmark
wernsaar [Mon, 21 Jul 2014 05:50:54 +0000 (07:50 +0200)]
added symv benchmark

10 years agoadd benchmark for ger
wernsaar [Mon, 21 Jul 2014 04:25:42 +0000 (06:25 +0200)]
add benchmark for ger

10 years agoMerge pull request #419 from wernsaar/develop
Zhang Xianyi [Sun, 20 Jul 2014 15:35:17 +0000 (23:35 +0800)]
Merge pull request #419 from wernsaar/develop

added optimized sgemv kernels for Sandy Bridge, Haswell, Bullldozer, and Piledriver.

10 years agoadded sgemv_n microkernel for haswell
wernsaar [Sun, 20 Jul 2014 12:52:25 +0000 (14:52 +0200)]
added sgemv_n microkernel for haswell

10 years agooptimized sgemv_n_microk_sandy.c
wernsaar [Sun, 20 Jul 2014 12:08:04 +0000 (14:08 +0200)]
optimized sgemv_n_microk_sandy.c

10 years agoadded sgemv_n microkernel for sandybridge
wernsaar [Sun, 20 Jul 2014 11:17:47 +0000 (13:17 +0200)]
added sgemv_n microkernel for sandybridge

10 years agoadded sgemv_t microkernel for haswell
wernsaar [Sun, 20 Jul 2014 09:30:32 +0000 (11:30 +0200)]
added sgemv_t microkernel for haswell

10 years agobugfix in sgemv_t_microk_sandy.c
wernsaar [Sun, 20 Jul 2014 08:48:41 +0000 (10:48 +0200)]
bugfix in sgemv_t_microk_sandy.c

10 years agoadded sgemv_t microkernel for sandybridge
wernsaar [Sun, 20 Jul 2014 08:21:08 +0000 (10:21 +0200)]
added sgemv_t microkernel for sandybridge

10 years agoadded optimized sgemv_t for bulldozer and piledriver
wernsaar [Sat, 19 Jul 2014 13:48:07 +0000 (15:48 +0200)]
added optimized sgemv_t for bulldozer and piledriver

10 years agodon't use this sgemv_n on Windows
wernsaar [Sat, 19 Jul 2014 05:15:34 +0000 (07:15 +0200)]
don't use this sgemv_n on Windows

10 years agoMerge pull request #414 from staticfloat/sf/symlinkfix
Zhang Xianyi [Fri, 18 Jul 2014 15:13:18 +0000 (23:13 +0800)]
Merge pull request #414 from staticfloat/sf/symlinkfix

Don't create an absolute symlink when installing on Darwin

10 years agoperformance optimizations for sgemv_n
wernsaar [Fri, 18 Jul 2014 09:25:21 +0000 (11:25 +0200)]
performance optimizations for sgemv_n

10 years agoadded blocked sgemv_n and microkernel for bulldozer and piledriver
wernsaar [Thu, 17 Jul 2014 21:15:07 +0000 (23:15 +0200)]
added blocked sgemv_n and microkernel for bulldozer and piledriver