traits [Wed, 13 Jul 2011 16:54:23 +0000 (00:54 +0800)]
Fixed #44 a makefile bug when DYNAMIC_ARCH=1 and INTERFACE64=1.
traits [Tue, 28 Jun 2011 07:43:08 +0000 (15:43 +0800)]
Merge branch 'hotfix-0.1alpha2.1'
traits [Tue, 28 Jun 2011 07:42:09 +0000 (15:42 +0800)]
Refs #42. Output the error message when detecting fortran compiler failed.
traits [Thu, 23 Jun 2011 07:18:40 +0000 (15:18 +0800)]
Merge branch 'release-v0.1alpha2'
traits [Thu, 23 Jun 2011 07:16:24 +0000 (15:16 +0800)]
Fixed #38. Released v0.1 alpha2.
traits [Thu, 23 Jun 2011 07:09:34 +0000 (15:09 +0800)]
Refs #37. Updated REAME about the compatible issue with EKOPath compiler.
Xianyi Zhang [Wed, 22 Jun 2011 05:19:39 +0000 (13:19 +0800)]
Refs #39. Moved the shared lib (dll) to top directory in MingW64 compiler environment.
Xianyi Zhang [Tue, 21 Jun 2011 17:52:20 +0000 (01:52 +0800)]
Refs #39. It's unnecessary to include sys/mman.h file in blas_server_omp.c.
Xianyi Zhang [Tue, 21 Jun 2011 10:06:13 +0000 (18:06 +0800)]
Refs #38. Prepare the docs with v0.1alpha2.
Xianyi Zhang [Tue, 21 Jun 2011 09:50:00 +0000 (17:50 +0800)]
Merge branch 'loongson3a' into release-v0.1alpha2
Xianyi Zhang [Tue, 21 Jun 2011 09:40:16 +0000 (17:40 +0800)]
Merge branch 'add_install_target' into develop
Xianyi Zhang [Tue, 21 Jun 2011 09:39:08 +0000 (17:39 +0800)]
Refs #20. Fixed the installation bug with DYNAMIC_ARCH=1.
Xianyi Zhang [Mon, 20 Jun 2011 10:40:05 +0000 (18:40 +0800)]
Merge branch 'add_install_target' into develop
Conflicts:
Changelog.txt
Xianyi Zhang [Mon, 20 Jun 2011 10:36:29 +0000 (18:36 +0800)]
Refs #20. Updated the docs.
Xianyi Zhang [Mon, 20 Jun 2011 10:35:35 +0000 (18:35 +0800)]
Fixed #20. Added install target in makefile. You can use "make install PREFIX=your_installation_directory".
Xianyi Zhang [Sun, 19 Jun 2011 04:07:31 +0000 (12:07 +0800)]
Updated gitignore file.
Xianyi Zhang [Sun, 19 Jun 2011 03:59:38 +0000 (11:59 +0800)]
Merge branch 'master' of github.com:xianyi/OpenBLAS into develop
Xianyi Zhang [Sun, 19 Jun 2011 03:55:29 +0000 (11:55 +0800)]
Fixed #27. Temporarily walk around axpy's low performance issue with small imput size & multithreads.
Xianyi Zhang [Sat, 11 Jun 2011 12:59:00 +0000 (05:59 -0700)]
Merge pull request #36 from pipping/master
Fixed the bug about USE_OPENMP=0 enabling OpenMP
Elias Pipping [Sat, 11 Jun 2011 12:36:16 +0000 (14:36 +0200)]
Make USE_OPENMP=0 disable openmp
Xianyi Zhang [Thu, 9 Jun 2011 14:59:49 +0000 (22:59 +0800)]
Fixed #35 a build bug with NO_LAPACK=1 DYNAMIC_ARCH=1 FC=gfortran. I forgot to test it with gfortran in last bug fixed commit.
Xianyi Zhang [Thu, 9 Jun 2011 03:38:59 +0000 (11:38 +0800)]
Fixed #35 a build bug with NO_LAPACK=1 & DYNAMIC_ARCH=1.
Xianyi Zhang [Thu, 9 Jun 2011 02:40:15 +0000 (10:40 +0800)]
Print the wall time (cycles) with enabling FUNCTION_PROFILE.
Wang Qian [Tue, 7 Jun 2011 04:53:25 +0000 (12:53 +0800)]
Fixed #33 ztrmm bug on Nehalem.
Xianyi [Fri, 3 Jun 2011 05:19:54 +0000 (13:19 +0800)]
Fixed #32 a SEGFAULT bug with gcc-4.6. According to i386 calling convention, The called funtion should remove the hidden return value address from the stack.
Xianyi Zhang [Mon, 30 May 2011 04:42:17 +0000 (12:42 +0800)]
Fixed #31 Shared library placement on Mac. Thank Mr.Viral B. Shah for this patch.
traz [Sat, 28 May 2011 09:48:34 +0000 (09:48 +0000)]
Fixed #30 strmm computational error on Loongson3A.
Xianyi Zhang [Fri, 27 May 2011 13:15:30 +0000 (21:15 +0800)]
Fixed the makefile bug about openblas_set_num_threads.
Xianyi Zhang [Fri, 27 May 2011 10:16:19 +0000 (18:16 +0800)]
Fixed a bug about detecting underscore prefix in c_check.
Xianyi Zhang [Fri, 27 May 2011 10:12:45 +0000 (18:12 +0800)]
Ingnore *.obj files in git.
traz [Fri, 27 May 2011 09:47:17 +0000 (09:47 +0000)]
Modify single precision compiler conditions, increasing single precision kernel code on Loongson3a.
traz [Wed, 18 May 2011 10:54:51 +0000 (10:54 +0000)]
Remove the useless code, modify code comments and format.
Xianyi Zhang [Tue, 17 May 2011 21:24:00 +0000 (21:24 +0000)]
Fixed #28. Convert the result to double precision in MIPS64 dsdot_k kernel.
traz [Sat, 14 May 2011 22:00:57 +0000 (22:00 +0000)]
Fixed #25 dtrmm and dtrsm computational error on Loongson3A.
Xianyi Zhang [Thu, 12 May 2011 18:41:39 +0000 (02:41 +0800)]
Added missed testing codes for dsdot.
Xianyi Zhang [Thu, 12 May 2011 18:34:30 +0000 (02:34 +0800)]
Fixed #28. Convert the result to double precision in the end of dsdot kernel.
Xianyi Zhang [Thu, 12 May 2011 18:19:55 +0000 (02:19 +0800)]
Added the unit testcase for dsdot.
Xianyi Zhang [Thu, 12 May 2011 17:21:39 +0000 (01:21 +0800)]
Added the unit test for drotmg.
Xianyi Zhang [Thu, 12 May 2011 11:06:31 +0000 (19:06 +0800)]
Merge branch 'hotfix-readme_about_branches' into develop
Xianyi Zhang [Thu, 12 May 2011 11:06:02 +0000 (19:06 +0800)]
Merge branch 'hotfix-readme_about_branches'
Xianyi Zhang [Thu, 12 May 2011 11:05:20 +0000 (19:05 +0800)]
Added the spec of git branches about this project.
traz [Wed, 11 May 2011 10:44:23 +0000 (10:44 +0000)]
Finish dtrsm_kernel_Rx.S on Loongson3A.
Xianyi Zhang [Tue, 10 May 2011 17:12:32 +0000 (01:12 +0800)]
Fixed #26 the wrong result of rotmg. Used fabs() instead of abs().
traz [Tue, 10 May 2011 12:48:43 +0000 (12:48 +0000)]
Finish dtrsm_kernel_Lx.S on Loongson3A.
traz [Mon, 9 May 2011 17:31:58 +0000 (17:31 +0000)]
Modify dtrsm compiler options
traz [Mon, 9 May 2011 17:28:20 +0000 (17:28 +0000)]
Fixed #24 drmm error on Loongson3A
Xianyi Zhang [Fri, 6 May 2011 09:03:35 +0000 (17:03 +0800)]
Added openblas_set_num_threads for Fortran.
Xianyi Zhang [Wed, 4 May 2011 05:03:10 +0000 (13:03 +0800)]
Fixed #23. Fixed a bug of f_check script about generating link flags.
Xianyi Zhang [Tue, 3 May 2011 09:19:36 +0000 (17:19 +0800)]
Fixed a bug when detecting Intel CPU.
traits [Tue, 3 May 2011 06:42:11 +0000 (14:42 +0800)]
Fixed a build bug with NO_LAPACK=1 and SANNITY_CHECK=1.
Xianyi Zhang [Fri, 22 Apr 2011 14:14:06 +0000 (22:14 +0800)]
Fixed #16. Print the user-friendly message when detecting CPU failed.
Xianyi Zhang [Fri, 22 Apr 2011 14:07:46 +0000 (22:07 +0800)]
Added docs for make TARGET=your_cpu_target.
Xianyi Zhang [Fri, 22 Apr 2011 12:21:42 +0000 (20:21 +0800)]
Fixed #19. Provided an error msg when the arch is not supported.
Xianyi Zhang [Wed, 20 Apr 2011 05:41:38 +0000 (13:41 +0800)]
Fixed #21. Added extern C to support C++. Thank Tasio for the patch.
traz [Sun, 17 Apr 2011 20:26:49 +0000 (20:26 +0000)]
Completely dtrmm function.
traz [Fri, 15 Apr 2011 21:56:25 +0000 (21:56 +0000)]
Increased handling trmm part, no edge handling. Test size(M and N) must be a multiple of 4 .
traz [Mon, 11 Apr 2011 22:46:36 +0000 (22:46 +0000)]
Modify prefetching C.
traz [Mon, 11 Apr 2011 22:17:57 +0000 (22:17 +0000)]
Adjust kc size from 112 to 116 .
Xianyi Zhang [Mon, 11 Apr 2011 21:46:48 +0000 (21:46 +0000)]
Changed default page size to 16KB on Loongson 3A.
Xianyi Zhang [Thu, 7 Apr 2011 06:52:35 +0000 (14:52 +0800)]
Supported goto_set_num_threads & openblas_set_num_threads functions when USE_OPENMP=1.
Xianyi Zhang [Mon, 28 Mar 2011 02:58:39 +0000 (10:58 +0800)]
Fixed #14 the SEGFAULT bug on 64 cores. On SMP server, the number of CPUs or cores should be less than or equal to 64.
Xianyi Zhang [Thu, 24 Mar 2011 17:16:12 +0000 (01:16 +0800)]
Fixed #13. Fixed blasint undefined bug in <cblas.h> file.
Xianyi Zhang [Sun, 20 Mar 2011 15:35:31 +0000 (23:35 +0800)]
Updated the developing version to v0.1 alpha2.
Xianyi Zhang [Sun, 20 Mar 2011 15:30:09 +0000 (23:30 +0800)]
Init Changelog file for next release version(v0.1alpha2).
traz [Wed, 6 Apr 2011 10:39:31 +0000 (10:39 +0000)]
Change BLOCK SIZE of LOONGSON3A TARGET.
traz [Wed, 6 Apr 2011 10:38:34 +0000 (10:38 +0000)]
Add dgemm compiler Options in KERNEL.LOONGSON3A.
traz [Wed, 6 Apr 2011 10:36:44 +0000 (10:36 +0000)]
New kernel in LOONGSON3A.
Xianyi Zhang [Mon, 28 Mar 2011 02:58:39 +0000 (10:58 +0800)]
Fixed #14 the SEGFAULT bug on 64 cores. On SMP server, the number of CPUs or cores should be less than or equal to 64.
Xianyi Zhang [Thu, 24 Mar 2011 17:16:12 +0000 (01:16 +0800)]
Fixed #13. Fixed blasint undefined bug in <cblas.h> file.
Xianyi Zhang [Tue, 22 Mar 2011 06:16:18 +0000 (14:16 +0800)]
Merge branch 'master' of github.com:xianyi/OpenBLAS into x86
Xianyi Zhang [Tue, 22 Mar 2011 06:09:47 +0000 (14:09 +0800)]
Fixed the detecting bug on Intel Core i5. Thank ggl329 for the patch.
Xianyi Zhang [Sun, 20 Mar 2011 15:35:31 +0000 (23:35 +0800)]
Updated the developing version to v0.1 alpha2.
Xianyi Zhang [Sun, 20 Mar 2011 15:30:09 +0000 (23:30 +0800)]
Init Changelog file for next release version(v0.1alpha2).
Xianyi Zhang [Sun, 20 Mar 2011 14:44:57 +0000 (22:44 +0800)]
OpenBLAS 0.1 alpha version 1.
Xianyi Zhang [Sun, 20 Mar 2011 13:57:58 +0000 (21:57 +0800)]
Merge remote branch 'origin/loongson3a' into x86
Xianyi Zhang [Sun, 20 Mar 2011 13:57:09 +0000 (21:57 +0800)]
Merge remote branch 'origin/loongson3a' into x86
Xianyi Zhang [Sun, 20 Mar 2011 13:56:40 +0000 (21:56 +0800)]
Detect Intel Core Clarkdale & Arrandale
Xianyi Zhang [Fri, 18 Mar 2011 23:05:56 +0000 (23:05 +0000)]
Fixed the bug about Loongson3A gsLQC1 & gsSQC1 instructions in daxpy kernel. Now daxpy is correct.
Xianyi Zhang [Fri, 18 Mar 2011 01:20:15 +0000 (01:20 +0000)]
Merge branch 'loongson3a' of github.com:xianyi/OpenBLAS into loongson3a
Xianyi Zhang [Fri, 18 Mar 2011 01:10:58 +0000 (01:10 +0000)]
Supported detecting new kernel(2.6.36) & new Loongson3A03 CPU.
Wang Qian [Mon, 7 Mar 2011 11:22:32 +0000 (11:22 +0000)]
Modified the default kernel makefile in MIPS64 arch.
Xianyi Zhang [Sat, 5 Mar 2011 02:17:10 +0000 (10:17 +0800)]
Support unalign address in daxpy on loongson3a simd..
Xianyi Zhang [Fri, 4 Mar 2011 09:50:17 +0000 (17:50 +0800)]
Unroll to 16 in daxpy on loongson3a.
Xianyi Zhang [Fri, 4 Mar 2011 14:11:52 +0000 (14:11 +0000)]
Merge commit 'origin/x86' into loongson3a
Xianyi Zhang [Fri, 4 Mar 2011 03:53:04 +0000 (11:53 +0800)]
Merge branch 'x86' of github.com:xianyi/OpenBLAS into x86
Xianyi Zhang [Fri, 4 Mar 2011 03:51:32 +0000 (11:51 +0800)]
Support NO_LAPACK=1 to build the lib without LAPACK functions.
Xianyi [Wed, 2 Mar 2011 16:46:39 +0000 (00:46 +0800)]
Changed movlps macro name in capital in x86/zdot_sse2.S file.
Xianyi [Wed, 2 Mar 2011 10:45:30 +0000 (18:45 +0800)]
On x86 32bits, gcc 4.4.3 generated wrong codes (movsd) from movlps in zdot_sse2.S line 191.
This would casue zdotu & zdotc failures. Instead, use movlpd to walk around it. Fixed #8. Fixed #9.
Xianyi Zhang [Wed, 2 Mar 2011 10:03:40 +0000 (18:03 +0800)]
Added zdotu with x & y offset=1 test case.
Xianyi Zhang [Wed, 2 Mar 2011 05:52:05 +0000 (13:52 +0800)]
Merge remote branch 'origin/x86' into loongson3a
Xianyi Zhang [Wed, 2 Mar 2011 05:40:55 +0000 (13:40 +0800)]
updated the changelog.
Xianyi Zhang [Wed, 2 Mar 2011 05:38:32 +0000 (13:38 +0800)]
Fixed randomly SEGFAULT when nodemask==NULL with above Linux 2.6.34. Fixed #12. Thank Mr.Ei-ji Nakama providing this patch.
Xianyi Zhang [Sat, 26 Feb 2011 04:27:56 +0000 (12:27 +0800)]
Added Changelog. Fixed #11.
Xianyi Zhang [Sat, 26 Feb 2011 03:51:39 +0000 (11:51 +0800)]
Enable Debug flags in memory alloc and init functions.
Xianyi Zhang [Sat, 26 Feb 2011 03:19:54 +0000 (11:19 +0800)]
Added DEBUG option in Makefile.rule. Fixed DEBUG typo mistakes.
Xianyi Zhang [Thu, 24 Feb 2011 09:02:52 +0000 (17:02 +0800)]
Merge branch 'x86' of github.com:xianyi/OpenBLAS into x86
Xianyi Zhang [Thu, 24 Feb 2011 07:16:21 +0000 (15:16 +0800)]
Fixed #10. Supported GOTO_NUM_THREADS & GOTO_THREADS_TIMEOUT environment variables.
Xianyi [Wed, 23 Feb 2011 12:08:34 +0000 (20:08 +0800)]
Fixed #7. Modified axpy kernel codes to avoid unloop with incx==0 or incy==0 in x86 32bits arch.
Xianyi Zhang [Tue, 22 Feb 2011 06:16:46 +0000 (14:16 +0800)]
Added unit test case (zdotu, N=1).
Xianyi Zhang [Tue, 22 Feb 2011 05:40:40 +0000 (13:40 +0800)]
Supported building debug version.