Some of the new multi-arch string functions for x86-64 were
not aligned to 16-byte boundaries, possibly causing unnecessary
cache line misses and delays.
+2009-07-03 Ulrich Drepper <drepper@redhat.com>
+
+ * sysdeps/x86_64/multiarch/strcmp.S: Make sure functions are all
+ aligned to 16 byte boundaries.
+ * sysdeps/x86_64/multiarch/strcpy.S: Likewise.
+ * sysdeps/x86_64/multiarch/strlen.S: Likewise.
+ * sysdeps/x86_64/multiarch/rawmemchr.S: Likewise.
+
2009-07-02 H.J. Lu <hongjiu.lu@intel.com>
* config.h.in (HAVE_SSE4_SUPPORT): New macro.
# undef ENTRY
# define ENTRY(name) \
.type __rawmemchr_sse2, @function; \
+ .align 16; \
__rawmemchr_sse2: cfi_startproc; \
CALL_MCOUNT
# undef END
# undef ENTRY
# define ENTRY(name) \
.type STRCMP_SSE2, @function; \
+ .align 16; \
STRCMP_SSE2: cfi_startproc; \
CALL_MCOUNT
# undef END
# undef ENTRY
# define ENTRY(name) \
.type STRCPY_SSE2, @function; \
+ .align 16; \
STRCPY_SSE2: cfi_startproc; \
CALL_MCOUNT
# undef END
# undef ENTRY
# define ENTRY(name) \
.type __strlen_sse2, @function; \
+ .align 16; \
__strlen_sse2: cfi_startproc; \
CALL_MCOUNT
# undef END