-
- Downloads
x86-64: Add Avoid_Short_Distance_REP_MOVSB
commit 3ec5d83d Author: H.J. Lu <hjl.tools@gmail.com> Date: Sat Jan 25 14:19:40 2020 -0800 x86-64: Avoid rep movsb with short distance [BZ #27130] introduced some regressions on Intel processors without Fast Short REP MOV (FSRM). Add Avoid_Short_Distance_REP_MOVSB to avoid rep movsb with short distance only on Intel processors with FSRM. bench-memmove-large on Skylake server shows that cycles of __memmove_evex_unaligned_erms improves for the following data size: before after Improvement length=4127, align1=3, align2=0: 479.38 349.25 27% length=4223, align1=9, align2=5: 405.62 333.25 18% length=8223, align1=3, align2=0: 786.12 496.38 37% length=8319, align1=9, align2=5: 727.50 501.38 31% length=16415, align1=3, align2=0: 1436.88 840.00 41% length=16511, align1=9, align2=5: 1375.50 836.38 39% length=32799, align1=3, align2=0: 2890.00 1860.12 36% length=32895, align1=9, align2=5: 2891.38 1931.88 33%
Showing
- sysdeps/x86/cacheinfo.h 9 additions, 0 deletionssysdeps/x86/cacheinfo.h
- sysdeps/x86/cpu-features.c 5 additions, 0 deletionssysdeps/x86/cpu-features.c
- sysdeps/x86/include/cpu-features-preferred_feature_index_1.def 1 addition, 0 deletions...ps/x86/include/cpu-features-preferred_feature_index_1.def
- sysdeps/x86/sysdep.h 5 additions, 0 deletionssysdeps/x86/sysdep.h
- sysdeps/x86_64/multiarch/memmove-vec-unaligned-erms.S 5 additions, 0 deletionssysdeps/x86_64/multiarch/memmove-vec-unaligned-erms.S
Loading
Please register or sign in to comment