glibc中strlen的实现

全部 Ruby Python PHP Flash C++ .net Rails Flex C C# Django

浏览 5412 次

锁定老帖子主题：glibc中strlen的实现精华帖 (0) :: 良好帖 (1) :: 新手帖 (0) :: 隐藏帖 (0)
作者	正文
simohayha 等级: 性别: 文章: 1513 积分: 951 来自: 火星	发表时间：2009-08-04 相关推荐: glibc-strlen 分析 glibc strlen 实现方式分析 glibc 系列之strlen（）函数 glibc java_glibc笔记——strlen Glibc中strlen的汇编实现更多相关推荐 C glibc中的strlen的实现主要的思想就是每次检测4个字节(long int)。这样的话就降低了循环的次数，从而从整体上提高了效率。这里它使用了两个技巧，一个是由于传进来的字符串的地址有可能不是4字节(long int)对其的，因此首先需要遍历字符串从而找到4字节对其的那个地址。然后再进行比较. 第二个技巧就是如何高效的判断4个字节中是否有字节为0. 下来来看源码，这个源码的注释还是满详细的。这里主要都是一些位计算的技巧： size_t strlen (str) const char str; { const char char_ptr; const unsigned long int longword_ptr; unsigned long int longword, himagic, lomagic; / Handle the first few characters by reading one character at a time. Do this until CHAR_PTR is aligned on a longword boundary. / for (char_ptr = str; ((unsigned long int) char_ptr & (sizeof (longword) - 1)) != 0; ++char_ptr) if (char_ptr == '\0') return char_ptr - str; /* All these elucidatory comments refer to 4-byte longwords, but the theory applies equally well to 8-byte longwords. / longword_ptr = (unsigned long int ) char_ptr; /* Bits 31, 24, 16, and 8 of this number are zero. Call these bits the "holes." Note that there is a hole just to the left of each byte, with an extra at the end: bits: 01111110 11111110 11111110 11111111 bytes: AAAAAAAA BBBBBBBB CCCCCCCC DDDDDDDD The 1-bits make sure that carries propagate to the next 0-bit. The 0-bits provide holes for carries to fall into. / himagic = 0x80808080L; lomagic = 0x01010101L; if (sizeof (longword) > 4) { / 64-bit version of the magic. / / Do the shift in two steps to avoid a warning if long has 32 bits. / himagic = ((himagic << 16) << 16) \| himagic; lomagic = ((lomagic << 16) << 16) \| lomagic; } if (sizeof (longword) > 8) abort (); / Instead of the traditional loop which tests each character, we will test a longword at a time. The tricky part is testing if any of the four bytes in the longword in question are zero. / for (;;) { longword = longword_ptr++; if (((longword - lomagic) & ~longword & himagic) != 0) { /* Which of the bytes was the zero? If none of them were, it was a misfire; continue the search. / const char cp = (const char ) (longword_ptr - 1); if (cp[0] == 0) return cp - str; if (cp[1] == 0) return cp - str + 1; if (cp[2] == 0) return cp - str + 2; if (cp[3] == 0) return cp - str + 3; if (sizeof (longword) > 4) { if (cp[4] == 0) return cp - str + 4; if (cp[5] == 0) return cp - str + 5; if (cp[6] == 0) return cp - str + 6; if (cp[7] == 0) return cp - str + 7; } } } } 代码比较长,不过注释也比较详细,这里主要分析下两个位运算: for (char_ptr = str; ((unsigned long int) char_ptr & (sizeof (longword) - 1)) != 0; ++char_ptr) if (char_ptr == '\0') return char_ptr - str; 这里是做一个循环来判断4-byte对齐的那个地址,这里一般的编译器传递进来的字符串指针都是4-byte对其的,因此这步在很多编译器都是跳过的. 下来来看如何在4字节的数字中判断是否有某个字节为0. himagic = 0x80808080L; lomagic = 0x01010101L; if (((longword - lomagic) & ~longword & himagic) != 0) 这里分两个部分看. 第一部分(longword - lomagic) 这个表达式是用来判断最高位的,它的意思是: 如果longword的任何一个字节如果大于0x80或者等于0的情况下,它的这个字节的最高位的值都会是不等于0的. 第二部分(~longword & himagic)这个表达式是用来判断longword的每个字节的最高位的设置,也就是判断longword是否小于0x80,如果小于0x80则这个表达式的值是himagic(0x80808080).否则就是0. 因此这两部分合并起来刚好就是如果longword的某个字节小于0x80,并且某个字节为0,则整个表达式的值不为0. 而我们知道传进来的字符串的每个字符的大小的ascii码的范围就是[0,127],也就是肯定是小于0x80的.所以这个表达式就刚好是我们所要的. 声明：ITeye文章版权属于作者，受法律保护。没有作者书面许可不得转载。推荐链接
返回顶楼

andrew913 等级: 性别: 文章: 100 积分: 150 来自: 杭州	发表时间：2009-08-04 strlen，非常精妙。
返回顶楼	回帖地址 0 0 请登录后投票

mryufeng 等级: 性别: 文章: 526 积分: 1780 来自: 广州	发表时间：2009-08-04 glibc的作者也就是这位先生写了如下的书 http://mryufeng.iteye.com/blog/429084
返回顶楼	回帖地址 0 0 请登录后投票

joshzhu 等级: 初级会员性别: 文章: 17 积分: 30 来自: 广州	发表时间：2009-08-04 64位下8字节呢。BTW，还有另一种写法： if (((longword - lomagic) & himagic) != 0) { ... } P.S.楼主看的glibc是什么版本的？
返回顶楼	回帖地址 0 0 请登录后投票

simohayha 等级: 性别: 文章: 1513 积分: 951 来自: 火星	发表时间：2009-08-04 joshzhu 写道 64位下8字节呢。BTW，还有另一种写法： if (((longword - lomagic) & himagic) != 0) { ... } P.S.楼主看的glibc是什么版本的？你的这种写法,貌似在2.10当bug被fix了: http://sourceware.org/ml/glibc-cvs/2009-q1/msg00512.html http://sources.redhat.com/bugzilla/show_bug.cgi?id=5807 我看的版本是2.10..
返回顶楼	回帖地址 0 0 请登录后投票

andrew913 等级: 性别: 文章: 100 积分: 150 来自: 杭州	发表时间：2009-08-05 始终认为应届生应该来搞搞这些东西。比做那种XXXXX系统的项目经验有前途多了。
返回顶楼	回帖地址 0 0 请登录后投票

andrew913 等级: 性别: 文章: 100 积分: 150 来自: 杭州	发表时间：2009-08-05 himagic = 0x80808080L; lomagic = 0x01010101L; (longword - lomagic) & himagic 貌似比较好理解，不知道为什么被 bug-fix了如果有中文，貌似和挨个遍历没区别。不知道有没有高效的方法。另外如果4个字节的并排对比，万一第一个字节就已经是 '\0'了呢？那不是越界了。
返回顶楼	回帖地址 0 0 请登录后投票

joshzhu 等级: 初级会员性别: 文章: 17 积分: 30 来自: 广州	发表时间：2009-08-05 谢谢simohayha兄的考古挖掘工作下面是我的几个观点： 1、if (((longword - lomagic) & himagic) != 0) 和 if (((longword - lomagic) & ~longword & himagic) != 0) 这两种写法都是对的，也就是说，先前的glibc没有bug！而新版本的glibc多出的& ~longword从效率上看更慢。因此这个fix是个倒退。 2、两种写法的strlen都可以正确处理值大于128的情况。 3、这段注释也应该去掉 /* Bits 31, 24, 16, and 8 of this number are zero. Call these bits the "holes." Note that there is a hole just to the left of each byte, with an extra at the end: bits: 01111110 11111110 11111110 11111111 bytes: AAAAAAAA BBBBBBBB CCCCCCCC DDDDDDDD The 1-bits make sure that carries propagate to the next 0-bit. The 0-bits provide holes for carries to fall into. */ P.S. 等俺有空给glibc那帮人写封mail？:wink:
返回顶楼	回帖地址 0 0 请登录后投票

simohayha 等级: 性别: 文章: 1513 积分: 951 来自: 火星	发表时间：2009-08-05 joshzhu 写道谢谢simohayha兄的考古挖掘工作下面是我的几个观点： 1、if (((longword - lomagic) & himagic) != 0) 和 if (((longword - lomagic) & ~longword & himagic) != 0) 这两种写法都是对的，也就是说，先前的glibc没有bug！而新版本的glibc多出的& ~longword从效率上看更慢。因此这个fix是个倒退。 2、两种写法的strlen都可以正确处理值大于128的情况。 3、这段注释也应该去掉 /* Bits 31, 24, 16, and 8 of this number are zero. Call these bits the "holes." Note that there is a hole just to the left of each byte, with an extra at the end: bits: 01111110 11111110 11111110 11111111 bytes: AAAAAAAA BBBBBBBB CCCCCCCC DDDDDDDD The 1-bits make sure that carries propagate to the next 0-bit. The 0-bits provide holes for carries to fall into. */ P.S. 等俺有空给glibc那帮人写封mail？:wink: 哈,我也是觉得有些奇怪,明显多了一次与操作,效率会变慢. 而且去掉~longword,也看不出那里错了,可Ulrich Drepper偏偏就当bug给fix了..支持你发email问他..
返回顶楼	回帖地址 0 0 请登录后投票

linac 等级: 初级会员性别: 文章: 1 积分: 30 来自: 福州	发表时间：2009-08-10 这里数学原理： a-1如果不进位，则最高位不变， ~a的最高位与a相反，因此(a-1) & ~a的最高位为0。仅当a=0时，a-1进位，最高位改变，与~a的最高位一致，则(a-1)&~a的最高位为1 明显的一个bug。
返回顶楼	回帖地址 1 0 请登录后投票

论坛首页 → 编程语言技术版

跳转论坛: