On Mon, Apr 26, 2010 at 01:57:29AM +0200, Joakim Tjernlund wrote:
Ah, now I think I know. The while(buf < end) is optimized for post inc so that is why.
tested little and was surprised, only 3-5% slower with the while loop compared to my for loop, it is mainly the post increment that does that. On x86 I can hardly see any difference between post and pre inc.
I also got 5% slowdown on MIPS. If i replaced while(buf < end) with while(buf != end), i got no slowdown.
However, gcc won't inline add32 as it is too big on ppc and that is a disaster. Could you add inline to add32?
There is 'inline' in a current git. -- Elen sila lumenn' omentielvo Ondrej 'SanTiago' Zajicek (email: santiago@crfreenet.org) OpenPGP encrypted e-mails preferred (KeyID 0x11DEADC3, wwwkeys.pgp.net) "To err is human -- to blame it on a computer is even more so."