x86, core: Optimize hweight32()