[PATCH 2/5] Do the movd/movq workaround for the osx assembler, for sha3-permute