ARM/NEON optimizations