[WIP][RFC][PATCH 1-3/x] chacha: add asm optimizations from Andrew Moon