On Tue, Apr 16, 2013 at 11:55 AM, Niels Möller nisse@lysator.liu.se wrote:
It would also be nice if we could force 16-byte alignment for the l1_key
array (this is important for assembly routines), which would then imply 16-byte alignment for the complete context struct. That could help the x86 SSE2 assembly. It could also help on ARM, but I'm not sure whether the system (primarily the linker and malloc) really makes 16-byte alignment possible there.
Would it make sense to force allocation of the context (i.e., no context on the stack) via a ctx_alloc() function that uses posix_memalign or memalign?
Alternatively, you could have a separate set of functions that operate only on aligned data.
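To make the first suggestion concrete, here is a rough sketch of what such an allocator might look like. The struct name, its fields, the key size, and ctx_free() are hypothetical and only for illustration; only posix_memalign() itself is a real POSIX call. It assumes l1_key is the first member, so that aligning the struct also aligns the array.

```c
#define _POSIX_C_SOURCE 200112L  /* needed for posix_memalign on some systems */
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical context layout; the field names and sizes are assumptions.
   l1_key is placed first so it shares the struct's alignment. */
struct example_ctx
{
  uint32_t l1_key[256];
  uint32_t count;
};

/* Allocate a context on a 16-byte boundary. Returns NULL on failure. */
struct example_ctx *
ctx_alloc (void)
{
  void *p = NULL;

  /* posix_memalign returns 0 on success and an error number otherwise;
     the alignment must be a power of two and a multiple of sizeof(void *). */
  if (posix_memalign (&p, 16, sizeof (struct example_ctx)) != 0)
    return NULL;

  assert (((uintptr_t) p % 16) == 0);
  return p;
}

/* Memory obtained from posix_memalign is released with plain free(). */
void
ctx_free (struct example_ctx *ctx)
{
  free (ctx);
}
```

The drawback is that callers can no longer declare the context on the stack or embed it in their own structs, which is why the separate set of functions for aligned data may be the less intrusive option.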
regards, Nikos