align stack properly for calling global ctors/dtors on x86[_64]