optimize mempcpy to minimize need for data saved across the call
musl: src/thread/__unmapself.c