make pthread_exit run dtors for last thread, wait to decrement thread count
[musl] / include / malloc.h