Extending the domain of transparent checkpoint-restart for large-scale HPC