Paper ID: 2410.19912
Simmering: Sufficient is better than optimal for training neural networks
Irina Babayan, Hazhir Aliahmadi, Greg van Anders
The broad range of neural network training techniques that invoke optimization but rely on ad hoc modification for validity suggests that optimization-based training is misguided. The shortcomings of optimization-based training are thrown into particularly strong relief by the problem of overfitting, where naive optimization produces spurious outcomes. The broad success of neural networks for modelling physical processes has prompted advances that invert the direction of investigation and treat neural networks as if they were physical systems in their own right. These successes raise the question of whether broader, physical perspectives could motivate the construction of improved training algorithms. Here, we introduce simmering, a physics-based method that trains neural networks to generate weights and biases that are merely "good enough" but that, paradoxically, outperforms leading optimization-based approaches. Using classification and regression examples, we show that simmering corrects neural networks that are overfit by Adam, and that simmering avoids overfitting if deployed from the outset. Our results question optimization as a paradigm for neural network training and leverage information-geometric arguments to point to the existence of classes of sufficient training algorithms that do not take optimization as their starting point.
Submitted: Oct 25, 2024
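
The abstract does not spell out the simmering algorithm itself. As a rough illustration of the contrast it draws between optimization-based training and a physics-style, "sufficient" fit, the minimal NumPy sketch below assumes a Langevin-type interpretation: plain gradient steps with injected thermal noise at a fixed temperature, halted once the loss is merely good enough rather than driven toward an optimum. The `temperature` and `good_enough_loss` knobs are hypothetical illustration parameters, not quantities taken from the paper.

```python
# Illustrative sketch only: assumes a Langevin-style reading of "physics-based,
# sufficient rather than optimal" training; this is not the paper's simmering algorithm.
import numpy as np

rng = np.random.default_rng(0)

# Tiny one-hidden-layer regression network on noisy data.
X = np.linspace(-1.0, 1.0, 64).reshape(-1, 1)
y = np.sin(3.0 * X) + 0.1 * rng.standard_normal(X.shape)

W1 = 0.5 * rng.standard_normal((1, 16)); b1 = np.zeros(16)
W2 = 0.5 * rng.standard_normal((16, 1)); b2 = np.zeros(1)

lr = 0.05                # step size
temperature = 1e-4       # magnitude of injected thermal noise (assumed knob)
good_enough_loss = 0.02  # stop at a "sufficient" fit instead of optimizing further

for step in range(20000):
    # Forward pass.
    h = np.tanh(X @ W1 + b1)
    pred = h @ W2 + b2
    loss = np.mean((pred - y) ** 2)
    if loss < good_enough_loss:
        break

    # Backward pass (mean-squared-error gradients).
    d_pred = 2.0 * (pred - y) / len(X)
    dW2 = h.T @ d_pred; db2 = d_pred.sum(0)
    d_h = d_pred @ W2.T * (1.0 - h ** 2)
    dW1 = X.T @ d_h; db1 = d_h.sum(0)

    # Langevin-style update: gradient step plus Gaussian noise scaled by temperature,
    # so weights fluctuate around "good enough" regions rather than collapsing to an optimum.
    sigma = np.sqrt(2.0 * lr * temperature)
    W1 -= lr * dW1 + sigma * rng.standard_normal(W1.shape)
    b1 -= lr * db1 + sigma * rng.standard_normal(b1.shape)
    W2 -= lr * dW2 + sigma * rng.standard_normal(W2.shape)
    b2 -= lr * db2 + sigma * rng.standard_normal(b2.shape)

print(f"stopped at step {step} with loss {loss:.4f}")
```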