The Effect of Model Size on Worst-Group Generalization [2112.04094]