A Proposal to Study "Is High Quality Data All We Need?" [2203.06404]