End-to-end speech recognition modeling from de-identified data [2207.05469]