Paper ID: 2204.12768
Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Dading Chong, Helin Wang, Peilin Zhou, Qingcheng Zeng
Transformer-based models attain excellent results and generalize well when trained on sufficient amounts of data. However, constrained by the limited data available in the audio domain, most transformer-based models for audio tasks are fine-tuned from models pre-trained in other domains (e.g., images), which exhibit a notable domain gap with audio. Other methods explore self-supervised learning directly in the audio domain but currently do not perform well on downstream tasks. In this paper, we present a novel self-supervised learning method for transformer-based audio models, called masked spectrogram prediction (MaskSpec), to learn powerful audio representations from unlabeled audio data (AudioSet in this paper). Our method masks random patches of the input spectrogram and reconstructs the masked regions with an encoder-decoder architecture. Experimental results on multiple downstream datasets demonstrate that MaskSpec, without using extra model weights or supervision, achieves a significant performance gain over supervised methods and outperforms previous pre-trained models. In particular, our best model reaches 0.471 mAP on AudioSet, 0.854 mAP on OpenMIC2018, 0.982 accuracy on ESC-50, 0.976 accuracy on SCV2, and 0.823 accuracy on DCASE2019 Task1A.
Submitted: Apr 27, 2022
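
As an illustration of the masking-and-reconstruction scheme described in the abstract, below is a minimal PyTorch sketch of MAE-style masked spectrogram prediction: the spectrogram is split into patches, a random subset is kept, an encoder processes only the visible patches, and a lightweight decoder reconstructs the masked ones with an MSE loss on the masked positions. The patch size, masking ratio, model dimensions, and all class and function names are assumptions for illustration, not the paper's actual implementation.

```python
# Minimal sketch of MaskSpec-style pre-training (illustrative, not the authors' code).
# Patch size, masking ratio, and model dimensions below are assumed values.
import torch
import torch.nn as nn


class MaskSpecSketch(nn.Module):
    def __init__(self, n_mels=128, n_frames=1024, patch=16, dim=192, mask_ratio=0.75):
        super().__init__()
        self.patch, self.mask_ratio = patch, mask_ratio
        self.num_patches = (n_mels // patch) * (n_frames // patch)
        patch_dim = patch * patch
        self.embed = nn.Linear(patch_dim, dim)                # patch embedding
        self.pos = nn.Parameter(torch.zeros(1, self.num_patches, dim))
        enc_layer = nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=4)
        self.mask_token = nn.Parameter(torch.zeros(1, 1, dim))
        dec_layer = nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
        self.decoder = nn.TransformerEncoder(dec_layer, num_layers=2)
        self.head = nn.Linear(dim, patch_dim)                 # reconstruct raw patch values

    def patchify(self, spec):                                 # spec: (B, n_mels, n_frames)
        B, M, T = spec.shape
        p = self.patch
        x = spec.reshape(B, M // p, p, T // p, p).permute(0, 1, 3, 2, 4)
        return x.reshape(B, -1, p * p)                        # (B, num_patches, p*p)

    def forward(self, spec):
        patches = self.patchify(spec)
        B, N, _ = patches.shape
        x = self.embed(patches) + self.pos

        # Randomly keep a subset of patches; the rest are masked out.
        keep = int(N * (1 - self.mask_ratio))
        noise = torch.rand(B, N, device=spec.device)
        ids_shuffle = noise.argsort(dim=1)
        ids_restore = ids_shuffle.argsort(dim=1)
        ids_keep = ids_shuffle[:, :keep]
        visible = torch.gather(x, 1, ids_keep.unsqueeze(-1).expand(-1, -1, x.size(-1)))

        # Encode only the visible patches.
        enc = self.encoder(visible)

        # Append mask tokens, restore the original patch order, and decode.
        mask_tokens = self.mask_token.expand(B, N - keep, -1)
        full = torch.cat([enc, mask_tokens], dim=1)
        full = torch.gather(full, 1, ids_restore.unsqueeze(-1).expand(-1, -1, full.size(-1)))
        pred = self.head(self.decoder(full + self.pos))

        # MSE loss computed on the masked patches only.
        mask = torch.ones(B, N, device=spec.device)
        mask[:, :keep] = 0
        mask = torch.gather(mask, 1, ids_restore)
        loss = (((pred - patches) ** 2).mean(dim=-1) * mask).sum() / mask.sum()
        return loss


# Example: one pre-training step on a random batch of log-mel spectrograms.
model = MaskSpecSketch()
loss = model(torch.randn(2, 128, 1024))
loss.backward()
```

After pre-training in this fashion, the decoder would be discarded and the encoder fine-tuned on the downstream tasks (tagging, classification, acoustic scene classification) reported in the abstract.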