Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions [2312.16613]