Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations [2308.13007]