Paper ID: 2309.03340
Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation
Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz
There has been significant research on developing pretrained transformer architectures for multimodal-to-text generation tasks. Despite their performance improvements, such models are frequently overparameterized; as a result, they suffer from hallucination and a large memory footprint, making them challenging to deploy on edge devices. In this paper, we address both of these issues for the application of automated audio captioning. First, we propose a data augmentation technique for generating hallucinated audio captions and show that similarity computed in an audio-text shared latent space is suitable for detecting hallucination. Then, we propose a parameter-efficient, inference-time faithful decoding algorithm that enables smaller audio captioning models to achieve performance equivalent to larger models trained with more data. During the beam decoding step, the smaller model utilizes an audio-text shared latent representation to semantically align the generated text with the corresponding input audio. Faithful guidance is introduced into the beam probability by incorporating the cosine similarity between the latent-space projections of greedy rolled-out intermediate beams and the audio clip. We show the efficacy of our algorithm on benchmark datasets and evaluate the proposed scheme against baselines using conventional audio captioning and semantic similarity metrics, while illustrating tradeoffs between performance and complexity.
Submitted: Sep 6, 2023
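
The abstract describes guiding beam search with an audio-text similarity term: each partial beam is greedily rolled out to a full caption, the caption and the audio clip are projected into a shared latent space, and their cosine similarity is added to the beam score. The sketch below illustrates one such guided beam-search step under stated assumptions; `embed_audio`, `embed_text`, `model.top_next_tokens`, and `model.greedy_rollout` are hypothetical stand-ins (e.g., for a CLAP-style shared encoder and a captioning decoder), not the authors' exact components, and the weighting scheme is illustrative only.

```python
import numpy as np

# Hypothetical encoders into a shared audio-text latent space
# (placeholders for a CLAP-style model; not the paper's exact components).
def embed_audio(audio) -> np.ndarray: ...
def embed_text(caption: str) -> np.ndarray: ...

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

def faithful_beam_step(beams, audio, model, tokenizer, alpha=0.5, beam_size=4):
    """One beam-search step with faithfulness guidance (illustrative sketch).

    beams: list of (token_ids, lm_logprob) partial hypotheses.
    alpha: weight of the audio-text similarity term added to the ranking score.
    """
    audio_z = embed_audio(audio)
    scored = []
    for token_ids, lm_logprob in beams:
        # Expand each beam with the decoder's top next-token candidates
        # (model.top_next_tokens is an assumed helper).
        for next_id, next_logp in model.top_next_tokens(token_ids, k=beam_size):
            new_ids = token_ids + [next_id]
            new_lm = lm_logprob + next_logp
            # Greedily roll out the partial beam to a complete caption
            # (model.greedy_rollout is an assumed helper).
            rollout_ids = model.greedy_rollout(new_ids)
            caption = tokenizer.decode(rollout_ids)
            # Faithfulness guidance: cosine similarity between the rolled-out
            # caption and the audio clip in the shared latent space.
            sim = cosine(embed_text(caption), audio_z)
            # Rank by LM log-probability plus the weighted similarity term.
            scored.append((new_lm + alpha * sim, new_ids, new_lm))
    scored.sort(key=lambda t: t[0], reverse=True)
    # Keep the top beams; the stored score remains the pure LM log-probability.
    return [(ids, lm) for _, ids, lm in scored[:beam_size]]
```

In this reading, the language model's log-probability is kept separate from the similarity bonus, which is recomputed at each step purely for ranking; how the paper actually combines the two terms may differ.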