Information Theoretic Guarantees For Policy Alignment In Large Language Models [2406.05883]