Paper ID: 2411.05276
GPT Semantic Cache: Reducing LLM Costs and Latency via Semantic Embedding Caching
Sajal Regmi, Chetan Phakami Pun
Large Language Models (LLMs), such as GPT (Radford et al., 2019), have significantly advanced artificial intelligence by enabling sophisticated natural language understanding and generation. However, the high computational and financial costs associated with frequent API calls to these models present a substantial bottleneck, especially for applications like customer service chatbots that handle repetitive queries. In this paper, we introduce GPT Semantic Cache, a method that leverages semantic caching of query embeddings in in-memory storage (Redis). By storing embeddings of user queries, our approach efficiently identifies semantically similar questions, allowing for the retrieval of pre-generated responses without redundant API calls to the LLM. This technique reduces operational costs and improves response times, enhancing the efficiency of LLM-powered applications.
Submitted: Nov 8, 2024
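
The abstract describes the core loop: embed an incoming query, compare it against cached query embeddings, and return the stored response on a sufficiently similar match instead of calling the LLM. Below is a minimal, hedged sketch of that idea. The paper stores embeddings in Redis; this example substitutes a plain in-memory Python list, and `embed`, `call_llm`, and the 0.9 similarity threshold are placeholder assumptions, not the authors' implementation.

```python
import numpy as np

# Placeholder stand-ins for the embedding model and the expensive LLM API call.
def embed(text: str) -> np.ndarray:
    """Toy embedding (a real system would use e.g. a sentence-embedding model)."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.normal(size=384)
    return v / np.linalg.norm(v)

def call_llm(query: str) -> str:
    """Placeholder for the costly LLM API request."""
    return f"LLM answer to: {query}"

class SemanticCache:
    """Semantic cache sketch: store (query embedding, response) pairs and
    answer from the cache when a new query is close enough to an old one."""

    def __init__(self, threshold: float = 0.9):
        self.threshold = threshold   # cosine-similarity cutoff (assumed value)
        self.entries = []            # list of (unit-norm embedding, cached response)

    def query(self, text: str) -> str:
        q = embed(text)
        if self.entries:
            # Cosine similarity reduces to a dot product for unit-norm vectors.
            sims = np.array([q @ e for e, _ in self.entries])
            best = int(np.argmax(sims))
            if sims[best] >= self.threshold:
                return self.entries[best][1]   # cache hit: no LLM call needed
        # Cache miss: call the LLM, then store the embedding and response.
        response = call_llm(text)
        self.entries.append((q, response))
        return response

cache = SemanticCache(threshold=0.9)
print(cache.query("How do I reset my password?"))
# With a real embedding model, a paraphrase like the one below would likely
# exceed the threshold and be served from the cache.
print(cache.query("How can I reset my password?"))
```

In a production setting the cached embeddings would live in Redis (as in the paper) and the linear scan would be replaced by an approximate nearest-neighbor vector search.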