UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning [2406.03324]