Paper ID: 2401.03240

Interpreting Adaptive Gradient Methods by Parameter Scaling for Learning-Rate-Free Optimization

Min-Kook Suh, Seung-Woo Seo

We address the challenge of estimating the learning rate for adaptive gradient methods used in training deep neural networks. Several learning-rate-free approaches have been proposed, but they are typically tailored to steepest descent. Steepest descent offers an intuitive route to minima, yet many deep learning applications require adaptive gradient methods to achieve faster convergence. In this paper, we interpret adaptive gradient methods as steepest descent applied to parameter-scaled networks and thereby propose learning-rate-free adaptive gradient methods. Experimental results verify the effectiveness of this approach, demonstrating performance comparable to hand-tuned learning rates across various scenarios. This work extends the applicability of learning-rate-free methods, enhancing training with adaptive gradient methods.
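To make the parameter-scaling interpretation concrete, here is a minimal worked derivation under simplifying assumptions (a fixed diagonal preconditioner $D$, treated as constant during the step; the symbols $D$, $u$, and $g$ are introduced for exposition and are not taken from the paper):

\[
D = \operatorname{diag}\!\big(\sqrt{v_t}\big), \qquad
u = D^{1/2} x, \qquad
g(u) = f\!\big(D^{-1/2} u\big), \qquad
\nabla g(u) = D^{-1/2} \nabla f(x).
\]

A steepest-descent step on the scaled parameters, $u \leftarrow u - \eta\, \nabla g(u)$, maps back through $x = D^{-1/2} u$ to

\[
x \;\leftarrow\; x - \eta\, D^{-1} \nabla f(x),
\]

which is exactly a diagonally preconditioned (Adagrad/Adam-style) update. Under this reading, a learning-rate-free rule designed for steepest descent can plausibly be run in the scaled space $u$, removing the need to hand-tune $\eta$ for the adaptive method.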

Submitted: Jan 6, 2024