Exploring Low-Cost Transformer Model Compression for Large-Scale Commercial Reply Suggestions [2111.13999]