Paper ID: 2309.07988
Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition
Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Zhaoheng Ni, Ernie Chang, Yangyang Shi, Vikas Chandra
Transformer-based models excel in speech recognition. Existing efforts to optimize Transformer inference, typically for long-context applications, center on simplifying attention score calculations. However, streaming speech recognition models usually process a limited number of tokens each time, making attention score calculation less of a bottleneck. Instead, the bottleneck lies in the linear projection layers of multi-head attention and feedforward networks, which constitute a substantial portion of the model size and contribute significantly to computation, memory, and power usage. To address this bottleneck, we propose folding attention, a technique that targets these linear layers, significantly reducing model size and improving memory and power efficiency. Experiments on on-device Transformer-based streaming speech recognition models show that folding attention reduces model size (and corresponding memory consumption) by up to 24% and power consumption by up to 23%, all without compromising model accuracy or increasing computation overhead.
Submitted: Sep 14, 2023
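
The abstract's central claim, that for short streaming chunks the linear projections rather than the attention scores dominate cost, can be checked with standard per-layer Transformer parameter and multiply-accumulate (MAC) arithmetic. The sketch below is not the paper's folding-attention method; the chunk length T, model width d, and 4x FFN expansion are hypothetical values chosen only to illustrate the bottleneck the abstract describes.

```python
# Back-of-the-envelope comparison (illustrative, not from the paper):
# attention-score computation costs O(T^2 * d) MACs, while the linear
# projections in multi-head attention (4*d^2 parameters) and the
# feed-forward network (2*expansion*d^2 parameters) cost O(T * d^2).
# For the small T typical of streaming ASR, the projections dominate.

def per_layer_costs(T: int, d: int, ffn_expansion: int = 4):
    """Return (projection params, projection MACs, attention-score MACs)."""
    mha_proj_params = 4 * d * d             # Q, K, V, and output projections
    ffn_params = 2 * ffn_expansion * d * d  # two FFN linear layers
    proj_params = mha_proj_params + ffn_params

    proj_macs = T * proj_params             # every token passes through each projection
    attn_score_macs = 2 * T * T * d         # QK^T scores plus attention-weighted sum of V

    return proj_params, proj_macs, attn_score_macs


if __name__ == "__main__":
    T, d = 32, 512  # hypothetical streaming chunk length and model width
    proj_params, proj_macs, attn_macs = per_layer_costs(T, d)
    print(f"projection parameters per layer: {proj_params / 1e6:.1f} M")
    print(f"projection MACs per chunk:       {proj_macs / 1e6:.1f} M")
    print(f"attention-score MACs per chunk:  {attn_macs / 1e6:.1f} M")
    # With T=32, d=512: ~3.1 M projection params and ~100 M projection MACs,
    # versus only ~1 M attention-score MACs per layer.
```

Under these assumed settings, the projection layers account for essentially all of the per-layer parameters and roughly two orders of magnitude more compute than the attention scores, which is why the paper targets the projections rather than the attention-score calculation.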