ThinK: Thinner Key Cache by Query-Driven Pruning [2407.21018]