When can transformers compositionally generalize in-context? [2407.12275]