Mathematical Corpus
Mathematical corpora are large collections of mathematical text and code used to train machine learning models capable of advanced mathematical reasoning. Current research focuses on developing and evaluating these corpora, exploring different model architectures like autoregressive decoders, and benchmarking performance across various formal mathematical languages and tokenization methods. The creation of high-quality, diverse corpora and the resulting improved models have significant implications for automating mathematical tasks, enhancing mathematical education, and facilitating interdisciplinary research.
Papers
October 15, 2024
April 22, 2024
February 12, 2024
December 28, 2023