Symbolic Regression
Symbolic regression (SR) is a machine learning technique aiming to discover concise, interpretable mathematical expressions that accurately model data. Current research emphasizes improving the efficiency and accuracy of SR algorithms, such as genetic programming and newer approaches incorporating neural networks and large language models, often focusing on techniques to handle noisy data and limited datasets. These advancements are significantly impacting scientific fields like materials science and physics by enabling the discovery of underlying physical laws and the development of more accurate and interpretable models for complex systems. The resulting interpretable models enhance scientific understanding and facilitate more efficient data analysis across various disciplines.
Papers
Minimum variance threshold for epsilon-lexicase selection
Guilherme Seidyo Imai Aldeia, Fabricio Olivetti de Franca, William G. La Cava
Interpretability in Symbolic Regression: a benchmark of Explanatory Methods using the Feynman data set
Guilherme Seidyo Imai Aldeia, Fabricio Olivetti de Franca
Inexact Simplification of Symbolic Regression Expressions with Locality-sensitive Hashing
Guilherme Seidyo Imai Aldeia, Fabricio Olivetti de Franca, William G. La Cava
PruneSymNet: A Symbolic Neural Network and Pruning Algorithm for Symbolic Regression
Min Wu, Weijun Li, Lina Yu, Wenqiang Li, Jingyi Liu, Yanjie Li, Meilan Hao
Empowering Machines to Think Like Chemists: Unveiling Molecular Structure-Polarity Relationships with Hierarchical Symbolic Regression
Siyu Lou, Chengchun Liu, Yuntian Chen, Fanyang Mo