Cheminformatics Task
Cheminformatics leverages computational methods to analyze and predict the properties of chemical compounds, accelerating drug discovery and materials science. Current research emphasizes the development and application of large language models (LLMs), graph neural networks (GNNs), and other deep learning architectures to analyze molecular structures represented as SMILES strings or graphs, often incorporating techniques like contrastive learning and self-supervised pre-training for improved performance. These advancements enable more efficient prediction of molecular properties, such as toxicity and activity, and facilitate tasks like retrosynthesis prediction and virtual screening, ultimately streamlining chemical research and development.
Papers
Scikit-fingerprints: easy and efficient computation of molecular fingerprints in Python
Jakub Adamczyk, Piotr Ludynia
From 2015 to 2023: How Machine Learning Aids Natural Product Analysis
Suwen Shi, Ziwei Huang, Xingxin Gu, Xu Lin, Chaoying Zhong, Junjie Hang, Jianli Lin, Claire Chenwen Zhong, Lin Zhang, Yu Li, Junjie Huang
CACTUS: Chemistry Agent Connecting Tool-Usage to Science
Andrew D. McNaughton, Gautham Ramalaxmi, Agustin Kruel, Carter R. Knutson, Rohith A. Varikoti, Neeraj Kumar
The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA
Lee Youngmin, Lang S. I. D. Andrew, Cai Duoduo, Wheat R. Stephen