Molecular Representation
Molecular representation focuses on encoding the complex information within molecules into formats suitable for machine learning, aiming to improve predictions of molecular properties and facilitate drug discovery and materials science. Current research emphasizes multimodal approaches, integrating various data types like molecular graphs, SMILES strings, and textual descriptions, often leveraging graph neural networks (GNNs), transformers, and contrastive learning methods. These advancements enable more accurate and efficient prediction of molecular properties, accelerating the design and development of new molecules with desired characteristics. The resulting improvements in molecular understanding have significant implications for diverse fields, including drug discovery, materials science, and environmental chemistry.
Papers
Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models
Tianyu Zhang, Yuxiang Ren, Chengbin Hou, Hairong Lv, Xuegong Zhang
Rethinking Molecular Design: Integrating Latent Variable and Auto-Regressive Models for Goal Directed Generation
Heath Arthur-Loui, Amina Mollaysa, Michael Krauthammer
Analysis of Atom-level pretraining with Quantum Mechanics (QM) data for Graph Neural Networks Molecular property models
Jose Arjona-Medina, Ramil Nugmanov
GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices
Thao Nguyen, Tiara Torres-Flores, Changhyun Hwang, Carl Edwards, Ying Diao, Heng Ji