Research

Research

The Spectroscopy and AI Lab (SAIL), led by Joonyoung F. Joung, integrates artificial intelligence with chemistry to study chemical reaction prediction and the discovery of new materials.

Prediction of Physicochemical Properties

We leverage machine learning models and quantum chemical calculations (e.g., DFT) to predict critical physicochemical properties of molecules, such as optical properties, energy levels, and solubility. These predictions not only aid in discovering desired materials or pharmaceuticals but also support the optimization of separation processes by providing insights into solubility behavior.

Deep learning model relating chromophore and solvent structure to optical properties
From our work in JACS Au (2021).

Chemical Reaction Prediction

Chemical reaction prediction has traditionally relied on expert knowledge and reaction rules, making it difficult to anticipate unexpected byproducts or impurities. We develop models that predict products either in an end-to-end manner or by learning mechanistic pathways. This helps to explain impurity formation and ultimately discover novel reactions.

Electron-flow matching model architecture for reaction mechanism prediction
From our work in Nature (2025).

De Novo Molecular Design

Traditional approaches to molecular design rely heavily on trial-and-error experimentation, even when the desired properties are known. At SAIL, we develop machine learning-based de novo design techniques that generate candidate molecules expected to exhibit target properties. This accelerates material discovery while reducing cost and effort. In addition to generative modeling, we leverage quantum chemical methods such as DFT to understand how molecular structure determines properties. This allows us to extract design principles grounded in theory and propose new molecular structures with improved performance.

Generative deep learning model that designs molecules for target properties
From our work in ACS Cent. Sci. (2025).

Database Development

Reliable AI models require high-quality data. We build structured databases by extracting information from experiments and literature, and use these data for training and benchmarking models.

Experimental optical-property data: absorption, emission, and lifetime
From our work in Sci. Data (2020).