This repository contains the source code for the Bayesian Gaussian Mixture Model (BGMM) and Gaussian Process (GP) classification analysis described in the paper "Catassembly Triad: A Catalytic Framework for Enantioselective Chiral Molecular Assembly". The implementation quantitatively validates catalyst efficacy prediction through triad descriptors (attachability, controllability, detachability).
- Iterative BGMM clustering with automatic component selection
- ELBO convergence analysis for model stability
- 3D probability mapping with Gaussian Processes
- Visualization of cluster evolution and classification confidence
- Experimental validation
- Create virtual environment:
python -m venv .venv
source .venv/bin/activate # Linux/Mac
.venv\Scripts\activate # Windows- Install dependencies:
pip install numpy pandas scikit-learn matplotlib plotly kaleidoor:
uv add numpy├── data/ # Input datasets
│ └── data_sorted_by_No_in_article.csv
├── src/
│ ├── bgmm.py # Bayesian GMM implementation
│ ├── elbo_plot.py # ELBO convergence visualization
│ └── gp_plotly_3D.py # 3D probability mapping
├── reports/ # Generated analysis reports
├── output/ # Visualizations and plots
└── pyproject.toml # Dependency configuration- Run BGMM Analysis
python src/bgmm.pyGenerates: Cluster reports in reports/ 3D visualizations in output/
- Generate ELBO Plot
python src/elbo_plot.pyGenerates: ELBO visualizations in output/
- Create 3D Probability Map
python src/gp_plotly_3D.pyGenerates: GP 3D visualizations in output/
If you find this code useful in your research, please cite the following paper:
@article{Li2025Catassembly,
title = {Catassembly Triad: A Catalytic Framework for Enantioselective Chiral Molecular Assembly},
author = {Li, Zhihao and Huang, Xuehai and Jiang, Yansong and Zeng, Wei and Zou, Ding and Dong, Xue and Yang, Liulin and Cao, Xiaoyu and Tian, Zhongqun and Wang, Yu},
journal = {Journal of the American Chemical Society},
year = {2025},
doi = {10.1021/jacs.5c16840},
url = {https://doi.org/10.1021/jacs.5c16840}
}