data
Molecular data abstraction: Molecule, Dataset, FormatConverter
Machine Learning & Quantum Chemistry Driven Chemical Research Platform
This page shows a long-term technical direction. Reviewers can return to the project section, verify certificates, or open the one-page resume.
ChemAI Lab is a unified chemistry + AI research platform designed to provide an end-to-end toolchain at the intersection of computational chemistry and machine learning. It integrates multiple open-source cheminformatics tools (RDKit, Chemprop, DeepChem, DScribe, MLatom, Molfeat) with a unified abstraction layer for data handling, feature engineering, model training, explainable AI, and quantum chemistry interfaces.
The project currently focuses on asymmetric organocatalysis with chiral phosphoric acids (CPAs), following a scientific roadmap of 5 stages across 100 development phases β from data infrastructure through to inverse molecular design.
16 subpackages covering the full research workflow
Molecular data abstraction: Molecule, Dataset, FormatConverter
Molecular featurization: MolecularFeaturizer, FeatureStore
Model registry: ModelHub, ModelRegistry
Neural network modules & custom layers
Workflow orchestration: Workflow, PipelineStep
Quantum chemistry interface: QMInterface
Auto hyperparameter optimization
Model evaluation metrics
Explainable AI
Data visualization
Model zoo & pretrained weights
CLI, config management, utilities
Serialization utilities
Model hub & version management
Asymmetric organocatalysis with chiral phosphoric acids (CPAs) β 100 phases across 5 stages
Standardized formats, descriptor libraries, reaction encoding, visualization tools
Phases 1-20CPA-specific descriptors, enantioselectivity prediction, SHAP analysis, multi-task yield/ee models
Phases 21-40Meta-learning, data augmentation, active learning, transfer learning
Phases 41-60DFT feature fusion, reaction surface modeling, causal inference, physics-constrained NNs
Phases 61-80Target-driven molecular generation, inverse condition optimization, automated lab loops, multi-objective optimization
Phases 81-100