Skip to content

ML+AI

  • pQSAR: build massively multitask, two-step machine learning models with unprecedented scope, accuracy, and applicability domain
    Code
    Last Commit
    Publication
  • ZairaChem: Automated ML-based (Q)SAR
    Code
    Last Commit
    Publication
  • MolPipeline:
    Code
    Last Commit
    Publication

CNN

  • conv_qsar_fast: QSAR/QSPR using descriptor-free molecular embedding
    Code
    Last Commit

Complete Package

  • SPOC: A tool for calculating spatial and physicochemical descriptors from molecular dynamics simulations.
    Code
    Last Commit
    Publication
  • DeepPurpose: A Deep Learning Library for Compound and Protein Modeling, DTI, Drug Property, PPI, DDI, Protein Function Prediction
    Code
    Last Commit
    Publication
  • Oloren ChemEngine: unified API for the development and use of molecular property predictors
    Code
    Last Commit

DNN

  • DeepNeuralNet-QSAR: For training a multi-task DNN with dense QSAR dataset(s)
    Code
    Last Commit

Ensemble

  • AIS-Ensemble:
    Code
    Last Commit
    Publication

Few-Shot

  • FewGS: This repository contains source code and datasets for "Few-Shot Graph and SMILES Learning for Molecular Property Prediction."
    Code
    Last Commit
  • Few-Shot-Learning-for-Low-Data-Drug-Discovery:
    Code
    Last Commit
    Publication.)
  • FS-Mol: A Few-Shot Learning Dataset of Molecules
    Code
    Last Commit
    Publication
  • Meta-MGNN:
    Code
    Last Commit
  • MHNfs: Context-enriched molecule representations improve few-shot drug discovery, available on HuggingFace
    Code
    Last Commit
    Publication
  • MolFesCue: molecular property prediction in Data-Limited and imbalanced contexts using Few-Shot and contrastive learning
    Code
    Last Commit
  • PG-DERN:
    Code
    Last Commit
    Publication
  • MolecularGPT:
    Code
    Last Commit
    Publication

GAT

  • LGGA:
    Code
    Last Commit

GNN

  • MolE: Molecular representations through redundancy reduction of Embeddings
    Code
    Last Commit
    Publication
  • CheMixNet: A mixed DNN architecture that predicts chemical properties using multiple molecular representations.
    Code
    Last Commit
    Publication
  • DeepDelta: A pairwise deep learning approach predicting property differences between two molecules.
    Code
    Last Commit
    Publication
  • MMGX:
    Code
    Last Commit
    Publication
  • MolCLR: Molecular Contrastive Learning of Representations via Graph Neural Networks
    Code
    Last Commit
    Publication
  • molgraph: Offers graph neural networks with TensorFlow and Keras for molecular machine learning, focusing on compatibility and ease of use.
    Code
    Last Commit

Graph-Fusion

  • MolProp:
    Code
    Last Commit
    Publication

Hybrid

  • FGNN:
    Code
    Last Commit
    Publication

LLM

  • MolPMoFiT: transfer learning method based on self-supervised pre-training + task-specific fine-tuning for QSPR/QSAR modeling
    Code
    Last Commit
    Publication
  • MolecularGPT:
    Code
    Last Commit
    Publication

MPGNN

  • Graphormer: deep learning package that allows researchers and developers to train custom models for molecule modeling tasks
    Code
    Last Commit
    Publication

Other NN

  • ChemProp: Features a deep learning approach for molecular property prediction, focusing on scalability and fast uncertainty quantification.
    Code
    Last Commit
    Publication
  • GeminiMol: Incorporates conformational space profile into molecular representation learning, enhancing drug discovery including virtual screening, target identification, and QSAR.
    Code
    Last Commit
    Publication

Pretrained Models

  • OPERA: Open-source QSAR models for pKa prediction using multiple machine learning approaches. Also suite of QSAR models (windows, linux), recent implementation (CATMoS Acute Toxicity Modeling Suite, acute oral toxicity) (standalone).
    Code
    Last Commit
    Publication

Reviews

  • A Survey of Graph Neural Network Architectures in Ligand Binding Affinity Prediction Models:
    Publication

Several architectures

  • DeltaClassifiers: a novel molecular pairing approach to process this data. This creates a new classification task of predicting which one of two paired molecules is more potent.
    Code
    Last Commit
    Publication

Transfer Learning

  • PGM:
    Code Publication

Transformer

  • KnoMol:
    Code
    Last Commit
    Publication
  • ChemBERTa: BERT-like models applied to chemical SMILES data for drug design, chemical modelling, and property prediction
    Code
    Last Commit
    Publication
  • GPT-MolBERTa: A text-based molecular property prediction model utilizing a novel approach to represent SMILES molecules.
    Code
    Last Commit
  • PointGAT:
    Code
    Last Commit
  • TOML-BERT:
    Code
    Last Commit
  • X-MOL: large-scale pre-training for molecular understanding and diverse molecular analysis
    Code
    Last Commit
    Publication