Model-CAPERhlA-35M / README.md
POOUCAS's picture
Update README.md
d0609d3 verified
metadata
license: mit

Model Information 🧬

License: MIT

πŸ”¬ Base Model:

westlake-repl/SaProt_35M_AF2

🧩 Task Type:

Protein-level regression

πŸ“Š Dataset:

DATASET-CAPE-RhlA-seqlabel

  • protein: Contains mutation data including the RhlA enzyme sequence and corresponding performance metrics.
  • Label: The experimentally tested fitness score, representing the scaled mutation effect for each mutant.
  • Source: Label derived from CAPE

πŸ”‘ Model Input Type:

Amino acid sequence; label in RhlA

πŸ“ˆ Performance (the best on test set):

Spearman's ρ: 0.862


LoRA Configuration βš™οΈ

  • r: 8
  • LoRA dropout: 0.1
  • LoRA alpha: 8
  • Modules to save: ["regression"]

Training Configuration πŸŽ›οΈ

  • Optimizer:
    • Class: AdamW
    • Betas: (0.9, 0.98)
    • Weight decay: 0.01
  • Learning rate: 5e-5
  • Epochs: 5
  • Batch size: Adaptive