|
--- |
|
base_model: westlake-repl/SaProt_35M_AF2 |
|
library_name: peft |
|
--- |
|
# Base model: [westlake-repl/SaProt_35M_AF2](https://huggingface.co/westlake-repl/SaProt_35M_AF2) |
|
|
|
# Model Card for Model ID |
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
This model is used to predict interaction of antigen-variable domain of heavy chain of heavy chain antibody (VHH) with |
|
severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike proteins. |
|
|
|
This model is trained on Alpha variants, thus can only be used to pridict interaction with Alpha variants. |
|
|
|
### Task type |
|
protein level classification |
|
|
|
### Dataset description |
|
The dataset is from [COGNANO/AVIDa-SARS-CoV-2](https://huggingface.co/datasets/COGNANO/AVIDa-SARS-CoV-2). |
|
|
|
We collect all amino acid sequences whose antigen type is Alpha varitant. |
|
|
|
Binary label represented by 1 for the binding pair and 0 for the non-binding pair. |
|
### Model input type |
|
Amino acid sequence |
|
|
|
### Performance |
|
test_acc: 0.95 |
|
|
|
test_loss: 0.32 |
|
|
|
### LoRA config |
|
lora_dropout: 0.0 |
|
|
|
lora_alpha: 16 |
|
|
|
target_modules: ["query", "key", "value", "intermediate.dense", "output.dense"] |
|
|
|
modules_to_save: ["classifier"] |
|
|
|
### Training config |
|
class: AdamW |
|
|
|
betas: (0.9, 0.98) |
|
|
|
weight_decay: 0.01 |
|
|
|
learning rate: 1e-4 |
|
|
|
epoch: 10 |
|
|
|
batch size: 200 |
|
|
|
precision: 16-mixed |