File size: 1,026 Bytes
b65dd0c 7ca5874 2d0fea6 7ca5874 b65dd0c 8f115fe b65dd0c 5911117 c042042 ccc0170 47c6a19 c042042 ccc0170 c042042 b65dd0c 8f115fe b65dd0c 8f115fe b65dd0c 8f115fe 1cf6e0b 8f115fe b65dd0c 7ca5874 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 |
---
tags:
- generated_from_trainer
model-index:
- name: SMILES_BERT
results: []
widget:
- text: CC(=O)NC1<mask>CC=C(C=C1)O
pipeline_tag: fill-mask
---
# SMILES_BERT
A BERT model trained on a list of 50,000 SMILES for MLM
Example:
Acetaminophen
```
CC(=O)NC1=CC=C(C=C1)O
```
## Model description
This model is a BERT model that was trained on a list of 50k SMILES. The SMILES were sourced from BindingDB and the compounds bind to certain proteins
with some affinity. The purpose of this model was to provide a model which can then be fine-tuned for other tasks in which SMILES data can be useful.
## Intended uses & limitations
This model was trained in order to provide a model which can then be fine-tuned for other tasks in which SMILES data can be useful such as
predicting physical properties, chemical activity, or biological activity.
### Training results
Training Loss: 0.9446000
Further evaluation is needed
### Framework versions
- Transformers 4.37.0.dev0
- Pytorch 2.1.0+cu121
- Tokenizers 0.15.0 |