Noelia Ferruz committed
Commit 006ad59
Parent(s): d4c5995
Update README.md

README.md CHANGED
@@ -53,11 +53,7 @@ python run_clm.py --model_name_or_path nferruz/ProtGPT2 --train_file training.txt
 The HuggingFace script run_clm.py can be found here: https://github.com/huggingface/transformers/blob/master/examples/pytorch/language-modeling/run_clm.py
 
 ### **How to select the best sequences**
-We've observed that perplexity values correlate with AlphaFold2's pLDDT. This plot shows perplexity vs. pLDDT values for each of the 10,000 sequences in the ProtGPT2-generated dataset
-<div align="center">
-<img src="https://huggingface.co/nferruz/ProtGPT2/blob/main/ppl-plddt.png" width="45%" />
-</div>
-
+We've observed that perplexity values correlate with AlphaFold2's pLDDT. This plot shows perplexity vs. pLDDT values for each of the 10,000 sequences in the ProtGPT2-generated dataset (see https://huggingface.co/nferruz/ProtGPT2/blob/main/ppl-plddt.png).
 
 We recommend computing the perplexity of each sequence with the HuggingFace `evaluate` method `perplexity`:
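As a sketch of the selection criterion above: perplexity is the exponential of the mean per-token negative log-likelihood, so lower values indicate sequences the model considers more natural. The log-likelihood values below are illustrative placeholders, not real ProtGPT2 outputs.

```python
import math

# Illustrative per-residue negative log-likelihoods for one generated
# sequence (placeholder values, not actual ProtGPT2 outputs).
nlls = [2.1, 1.8, 2.5, 1.9, 2.2]

# Perplexity is exp of the mean negative log-likelihood; lower is better.
perplexity = math.exp(sum(nlls) / len(nlls))
print(round(perplexity, 2))  # → 8.17
```

With the real model, the same quantity can be computed (assuming the standard `evaluate` API) via `evaluate.load("perplexity", module_type="metric").compute(predictions=sequences, model_id="nferruz/ProtGPT2")`, then ranking sequences by ascending perplexity.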