Update README.md
README.md CHANGED
@@ -197,7 +197,8 @@ widget:
 # Model Summary
 We present GPT-JT, a fork of GPT-J (6B), trained for 20,000 steps, that outperforms most 100B+ parameter models at classification and improves on most tasks. GPT-JT was trained with a new decentralized algorithm over a 1Gbps interconnect.
 GPT-JT is a bidirectional dense model, trained through the UL2 objective on Natural Instructions (NI), P3, Chain-of-Thought (CoT), and the Pile data.
-
+
+**Please check out our demo: [TOMA-app](https://huggingface.co/spaces/togethercomputer/TOMA-app).**
 
 # Quick Start
 ```python
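The Quick Start code block is truncated at the end of this hunk, so the snippet below is only a rough sketch of how one might load and prompt GPT-JT with the Hugging Face `transformers` AutoModel API; the model identifier `togethercomputer/GPT-JT-6B-v1` and the example prompt are assumptions, not taken from the README itself.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Assumed model id -- the README's actual Quick Start may use a different identifier.
model_id = "togethercomputer/GPT-JT-6B-v1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Zero-shot classification-style prompt, in line with the model summary's
# emphasis on classification tasks.
prompt = (
    "Label the sentiment of the review as positive or negative.\n\n"
    "Review: The movie was fantastic.\n"
    "Sentiment:"
)
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=5)

# Print only the newly generated tokens after the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```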