Model Card for Model ID
A GPT-NeoX model pretrained on a 2.06 GB English news dataset. Training took about 20 hours to reach 40,000 iterations on an AWS p3.16xlarge instance. Non-default hyperparameter: gradient_accumulation_step 4.
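As a rough illustration of what these numbers imply, the sketch below works out the average time per iteration and shows how a gradient accumulation value of 4 scales the effective batch size. The micro-batch size of 4 is a hypothetical placeholder, since the card does not state it. The GPU count of 8 matches the p3.16xlarge hardware, but treating all 8 GPUs as data-parallel workers is an assumption.

```python
def seconds_per_iteration(hours: float, iterations: int) -> float:
    """Average wall-clock seconds spent on one training iteration."""
    return hours * 3600 / iterations


def effective_batch_size(micro_batch_per_gpu: int,
                         grad_accum_steps: int,
                         num_gpus: int) -> int:
    """Samples contributing to one optimizer step under data parallelism."""
    return micro_batch_per_gpu * grad_accum_steps * num_gpus


# Figures from the card: about 20 hours for 40,000 iterations.
sec_per_iter = seconds_per_iteration(20, 40_000)

# micro_batch_per_gpu=4 is hypothetical (not stated in the card).
# num_gpus=8 assumes every GPU on the p3.16xlarge is a data-parallel worker.
batch = effective_batch_size(micro_batch_per_gpu=4,
                             grad_accum_steps=4,
                             num_gpus=8)

print(f"{sec_per_iter:.2f} s/iteration, effective batch size {batch}")
```

Under those assumptions, each optimizer step sees four times as many samples as it would without accumulation, at the cost of four forward/backward passes per step.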
Model Details
Model Description
- Developed by: Eunyoung Lee
- Model type: GPT-NeoX
- Language(s) (NLP): English