Model Card for Model ID

Pretrained GPT-NeoX model with 2.06GB English news dataset. Took about 20 hours to reach 40,000 iterations. Trained on p3.16xlarge. Different hyperparameter: gradient_accumulation_step 4

Model Details

Model Description

  • Developed by: Eunyoung Lee
  • Model type: GPT-NeoX
  • Language(s) (NLP): English
Downloads last month
12
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.