gushiGPT/config/eval_gpt2.py
xRunda: Initial commit (5dcfa45), over 1 year ago
# evaluate the base gpt2
# n_layer=12, n_head=12, n_embd=768
# 124M parameters
batch_size = 8
eval_iters = 500 # use more iterations to get good estimate
eval_only = True
wandb_log = False
init_from = 'gpt2'
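
The header comments give the base model's shape (n_layer=12, n_embd=768) and a size of 124M parameters. The sketch below is one way to check that figure by hand. It is not part of this repository: the function name `gpt2_param_count` is hypothetical, and the vocabulary size (50257) and context length (1024) are assumed to be the standard GPT-2 values, since the config file does not state them. It also assumes the output projection shares weights with the token embedding, so it is counted once. `n_head` does not appear because splitting attention into heads does not change the number of weights.

```python
def gpt2_param_count(n_layer, n_embd, vocab_size=50257, block_size=1024):
    """Count parameters of a GPT-2 style decoder with tied input/output embeddings.

    vocab_size and block_size default to the standard GPT-2 values (an assumption;
    they are not given in eval_gpt2.py).
    """
    # Token embedding table plus learned position embedding table.
    embeddings = vocab_size * n_embd + block_size * n_embd

    # One LayerNorm holds a scale and a bias vector.
    layer_norm = 2 * n_embd

    # Attention: fused q/k/v projection, then output projection (weights + biases).
    attention = (n_embd * 3 * n_embd + 3 * n_embd) + (n_embd * n_embd + n_embd)

    # MLP: expand to 4 * n_embd, then project back (weights + biases).
    mlp = (n_embd * 4 * n_embd + 4 * n_embd) + (4 * n_embd * n_embd + n_embd)

    # Each transformer block has two LayerNorms, one attention layer, one MLP.
    per_block = 2 * layer_norm + attention + mlp

    # A final LayerNorm follows the last block.
    return embeddings + n_layer * per_block + layer_norm


total = gpt2_param_count(n_layer=12, n_embd=768)
print(f"{total:,} parameters (~{total / 1e6:.0f}M)")
# prints "124,439,808 parameters (~124M)"
```

Under these assumptions the total rounds to the 124M quoted in the comment, with roughly 39M of it in the embedding tables and about 7.1M in each of the 12 blocks.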