Training in progress, step 112, checkpoint
d6e1f65 verified
{
"embed_dim": 1024,
"num_heads": 8,
"dropout": 0.05,
"bias": true,
"use_layernorm": true,
"use_MLP": true,
"MLP_h_size": 2048,
"use_residual": false
}
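
The key names suggest a multi-head attention block with optional layer normalization, an MLP, and a residual connection, but the file does not say how the values are consumed. As a minimal sketch under that assumption, the Python below parses the same JSON and runs two sanity checks: that `embed_dim` splits evenly across `num_heads` (as standard multi-head attention requires) and that `dropout` is a valid probability. The helper name `load_attention_config` and the `CONFIG_TEXT` constant are hypothetical and not part of the checkpoint.

```python
import json

# Config text copied verbatim from the checkpoint file (assumed to be the full file).
CONFIG_TEXT = """
{
"embed_dim": 1024,
"num_heads": 8,
"dropout": 0.05,
"bias": true,
"use_layernorm": true,
"use_MLP": true,
"MLP_h_size": 2048,
"use_residual": false
}
"""


def load_attention_config(text):
    """Parse the JSON config and run basic sanity checks (hypothetical helper)."""
    cfg = json.loads(text)
    # Standard multi-head attention gives each head embed_dim / num_heads features,
    # so the division must be exact.
    if cfg["embed_dim"] % cfg["num_heads"] != 0:
        raise ValueError("embed_dim must be divisible by num_heads")
    # Dropout is a probability; 1.0 would zero every activation.
    if not 0.0 <= cfg["dropout"] < 1.0:
        raise ValueError("dropout must be in [0, 1)")
    return cfg


cfg = load_attention_config(CONFIG_TEXT)
head_dim = cfg["embed_dim"] // cfg["num_heads"]        # per-head width
mlp_ratio = cfg["MLP_h_size"] / cfg["embed_dim"]       # hidden size relative to embed_dim
print(head_dim, mlp_ratio)
```

With these values each head would be 128 features wide and the MLP hidden layer would be twice the embedding width. Note that `use_residual` is `false`, so a loader written along these lines should not assume a skip connection.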