Kristijan/gpt2_wt103_12-layer
PyTorch
English
gpt2
language-model
transformer
wikitext-103
Eval Results
arxiv:2210.13569
1 contributor
History: 2 commits
Kristijan
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
bcb66e3
almost 2 years ago
.gitattributes
Safe
1.48 kB
initial commit
almost 2 years ago
config.json
Safe
686 Bytes
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
optimizer.pt
Safe
pickle
Detected Pickle imports (3): "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict"
862 MB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (4): "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.ByteStorage"
443 MB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
rng_state.pth
pickle
Detected Pickle imports (7): "_codecs.encode", "numpy.core.multiarray._reconstruct", "numpy.ndarray", "torch._utils._rebuild_tensor_v2", "torch.ByteStorage", "collections.OrderedDict", "numpy.dtype"
14.5 kB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
scaler.pt
Safe
pickle
Pickle imports: No problematic imports detected
559 Bytes
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
scheduler.pt
Safe
pickle
Pickle imports: No problematic imports detected
623 Bytes
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
trainer_state.json
Safe
15.2 kB
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
training_args.bin
pickle
Detected Pickle imports (4): "transformers.trainer_utils.SchedulerType", "transformers.trainer_utils.IntervalStrategy", "transformers.training_args.TrainingArguments", "torch.device"
2.48 kB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
almost 2 years ago
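The "Detected Pickle imports" entries above name the modules and classes a pickle file would import when loaded. As a rough illustration of how such a list can be derived without unpickling anything, the sketch below walks the pickle opcodes with the standard-library `pickletools` module. This is not the Hub's actual scanner: the function name `list_pickle_imports` is hypothetical, and the handling of `STACK_GLOBAL` is a simplification that assumes the module and attribute names are the two most recently pushed strings. Checkpoints written by recent PyTorch versions are typically zip archives with the pickle stored as a member inside, so that member would presumably need to be extracted first.

```python
import io
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals a pickle stream refers to.

    The stream is only parsed opcode by opcode; nothing is unpickled,
    so no code from the file is executed.
    """
    imports = set()
    recent_strings = []  # string operands seen so far (used for STACK_GLOBAL)
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name in ("GLOBAL", "INST"):
            # Protocols 0-3: the operand is "module name", space-separated.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack.
            # Simplifying assumption: they are the last two strings pushed.
            if len(recent_strings) >= 2:
                imports.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


# Small self-contained demo on an in-memory pickle rather than a real checkpoint.
demo = pickle.dumps(OrderedDict(a=1), protocol=2)
print(sorted(list_pickle_imports(demo)))  # ['collections.OrderedDict']
```

A list produced this way only says what a file would import, not whether loading it is safe; a real scanner also has to resolve strings fetched back from the pickle memo, which this sketch ignores.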