Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Kristijan
/
gpt2_wt103_12-layer
like
0
PyTorch
English
gpt2
language-model
transformer
wikitext-103
Eval Results
arxiv:
2210.13569
Model card
Files
Files and versions
Community
1
refs/pr/1
gpt2_wt103_12-layer
1 contributor
History:
6 commits
SFconvertbot
Adding `safetensors` variant of this model
b39af55
verified
about 2 months ago
.gitattributes
1.48 kB
initial commit
over 1 year ago
README.md
1.87 kB
Update README.md
over 1 year ago
config.json
686 Bytes
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
model.safetensors
443 MB
LFS
Adding `safetensors` variant of this model
about 2 months ago
optimizer.pt
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
862 MB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
pytorch_model.bin
pickle
Detected Pickle imports (4)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.ByteStorage"
What is a pickle import?
443 MB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
rng_state.pth
pickle
Detected Pickle imports (7)
"_codecs.encode"
,
"numpy.core.multiarray._reconstruct"
,
"numpy.ndarray"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.ByteStorage"
,
"collections.OrderedDict"
,
"numpy.dtype"
How to fix it?
14.5 kB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
scaler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
559 Bytes
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
scheduler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
623 Bytes
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
trainer_state.json
15.2 kB
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago
training_args.bin
pickle
Detected Pickle imports (4)
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.TrainingArguments"
,
"torch.device"
How to fix it?
2.48 kB
LFS
initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
over 1 year ago