Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Aleph-Alpha
/
umup-research-1b-fp8
like
0
Follow
Aleph Alpha
238
Text Generation
scaling
arxiv:
2407.17465
License:
open-aleph-license
Model card
Files
Files and versions
Community
main
umup-research-1b-fp8
Ctrl+K
Ctrl+K
1 contributor
History:
6 commits
GregorZiegltrumAA
Update README.md
de4f8cc
verified
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
LICENSE
Safe
1.72 kB
Update LICENSE
10 months ago
README.md
Safe
3.46 kB
Update README.md
10 months ago
config.yml
Safe
2.47 kB
Add Model
10 months ago
model_state_layer_0_EmbeddingInput.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
268 MB
LFS
Add Model
10 months ago
model_state_layer_10_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_11_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_12_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_13_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_14_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_15_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_16_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_17_LayerNormWrapper.pt
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
5.55 kB
LFS
Add Model
10 months ago
model_state_layer_18_TransformerLMHead.pt
268 MB
LFS
Add Model
10 months ago
model_state_layer_1_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_2_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_3_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_4_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_5_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_6_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_7_TransformerLayer.pt
101 MB
LFS
Add Model
10 months ago
model_state_layer_8_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
model_state_layer_9_TransformerLayer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
101 MB
LFS
Add Model
10 months ago
vocab.json
Safe
4.22 MB
Add Model
10 months ago