Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenNLPLab
/
TransNormerLLM-385M
like
9
Text Generation
Transformers
PyTorch
English
Chinese
TransNormerLLM
custom_code
arxiv:
2307.14995
arxiv:
2009.03300
License:
other
Model card
Files
Files and versions
xet
Community
1
Train
Use this model
e423fc2
TransNormerLLM-385M
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
OpenNLPLab
Publish 385M Model
e423fc2
almost 2 years ago
.gitattributes
Safe
1.52 kB
initial commit
almost 2 years ago
README.md
98 Bytes
Update README.md
almost 2 years ago
config.json
Safe
1.03 kB
Publish 385M Model
almost 2 years ago
configuration_transnormer.py
Safe
2.27 kB
Publish 385M Model
almost 2 years ago
generation_config.json
Safe
110 Bytes
Publish 385M Model
almost 2 years ago
lightning_attention.py
Safe
15.3 kB
Publish 385M Model
almost 2 years ago
modeling_transnormer.py
40.3 kB
Publish 385M Model
almost 2 years ago
norm.py
Safe
1.25 kB
Publish 385M Model
almost 2 years ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
What is a pickle import?
798 MB
xet
Publish 385M Model
almost 2 years ago
special_tokens_map.json
410 Bytes
Publish 385M Model
almost 2 years ago
srmsnorm_triton.py
Safe
5.75 kB
Publish 385M Model
almost 2 years ago
tokenization_baichuan.py
9.82 kB
Publish 385M Model
almost 2 years ago
tokenizer.model
1.14 MB
xet
Publish 385M Model
almost 2 years ago
tokenizer_config.json
819 Bytes
Publish 385M Model
almost 2 years ago
utils.py
Safe
3.77 kB
Publish 385M Model
almost 2 years ago