Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
OpenNLPLab
/
TransNormerLLM2-3B-300B
like
3
Text Generation
Transformers
PyTorch
English
Chinese
TransNormerLLM
custom_code
arxiv:
2307.14995
arxiv:
2210.10340
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Use this model
main
TransNormerLLM2-3B-300B
1 contributor
History:
10 commits
OpenNLPLab
Upgrade to lightning att2
5ba41e2
verified
10 months ago
images
Upload lightning-leopard.jpg
12 months ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
Community License for TransNormerLLM Model.pdf
Safe
263 kB
Add license
12 months ago
README.md
Safe
9.89 kB
Update README.md
12 months ago
TransNormerLLM模型社区许可协议.pdf
Safe
294 kB
Add license
12 months ago
config.json
Safe
926 Bytes
Fix 3B config error
11 months ago
configuration_transnormer.py
Safe
2.27 kB
Publish 3B2-300B
12 months ago
generation_config.json
Safe
164 Bytes
Publish 3B2-300B
12 months ago
lightning_attention.py
Safe
15.3 kB
Publish 3B2-300B
12 months ago
lightning_attention2.py
Safe
15.3 kB
Upgrade to lightning att2
10 months ago
modeling_transnormer.py
Safe
34.6 kB
Upgrade to lightning att2
10 months ago
norm.py
Safe
1.27 kB
Publish 3B2-300B
12 months ago
pytorch_model-00001-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
What is a pickle import?
1.97 GB
LFS
Publish 3B2-300B
12 months ago
pytorch_model-00002-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.97 GB
LFS
Publish 3B2-300B
12 months ago
pytorch_model-00003-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.88 GB
LFS
Publish 3B2-300B
12 months ago
pytorch_model.bin.index.json
Safe
13.8 kB
Publish 3B2-300B
12 months ago
special_tokens_map.json
Safe
410 Bytes
Publish 3B2-300B
12 months ago
srmsnorm_triton.py
Safe
5.76 kB
Publish 3B2-300B
12 months ago
tokenization_baichuan.py
Safe
9.57 kB
Publish 3B2-300B
12 months ago
tokenizer.model
Safe
1.14 MB
LFS
Publish 3B2-300B
12 months ago
tokenizer_config.json
Safe
819 Bytes
Publish 3B2-300B
12 months ago
utils.py
Safe
4.39 kB
Publish 3B2-300B
12 months ago