Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
gpt3-8b-multi-3.5t-base
like
8
Follow
NVIDIA
34.6k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:
2406.07887
arxiv:
2405.21060
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
gpt3-8b-multi-3.5t-base
/
release
/
mp_rank_00
Commit History
Update model arguments
51d7f04
rwaleffe
commited on
Jun 13, 2024
Upload model
890e55a
rwaleffe
commited on
Jun 13, 2024