Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
mamba2-hybrid-8b-3t-32k
like
5
Follow
NVIDIA
20.1k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:
2406.07887
arxiv:
2405.21060
License:
apache-2.0
Model card
Files
Files and versions
Community
main
mamba2-hybrid-8b-3t-32k
1 contributor
History:
2 commits
rwaleffe
Upload model
425c863
10 months ago
release
Upload model
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
Safe
2.23 kB
Upload model
10 months ago
latest_checkpointed_iteration.txt
Safe
8 Bytes
Upload model
10 months ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model
Safe
4.57 MB
LFS
Upload model
10 months ago