llama-moe/LLaMA-MoE-v2-3_8B-residual-sft
Safetensors
English
mixtral
MoE
custom_code
arxiv: 2411.15708
License: apache-2.0
main
LLaMA-MoE-v2-3_8B-residual-sft/trainer_state.json
Commit History
commit
c1ac977
huxy912 committed on Nov 26, 2024