Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
AndreaUnibo
/
JetMoE_rank_lstm_final_full_trained_depth3_n2
like
0
Text Generation
Transformers
Safetensors
jetmoe
trl
sft
Inference Endpoints
4-bit precision
bitsandbytes
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
JetMoE_rank_lstm_final_full_trained_depth3_n2
Commit History
Upload tokenizer
adb9c73
verified
AndreaUnibo
commited on
25 days ago
Upload JetMoEForCausalLM
6a2e262
verified
AndreaUnibo
commited on
25 days ago
initial commit
ddb5d7b
verified
AndreaUnibo
commited on
25 days ago