Llama3.1-Mamba-8B-distill / trainer_state.json

Commit History

add models
0c247b3

Junxiong Wang committed on