Commit History
don't use deepspeed or fsdp when merging loras (#1479)
87ca3f9
unverified
refactor utils.data module for line count linter (#1476)
e0fcef4
unverified
Pretrain multipack v2 (#1470)
5aa5097
unverified
fix pretraining_ on odd datasets (#1463)
586bd8d
unverified
reduce verbosity of the special tokens (#1472)
0b10377
unverified
qwen2_moe support w multipack (#1455)
6086be8
unverified
fix some of the edge cases for Jamba (#1452)
05b398a
unverified
Support loading datasets saved via save_to_disk (#1432)
e634118
unverified
Keith Stevens
commited on