pszemraj's picture
update ckpt with 6ish epochs of training with 1024 TOKENS as max output
9996867