Fix unexpected kw args (version mismatch between train machine and test machine?) 695e75b slackingfred committed on Apr 22
Add config.json from https://huggingface.co/CMU-AIR2/deepseek_lora_hardarithmetic/commit/897694fea18d122a157c31409e5455b3adc7ccd6 d1fd2f5 Fred committed on Apr 20
180K steps (WandB run: https://wandb.ai/cmu-11785-s24-sg06/huggingface/runs/3060wpp0 at 80K steps) d5dd255 Fred committed on Apr 18