akrishnan
/

TOFU_olmo-7b_ft_full_10epochs_lr9e-06_seed42_run1

Text Generation

Inference Endpoints

Model card Files Files and versions Community

TOFU_olmo-7b_ft_full_10epochs_lr9e-06_seed42_run1 / cfg.yaml

akrishnan's picture

Training in progress, step 1250

6fc159d verified 10 days ago

history blame contribute delete

337 Bytes

	model_family: olmo-7b
	LoRA:
	r: 0
	alpha: 32
	dropout: 0.05
	data_path: locuslab/TOFU
	split: full
	batch_size: 8
	gradient_accumulation_steps: 4
	num_epochs: 10
	lr: 9.0e-06
	seed: 42
	run_index: 1
	save_dir: paper_models/final_ft_noLORA_${num_epochs}_epochs_inst_lr${lr}_${model_family}_${split}_seed${seed}_${run_index}/
	weight_decay: 0.01