kashif HF staff commited on
Commit
250d5be
1 Parent(s): af975d2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -1
README.md CHANGED
@@ -79,7 +79,18 @@ Use the code below to get started with the model.
79
  ### Training Procedure
80
 
81
  ```bash
82
- accelerate launch --config_file examples/accelerate_configs/multi_gpu.yaml examples/scripts/dpo_vlm.py --dataset_name HuggingFaceH4/rlaif-v_formatted --model_name_or_path HuggingFaceTB/SmolVLM-Instruct --per_device_train_batch_size 8 --gradient_accumulation_steps 32 --dataset_num_proc 32 --output_dir dpo_smolvlm_rlaif-v --bf16 --torch_dtype bfloat16 --use_peft --lora_target_modules=all-linear exit
 
 
 
 
 
 
 
 
 
 
 
83
  ```
84
 
85
  ### Framework versions
 
79
  ### Training Procedure
80
 
81
  ```bash
82
+ accelerate launch --config_file examples/accelerate_configs/multi_gpu.yaml \
83
+ examples/scripts/dpo_vlm.py \
84
+ --dataset_name HuggingFaceH4/rlaif-v_formatted \
85
+ --model_name_or_path HuggingFaceTB/SmolVLM-Instruct \
86
+ --per_device_train_batch_size 8 \
87
+ --gradient_accumulation_steps 32 \
88
+ --dataset_num_proc 32 \
89
+ --output_dir dpo_smolvlm_rlaif-v \
90
+ --bf16 \
91
+ --torch_dtype bfloat16 \
92
+ --use_peft \
93
+ --lora_target_modules=all-linear exit
94
  ```
95
 
96
  ### Framework versions