nickrosh commited on
Commit
19aa0ae
·
1 Parent(s): 5520e46

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -1
README.md CHANGED
@@ -2,4 +2,28 @@
2
  license: cc-by-sa-4.0
3
  datasets:
4
  - nickrosh/Evol-Instruct-Code-80k-v1
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: cc-by-sa-4.0
3
  datasets:
4
  - nickrosh/Evol-Instruct-Code-80k-v1
5
+ ---
6
+
7
+ This model uses the Evol-Instruct-Code-80k-v1 dataset generated using the [Evol-Teacher](https://github.com/nickrosh/evol-teacher) repo. Currently, WizardCoder is one the most performant Code Generation models, being beaten only by ChatGPT. This takes the Code Alpaca 20k dataset and evolves each instruction through a randomly chosen evolution prompt to increase instruction complexity. These prompts range from increase time/space complexity, to increasing requirements, to adding erroneus code to improve robustness, etc. This is done three times with pruning and post processing to remove unwanted instructions and responses. The iterative addition of more complexity gives higher quality and more in-depth instructions than what is ususally generated in Alpaca methods. This, like in the case of WizardCoder and WizardLM, can lead to strong performance that gets very close to RLHF model performance.
8
+
9
+ This model was fine tuned with this generated dataset, using the following parameters:
10
+ ```bash
11
+ --model_name_or_path replit/replit-code-v1-3b \
12
+ --data_path ./data/EvolInstruct-Code-80k/EvolInstruct-Code-80k.json \
13
+ --output_dir ./checkpoints \
14
+ --num_train_epochs 3 \
15
+ --per_device_train_batch_size 1 \
16
+ --gradient_accumulation_steps 8 \
17
+ --evaluation_strategy "no" \
18
+ --save_strategy "steps" \
19
+ --save_steps 50 \
20
+ --save_total_limit 2 \
21
+ --learning_rate 2e-5 \
22
+ --weight_decay 0. \
23
+ --warmup_ratio 0.03 \
24
+ --lr_scheduler_type "cosine" \
25
+ --logging_steps 1 \
26
+ --model_max_length 2000 \
27
+ --bf16 True \
28
+ --tf32 True
29
+ ```