Post
2371
Trained another version of llama3-8b-instruct which beats the base model. This time without losing too many points on gsm8k benchmark. Again, using AutoTrain π₯ pip install autotrain-advanced
Trained model: abhishek/autotrain-llama3-orpo-v2
Trained model: abhishek/autotrain-llama3-orpo-v2