Oblivion's End
A merged LoRA for gemma-2-9b-it, trained using DPO datasets for creative writing using my DPO training notebook.
Model Details
Model Description
- Finetuned from model: google/gemma-2-9b-it
Model Sources [optional]
- Repository: GitHub.
Uses
Made for creative writing.
Training Details
Training Data
Check out the model card details.
Training Procedure
Model training performance (margins) are available in the wandb instance.
Training Hyperparameters
- Training regime: bf16 on a 1x 80GB A100 node.
Model Examination [optional]
[More Information Needed]
Environmental Impact
Total emissions are estimated to be 0.83 kgCO$_2$eq.