Text Generation
Transformers
Safetensors
English
gemma2
creative-writing
conversational
text-generation-inference
Inference Endpoints
mehmetkeremturkcan committed · verified
Commit a8e0ed5 · 1 Parent(s): 15bdcfa

Update README.md

Files changed (1): README.md +55 -3
README.md CHANGED
@@ -1,3 +1,55 @@
- ---
- license: gemma
- ---
+ ---
+ license: gemma
+ datasets:
+ - sam-paech/gutenberg3-generalfiction-scifi-fantasy-romance-adventure-dpo
+ - sam-paech/gutenbergs_1_2_3_antislop-dpo
+ language:
+ - en
+ base_model:
+ - google/gemma-2-9b-it
+ library_name: transformers
+ tags:
+ - creative-writing
+ ---
+ # Oblivion's End
+
+ A merged LoRA for gemma-2-9b-it, trained with DPO on creative-writing preference datasets using [my DPO training notebook](https://github.com/mkturkcan/dpo-model-trainer).
+
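+ A minimal sketch of how such a merge is typically produced with `peft` (the adapter path and output directory below are placeholders, not published artifacts):
+
+ ```python
+ import torch
+ from peft import PeftModel
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ # Load the base instruction-tuned model in bf16.
+ base = AutoModelForCausalLM.from_pretrained(
+     "google/gemma-2-9b-it", torch_dtype=torch.bfloat16
+ )
+
+ # Load the DPO-trained LoRA adapter and fold its weights into the base model.
+ merged = PeftModel.from_pretrained(base, "path/to/dpo-lora-adapter").merge_and_unload()
+
+ # Save the standalone merged model together with the tokenizer.
+ merged.save_pretrained("oblivions-end")
+ AutoTokenizer.from_pretrained("google/gemma-2-9b-it").save_pretrained("oblivions-end")
+ ```
+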
+ ## Model Details
+
+ ### Model Description
+
+ - **Finetuned from model:** google/gemma-2-9b-it
+
+ ### Model Sources
+
+ - **Repository:** [GitHub](https://github.com/mkturkcan/dpo-model-trainer/tree/main)
+
+ ## Uses
+
+ Made for creative writing.
+
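+ A minimal generation example with `transformers` (the model id is a placeholder for this repository; sampling settings are illustrative):
+
+ ```python
+ import torch
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ model_id = "path/to/oblivions-end"  # placeholder: replace with this repo's id
+ tokenizer = AutoTokenizer.from_pretrained(model_id)
+ model = AutoModelForCausalLM.from_pretrained(
+     model_id, torch_dtype=torch.bfloat16, device_map="auto"
+ )
+
+ messages = [
+     {"role": "user", "content": "Write the opening scene of a gaslamp-fantasy short story."}
+ ]
+ input_ids = tokenizer.apply_chat_template(
+     messages, add_generation_prompt=True, return_tensors="pt"
+ ).to(model.device)
+
+ # Sample a continuation and decode only the newly generated tokens.
+ output = model.generate(
+     input_ids, max_new_tokens=512, do_sample=True, temperature=0.9, top_p=0.95
+ )
+ print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
+ ```
+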
+ ## Training Details
+
+ ### Training Data
+
+ Trained on the two DPO preference datasets listed in the metadata above: sam-paech/gutenberg3-generalfiction-scifi-fantasy-romance-adventure-dpo and sam-paech/gutenbergs_1_2_3_antislop-dpo.
+
+ ### Training Procedure
+
+ Model training performance (DPO margins) is available in the [wandb instance](https://api.wandb.ai/links/mkturkcan/4djkmhwp).
+
+ #### Training Hyperparameters
+
+ - **Training regime:** bf16 on a 1x 80GB A100 node.
+
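+ A rough sketch of the DPO setup in the spirit of the linked notebook, assuming a recent `trl` release (LoRA rank, batch size, and beta below are illustrative, not the exact values used):
+
+ ```python
+ import torch
+ from datasets import load_dataset
+ from peft import LoraConfig
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+ from trl import DPOConfig, DPOTrainer
+
+ model_name = "google/gemma-2-9b-it"
+ model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
+
+ # One of the preference datasets listed in the metadata above
+ # (split name and prompt/chosen/rejected column format assumed).
+ train_dataset = load_dataset(
+     "sam-paech/gutenberg3-generalfiction-scifi-fantasy-romance-adventure-dpo",
+     split="train",
+ )
+
+ peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")
+
+ args = DPOConfig(
+     output_dir="oblivions-end-dpo",
+     per_device_train_batch_size=1,
+     gradient_accumulation_steps=8,
+     bf16=True,   # matches the bf16 training regime noted above
+     beta=0.1,    # DPO temperature; illustrative
+ )
+
+ trainer = DPOTrainer(
+     model=model,
+     args=args,
+     train_dataset=train_dataset,
+     processing_class=tokenizer,
+     peft_config=peft_config,
+ )
+ trainer.train()
+ ```
+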
+ ## Environmental Impact
+
+ Total emissions are estimated to be 0.83 kg CO$_2$eq.
+