pratikdoshi/finetune-llama-7b-text-to-sql

Generated from Trainer

pratikdoshi committed on Sep 20, 2024

Commit d2d0def · verified · 1 Parent(s): 48c4322

Update README.md

Files changed (1): README.md (+5 -5)

@@ -14,7 +14,7 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# finetune-llama-7b-text-to-sql
+# Finetuning Llama-7b on text-to-sql task
 This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on [b-mc2/sql-create-context](https://huggingface.co/datasets/b-mc2/sql-create-context).
@@ -24,9 +24,8 @@ This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggin
 The model is trained on 10,000 random samples from [b-mc2/sql-create-context](https://huggingface.co/datasets/b-mc2/sql-create-context).
 It is trained in a manner described by [Phil Schmid here](https://www.philschmid.de/fine-tune-llms-in-2024-with-trl).
-## Training procedure
-### Training hyperparameters
+## Training hyperparameters
 | Hyperparameter | Value |
 | -------------- | ----- |
@@ -41,11 +40,12 @@ It is trained in a manner described by [Phil Schmid here](https://www.philschmid
 | lr_scheduler_warmup_ratio | 0.03 |
 | num_epochs | 3 |
-### Training results
-### Framework versions
+## Training results
+![Train loss](assets/train_loss.png)
+## Framework versions
 - PEFT 0.7.2.dev0
 - Transformers 4.36.2
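
The README in this diff states that training used 10,000 random samples, 3 epochs, and a warmup ratio of 0.03. The sketch below illustrates what those numbers could imply, using only the Python standard library. It is not the author's training script: the function names, the seed, the dataset size of 50,000, and the effective batch size of 8 are all hypothetical placeholders, since the diff hunks do not show those values. Rounding warmup steps up with `ceil` is also an assumption about how the ratio is applied.

```python
import math
import random


def sample_indices(dataset_size: int, n_samples: int, seed: int = 42) -> list[int]:
    """Pick n_samples distinct row indices uniformly at random.

    The seed is a hypothetical choice; the README does not state one.
    """
    rng = random.Random(seed)
    return rng.sample(range(dataset_size), n_samples)


def warmup_steps(
    n_samples: int,
    num_epochs: int,
    warmup_ratio: float,
    effective_batch_size: int,
) -> int:
    """Estimate the number of warmup steps as a fraction of all optimizer steps.

    effective_batch_size is an assumption (per-device batch size times
    gradient accumulation steps); it is not visible in the diff.
    """
    steps_per_epoch = math.ceil(n_samples / effective_batch_size)
    total_steps = steps_per_epoch * num_epochs
    return math.ceil(total_steps * warmup_ratio)


if __name__ == "__main__":
    # 50_000 is a placeholder dataset size, not the real size of the dataset.
    indices = sample_indices(dataset_size=50_000, n_samples=10_000)
    print(len(indices), "rows selected")

    # 10,000 samples, 3 epochs, and 0.03 come from the README; 8 is hypothetical.
    print(warmup_steps(10_000, 3, 0.03, effective_batch_size=8), "warmup steps")
```

With a different effective batch size the step counts change proportionally, so the result is only a rough guide to how long the learning-rate warmup might last.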