jordiclive commited on
Commit
bfcac77
1 Parent(s): 2815cee

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: EleutherAI/pile-t5-base
3
+ tags:
4
+ - generated_from_trainer
5
+ model-index:
6
+ - name: pile-t5-base-instruction
7
+ results: []
8
+ language:
9
+ - en
10
+ metrics:
11
+ - rouge
12
+ datasets:
13
+ - taskydata/Pile-T5-Instruction
14
+ ---
15
+
16
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
+ should probably proofread and complete it, then remove this comment. -->
18
+
19
+ # pile-t5-base-instruction
20
+
21
+ This model is a fine-tuned version of [EleutherAI/pile-t5-base](https://huggingface.co/EleutherAI/pile-t5-base) on [Pile-T5-Instruction](https://huggingface.co/datasets/taskydata/Pile-T5-Instruction) dataset.
22
+
23
+
24
+ ### Training hyperparameters
25
+
26
+ The following hyperparameters were used during training:
27
+ - learning_rate: 0.0001
28
+ - train_batch_size: 4
29
+ - effective_batch_size: 64
30
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=0.00000001
31
+ - num_epochs: 6
32
+
33
+ ### Training results
34
+ [Wandb](https://wandb.ai/jordanclive/tasky-instruction/runs/5yx1yzzk/overview)