n3wtou committed on
Commit 40a026f
1 Parent(s): 930d053

Update README.md

Files changed (1)
  1. README.md +9 -3
README.md CHANGED
@@ -5,6 +5,12 @@ tags:
  model-index:
  - name: n3wtou/mt5-small-finedtuned-4-swahili
  results: []
+ datasets:
+ - csebuetnlp/xlsum
+ language:
+ - sw
+ metrics:
+ - rouge
  ---

  <!-- This model card has been generated automatically according to the information Keras had access to. You should
@@ -12,7 +18,7 @@ probably proofread and complete it, then remove this comment. -->

  # n3wtou/mt5-small-finedtuned-4-swahili

- This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
+ This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the [csebuetnlp/xlsum](https://huggingface.co/datasets/csebuetnlp/xlsum/viewer/swahili/train) dataset.
  It achieves the following results on the evaluation set:
  - Train Loss: 3.0006
  - Validation Loss: 2.7015
@@ -20,7 +26,7 @@ It achieves the following results on the evaluation set:

  ## Model description

- More information needed
+ This model is a fined-tuned google/mt5-small for Kiswahili abstractive text generation

  ## Intended uses & limitations

@@ -59,4 +65,4 @@ The following hyperparameters were used during training:
  - Transformers 4.29.2
  - TensorFlow 2.12.0
  - Datasets 2.12.0
- - Tokenizers 0.13.3
+ - Tokenizers 0.13.3
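
The updated card describes the model as an mt5-small checkpoint fine-tuned on the Swahili split of XL-Sum and trained with TensorFlow. The card does not include a usage example, so what follows is only a minimal sketch of how such a checkpoint might be loaded for summarization. It assumes the repository ships TensorFlow weights and a tokenizer loadable through the `transformers` auto classes. The helper name `summarize` and the length limits are illustrative choices, not values taken from the commit.

```python
# Repository name as it appears in the model card.
MODEL_ID = "n3wtou/mt5-small-finedtuned-4-swahili"


def summarize(text, max_input_tokens=512, max_summary_tokens=64):
    """Return a generated summary for a Swahili text (hypothetical helper).

    The length limits are assumptions made for this sketch; the card does
    not state the values used during training.
    """
    # Imported lazily so that defining the helper does not require
    # transformers or TensorFlow to be installed.
    from transformers import AutoTokenizer, TFAutoModelForSeq2SeqLM

    # Assumption: the repository provides TensorFlow weights, which seems
    # plausible because the card lists TensorFlow 2.12.0 under its versions.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = TFAutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Tokenize the input and truncate it to the assumed input limit.
    inputs = tokenizer(
        text,
        return_tensors="tf",
        truncation=True,
        max_length=max_input_tokens,
    )

    # Generate summary token ids and decode them back to text.
    output_ids = model.generate(**inputs, max_new_tokens=max_summary_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` with a Swahili news article would download the checkpoint on first use and return the generated summary as a string. Reloading the model on every call is wasteful; a real script would load the tokenizer and model once and reuse them.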