---
datasets:
- shay681/Legal_Clauses
language:
- he
base_model:
- google/mt5-small
pipeline_tag: text2text-generation
---
# Text2Text Legal Clauses Finetuned Model

This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the [shay681/Legal_Clauses](https://huggingface.co/datasets/shay681/Legal_Clauses) dataset.
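
A minimal usage sketch with 🤗 Transformers is shown below. The repository id is a placeholder and the Hebrew input is illustrative only; replace them with this model's actual Hub id and your own clause text.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Placeholder repo id -- substitute the actual Hub id of this model.
model_id = "shay681/Text2Text_Legal_Clauses_Finetuned_Model"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# Hebrew legal clause as input (example text is illustrative only).
inputs = tokenizer("סעיף לדוגמה מתוך חוזה משפטי", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```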


## Training and evaluation data

| Dataset  | Split | # samples |
| -------- | ----- | --------- |
| Legal_Clauses | train | 147,946 |
| Legal_Clauses | validation | 36,987 |
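
The splits above can be loaded directly with the 🤗 Datasets library, for example:

```python
from datasets import load_dataset

# Load the train and validation splits of the Legal_Clauses dataset.
dataset = load_dataset("shay681/Legal_Clauses")
print(dataset["train"].num_rows)       # expected: 147,946
print(dataset["validation"].num_rows)  # expected: 36,987
```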


## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- evaluation_strategy: "epoch"
- learning_rate: 5e-5
- train_batch_size: 4
- eval_batch_size: 4
- num_train_epochs: 5
- weight_decay: 0.01
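
As a rough sketch, these settings correspond to a `Seq2SeqTrainingArguments` configuration along the following lines (argument names follow Transformers 4.17.0; the output directory is a placeholder):

```python
from transformers import Seq2SeqTrainingArguments

# Sketch of the training configuration implied by the hyperparameters above.
# "output_dir" is a placeholder; the other values mirror the list in this card.
training_args = Seq2SeqTrainingArguments(
    output_dir="mt5-small-legal-clauses",
    evaluation_strategy="epoch",
    learning_rate=5e-5,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=4,
    num_train_epochs=5,
    weight_decay=0.01,
)
```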


### Framework versions

- Transformers 4.17.0
- Pytorch 1.10.0+cu111
- Datasets 1.18.4
- Tokenizers 0.11.6


### Results

| Metric | Value |
| ------ | ----- |
| **Accuracy** | **0.87** |
| **F1** | **0.64** |


### About Me
Created by Shay Doner.
This is my final project for the Intelligent Systems M.Sc. program at Afeka College in Tel Aviv.
For collaboration inquiries, please contact:
[email protected]