Brad John Pitt commited on
Commit
ac97287
1 Parent(s): 6bf75dc

Training in progress, epoch 1

Browse files
README.md CHANGED
@@ -7,9 +7,6 @@ tags:
7
  model-index:
8
  - name: Gopal-finetuned-custom-en-to-ru
9
  results: []
10
- language:
11
- - en
12
- - ru
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -21,11 +18,11 @@ This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-ru](https://huggi
21
 
22
  ## Model description
23
 
24
- This is the fine-tuned model with my custom dataset and hyperparameters, Follow huggingface tutorials for enhanced knowledge
25
 
26
- ## Intended uses & limitation
27
 
28
- Work on the accuracy part, increase the number of datasets, atleast 20k<n<100k
29
 
30
  ## Training and evaluation data
31
 
@@ -35,6 +32,15 @@ More information needed
35
 
36
  ### Training hyperparameters
37
 
 
 
 
 
 
 
 
 
 
38
 
39
  ### Training results
40
 
@@ -45,4 +51,4 @@ More information needed
45
  - Transformers 4.38.2
46
  - Pytorch 2.2.1+cu121
47
  - Datasets 2.18.0
48
- - Tokenizers 0.15.2
 
7
  model-index:
8
  - name: Gopal-finetuned-custom-en-to-ru
9
  results: []
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
18
 
19
  ## Model description
20
 
21
+ More information needed
22
 
23
+ ## Intended uses & limitations
24
 
25
+ More information needed
26
 
27
  ## Training and evaluation data
28
 
 
32
 
33
  ### Training hyperparameters
34
 
35
+ The following hyperparameters were used during training:
36
+ - learning_rate: 1e-05
37
+ - train_batch_size: 64
38
+ - eval_batch_size: 64
39
+ - seed: 42
40
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
+ - lr_scheduler_type: linear
42
+ - num_epochs: 10
43
+ - mixed_precision_training: Native AMP
44
 
45
  ### Training results
46
 
 
51
  - Transformers 4.38.2
52
  - Pytorch 2.2.1+cu121
53
  - Datasets 2.18.0
54
+ - Tokenizers 0.15.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3d138a5fbbb8693bfcf467f7fd13b01586d822efd98a5b28f3df7703054884e3
3
  size 304869976
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f697206969406bac32b21da52975f51b951f54e385e1b8c5669735cfed68f8e
3
  size 304869976
runs/Mar29_05-51-40_9a0f808d947e/events.out.tfevents.1711692161.9a0f808d947e.1707.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4dc57ee8e2aa57a1f709255a561478eaf5be9b79c42df6d20eba0f520d77b01
3
+ size 407
runs/Mar29_06-29-08_9a0f808d947e/events.out.tfevents.1711693763.9a0f808d947e.1707.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a510abde1360a1c6e916b08e11d1c6a2abad984e396dbbfd03fcb8225fa399b4
3
+ size 4184
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:68209316b8d3275677d7c9f24d08eff726d8562f4033ec4481d99b2d92cc2079
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b6a60a76b9e85131bcff00f2557bcd35b4bd03bd50fb59fdbbd30dc1c65ffbd7
3
  size 5112