pszemraj committed on
Commit 98ae09a · verified · 1 Parent(s): 958e619

Update README.md

Files changed (1)
  1. README.md +8 -20
README.md CHANGED
@@ -5,33 +5,21 @@ tags:
  - generated_from_trainer
  metrics:
  - accuracy
- model-index:
- - name: bert-plus-embedderForMLM-goodwiki-deduped-split_4096-usecache
- results: []
+ license: apache-2.0
+ datasets:
+ - euirim/goodwiki
  ---
 
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
 
- # bert-plus-embedderForMLM-goodwiki-deduped-split_4096-usecache
+ # BEE-spoke-data/bert-plus-L8-4096-v1.0
+
+
+ Further progression after multitask training etc. The most recent/last dataset it saw was the euirim/goodwiki dataset.
 
- This model was trained from scratch on the BEE-spoke-data/goodwiki-deduped-split dataset.
  It achieves the following results on the evaluation set:
  - Loss: 1.9835
  - Accuracy: 0.6159
 
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
  ## Training procedure
 
  ### Training hyperparameters
@@ -63,4 +51,4 @@ The following hyperparameters were used during training:
  - Transformers 4.37.2
  - Pytorch 2.3.0.dev20240206+cu121
  - Datasets 2.16.1
- - Tokenizers 0.15.1
+ - Tokenizers 0.15.1
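
The evaluation loss reported in the card (1.9835) is easier to interpret as a perplexity. The sketch below is not part of the commit; it assumes the loss is the mean masked-token cross-entropy in nats, which is the usual convention for Trainer-generated masked language modeling cards but is not stated in the README itself.

```python
import math

# Evaluation loss as reported in the model card.
eval_loss = 1.9835

# Assumption: the loss is a mean cross-entropy measured in nats,
# so the corresponding perplexity is exp(loss).
perplexity = math.exp(eval_loss)

print(f"eval loss:  {eval_loss:.4f}")
print(f"perplexity: {perplexity:.2f}")
```

Under that assumption the perplexity comes out at roughly 7.27 over masked tokens. If the loss were instead reported in bits, the base would be 2 rather than e.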