File size: 656 Bytes
b566327
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5880d94
7891099
53f3d3a
b5e6404
50ad0d4
b9d6a5d
f208049
9d71484
6549fc9
1dafe50
f7eafa4
be851f5
af19a7d
f253076
85e690e
c49896c
d40485a
2b7bafb
d9085d4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
language: en
license: mit
library_name: pytorch
---
# Plainly Optimized Network
Dataset: BIGBENCH

Trainer Hyperparameters:
- `lr` = 5e-05
- `per_device_batch_size` = 8
- `gradient_accumulation_steps` = 2
- `weight_decay` = 0.0
- `seed` = 42

|eval_loss|eval_accuracy|epoch|
|--|--|--|
|10.379|0.571|1.0|
|9.388|0.643|2.0|
|10.286|0.571|3.0|
|10.324|0.571|4.0|
|10.254|0.571|5.0|
|10.166|0.571|6.0|
|10.122|0.571|7.0|
|10.020|0.571|8.0|
|10.035|0.571|9.0|
|9.961|0.571|10.0|
|9.963|0.571|11.0|
|9.962|0.571|12.0|
|9.990|0.500|13.0|
|10.817|0.571|14.0|
|10.030|0.571|15.0|
|10.049|0.571|16.0|
|10.057|0.571|17.0|
|10.067|0.571|18.0|
|10.080|0.571|19.0|