lewtun HF staff committed on
Commit
8a65292
1 Parent(s): f6cf98a

Add evaluation results on the rte config and validation split of glue


Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the rte config and validation split of the [glue](https://huggingface.co/datasets/glue) dataset by @JeremiahZ, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-glue-rte-157f21-1508454839).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -22,6 +22,39 @@ model-index:
     - name: Accuracy
       type: accuracy
       value: 0.6895306859205776
+  - task:
+      type: natural-language-inference
+      name: Natural Language Inference
+    dataset:
+      name: glue
+      type: glue
+      config: rte
+      split: validation
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.6823104693140795
+      verified: true
+    - name: Precision
+      type: precision
+      value: 0.7047619047619048
+      verified: true
+    - name: Recall
+      type: recall
+      value: 0.5648854961832062
+      verified: true
+    - name: AUC
+      type: auc
+      value: 0.7394646031580048
+      verified: true
+    - name: F1
+      type: f1
+      value: 0.6271186440677967
+      verified: true
+    - name: loss
+      type: loss
+      value: 0.7001310586929321
+      verified: true
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
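
The precision, recall, and F1 values added in the diff above can be cross-checked against one another, because F1 is the harmonic mean of precision and recall. The following is a minimal Python sketch of that check. It uses only the numbers from the diff; the variable names are illustrative and do not come from the evaluator itself.

```python
import math

# Values copied from the metrics added in this commit.
precision = 0.7047619047619048
recall = 0.5648854961832062
reported_f1 = 0.6271186440677967

# F1 is the harmonic mean of precision and recall.
computed_f1 = 2 * precision * recall / (precision + recall)

# Compare with a tolerance to allow for floating-point rounding.
f1_matches = math.isclose(computed_f1, reported_f1, rel_tol=1e-9)
print(f"computed F1 = {computed_f1:.6f}, matches reported: {f1_matches}")
```

If the check fails for another model card, that would suggest the metrics were computed with different averaging settings or on different predictions, which is worth confirming before accepting the pull request.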