arvindabacus
committed on
Update README.md
README.md
CHANGED
@@ -87,6 +87,7 @@ Score vs selected others (sourced from: (https://lmsys.org/blog/2024-04-19-arena
 | Model | Score | 95% Confidence Interval | Average Tokens |
 | :---- | ---------: | ----------: | ------: |
 | GPT-4-Turbo-2024-04-09 | 82.6 | (-1.8, 1.6) | 662 |
+| GPT-4o | 78.3 | (-2.4, 2.1) | 685 |
 | Claude-3-Opus-20240229 | 60.4 | (-3.3, 2.4) | 541 |
 | Gemini-1.5-pro-latest | 72.1 | (-2.3, 2.2) | 630 |
 | **Smaug-Llama-3-70B-Instruct** | 56.7 | (-2.2, 2.6) | 661 |