denial07 commited on
Commit
c629bb9
·
verified ·
1 Parent(s): 385d9c9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -5,7 +5,8 @@ license_link: https://huggingface.co/Qwen/Qwen2-72B-Instruct/resolve/main/LICENS
5
  ---
6
 
7
  ### LogicKor Benchmark (24.07.31)
8
- | Rank (1-shot) | Model | Reasoning | Math | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total |
 
9
  |------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|
10
  | 1 | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 |
11
  | 2 | anthropic/claude-3-5-sonnet-20240620 | 8.64 | 8.42 | 9.85 | 9.78 | 9.92 | 9.21 | 9.26 | 9.35 | 9.30 |
 
5
  ---
6
 
7
  ### LogicKor Benchmark (24.07.31)
8
+ * 1-shot only
9
+ | Rank | Model | Reasoning | Math | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total |
10
  |------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|
11
  | 1 | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 |
12
  | 2 | anthropic/claude-3-5-sonnet-20240620 | 8.64 | 8.42 | 9.85 | 9.78 | 9.92 | 9.21 | 9.26 | 9.35 | 9.30 |