File size: 5,891 Bytes
c5afbf5
 
 
7d5d533
c5afbf5
 
 
 
 
869c1c7
 
 
 
 
 
c5afbf5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7d5d533
 
 
 
 
 
 
 
9dfc55d
 
 
 
 
e9585fe
9dfc55d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
07ab39c
 
 
b806593
444be3f
 
 
 
 
 
 
 
b806593
 
 
 
 
 
 
 
 
 
 
07ab39c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
HEADER_TITLE = """
<div style="display: flex; flex-wrap: wrap; justify-content: space-around; align-items: center;">
    <div>
        <h1 align="center" id="space-title">Adam Mickiewicz University's<br>Center for Artificial Intelligence</h1>
        <h2 align="center" id="space-subtitle">LLMzSzŁ: a comprehensive LLM benchmark for Polish</h2>
    </div>
</div>
"""

MAIN_DESC = """## Overall scores

**LLMzSzŁ** (_LLM-y za Szkolną Ławą_, LLMs Behind the School Desk) is a comprehensive LLM benchmark for Polish,
based on a collection of Polish national exams, including both academic
and professional tests extracted from the archives of the Polish Central Examination Board.
"""

GIM_DESC = """### Middle School Exam

The **Middle School Exam** (*egzamin gimnazjalny*) was formerly a key component of Poland’s secondary education system. This exam was taken by students completing their lower secondary education.

#### Subjects Covered

This exam assessed students in core subjects like:

- **Polish Language**
- **Mathematics**
- **Science**
- **Foreign Language**

preparing them for higher levels of education.

#### Recent Changes

Due to recent educational reforms in Poland:

- **Middle Schools Phased Out**: Middle schools have been removed from the education system.
- **Exam Discontinued**: With this change, the Middle School Exam has also been discontinued.
"""

MAT_DESC = """### High School Exam

The **High School Exam** (*matura*) is one of the most critical exams in Poland. This exam serves as a prerequisite for higher education and is a significant milestone in students' academic journeys.

#### Mandatory Subjects

The *matura* exam includes essential subjects:

- **Polish Language**
- **Mathematics**
- **Foreign Language**

#### Additional Subjects

Students may also choose additional subjects based on:

- **Areas of Interest**: Students select subjects aligned with their strengths or future studies.
- **University Requirements**: Specific subjects may be necessary for admission to certain university programs.

#### Minimum points required
To pass the High School Exam, students needed to achieve a minimum score of **30%** in mandatory subjects. Additional subjects do not require a minimum score to pass, but the results significantly impact the points needed for university admission.
"""

OSM_DESC = """### 8th-Grade Exam

The **8th-grade exam** (*egzamin ósmoklasisty*) is a standardized assessment taken by Polish students at the end of their primary education (8th grade).

#### Subjects Covered

This exam mainly includes tests in:

- **Polish Language**
- **Mathematics**
- **Foreign Language**

#### Purpose and Role

- **Secondary Education Foundation**: Provides a foundation for secondary education placements, guiding students to the next stage of their academic journey.
- **Student Assessment**: Evaluates core competencies, ensuring readiness for high school-level studies.
"""

ZAW_DESC = """### Professional Exam

The **Professional Exam** (*egzamin zawodowy*) is tailored for students pursuing vocational education in Poland. This exam is typically taken after completing vocational training programs, which may range from:

- **Technical High Schools**
- **Post-secondary Vocational Courses**

#### Purpose and Importance

- **Certification**: A successful score on this exam certifies a student’s qualifications in a specific trade or profession.
- **Career Readiness**: Enables students to enter the workforce directly.
- **Further Education**: Offers a foundation for pursuing specialized training.
"""

ABOUT = """## ABOUT

LLMzSzŁ (_LLM-y za Szkolną Ławą_, LLMs Behind the School Desk) is a comprehensive LLM benchmark for Polish,
based on a collection of Polish national exams, including both academic
and professional tests extracted from the archives of the [Polish Central Examination Board](https://cke.gov.pl).

The benchmark was prepared by Adam Mickiewicz University's Center for Artificial Intelligence.
"""

DATASET_TABLE = """
$$
\\begin{array}{|c|c|c|c|}
\\hline
\\textbf{Exam} & \\textbf{Discipline} & \\textbf{Number of Questions} & \\textbf{Original Number of Questions} \\\\
\\hline
8th-Grade Exam & Polish Language & 15 & 101\\\\
8th-Grade Exam & Mathematics & 35 & 99\\\\
Middle School Exam & Mathematics & 110 & 304\\\\
Middle School Exam & Nature & 65 & 226\\\\
High School Exam & Biology & 21 & 169\\\\
High School Exam & Physics & 136 & 399\\\\
High School Exam & Mathematics & 220 & 492\\\\
Professional Exam & Arts & 1778 & 3480\\\\
Professional Exam & Mechanical, Mining and Metallurgical & 9097 & 30240\\\\
Professional Exam & Agriculture and Forestry & 7344 & 23200\\\\
\\hline
\\end{array}
$$
"""

CITATION_LABEL = "Copy the following snippet to cite these results"

CITATION_CONTENT = """
@misc{eval-harness,
      title        = {A framework for few-shot language model evaluation},  
      author       = {Gao, Leo and Tow, Jonathan and Abbasi, Baber and Biderman, Stella and Black, Sid and DiPofi, Anthony and Foster, Charles and Golding, Laurence and Hsu, Jeffrey and Le Noac'h, Alain and Li, Haonan and McDonell, Kyle and Muennighoff, Niklas and Ociepa, Chris and Phang, Jason and Reynolds, Laria and Schoelkopf, Hailey and Skowron, Aviya and Sutawika, Lintang and Tang, Eric and Thite, Anish and Wang, Ben and Wang, Kevin and Zou, Andy},
      year         = 2023,
      publisher    = {Zenodo},
      doi          = {10.5281/zenodo.10256836},
      url          = {https://zenodo.org/records/10256836}
},
@inproceedings{gralinski-2019-geval,
    title = "{GE}val: Tool for Debugging {NLP} Datasets and Models",
    author = "Grali{\'n}ski, Filip  and
      Wr{\'o}blewska, Anna  and
      Stanis{\l}awek, Tomasz  and
      Grabowski, Kamil  and
      G{\'o}recki, Tomasz",
    year = "2019",
    doi = "10.18653/v1/W19-4826",
    url = "https://aclanthology.org/W19-4826/",
}
}"""