Update README.md
README.md
CHANGED
@@ -7,4 +7,21 @@ sdk: static
 pinned: false
 ---

-
+# Indic Language Benchmarking for Large Language Models
+### India is linguistically diverse, with 22+ languages. This project benchmarks large language models on Indic languages across several datasets. The goal is to evaluate the models' ability to understand, generate, and process text in these languages.
+#### We currently cover 8 languages across 3 datasets, with more coming soon.
+
+## Languages
+- Bengali (bn)
+- Gujarati (gu)
+- Hindi (hi)
+- Kannada (kn)
+- Malayalam (ml)
+- Odia (or)
+- Tamil (ta)
+- Telugu (te)
+
+## Datasets
+- ARC: [hi](https://huggingface.co/datasets/indicbench/arc_hi), [bn](https://huggingface.co/datasets/indicbench/arc_bn), [gu](https://huggingface.co/datasets/indicbench/arc_gu), [kn](https://huggingface.co/datasets/indicbench/arc_kn), [ml](https://huggingface.co/datasets/indicbench/arc_ml), [or](https://huggingface.co/datasets/indicbench/arc_or), [ta](https://huggingface.co/datasets/indicbench/arc_ta), [te](https://huggingface.co/datasets/indicbench/arc_te)
+- TruthfulQA: [hi](https://huggingface.co/datasets/indicbench/truthfulqa_hi), [bn](https://huggingface.co/datasets/indicbench/truthfulqa_bn), [gu](https://huggingface.co/datasets/indicbench/truthfulqa_gu), [kn](https://huggingface.co/datasets/indicbench/truthfulqa_kn), [ml](https://huggingface.co/datasets/indicbench/truthfulqa_ml), [or](https://huggingface.co/datasets/indicbench/truthfulqa_or), [ta](https://huggingface.co/datasets/indicbench/truthfulqa_ta), [te](https://huggingface.co/datasets/indicbench/truthfulqa_te)
+- HellaSwag: [hi](https://huggingface.co/datasets/indicbench/hellaswag_hi), [bn](https://huggingface.co/datasets/indicbench/hellaswag_bn), [gu](https://huggingface.co/datasets/indicbench/hellaswag_gu), [kn](https://huggingface.co/datasets/indicbench/hellaswag_kn), [ml](https://huggingface.co/datasets/indicbench/hellaswag_ml), [or](https://huggingface.co/datasets/indicbench/hellaswag_or), [ta](https://huggingface.co/datasets/indicbench/hellaswag_ta), [te](https://huggingface.co/datasets/indicbench/hellaswag_te)
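
The dataset links listed in the README follow one naming pattern, `indicbench/<dataset>_<lang>`. The sketch below is not part of the commit; it only shows how the 24 repository IDs could be generated from that pattern. The helper names (`repo_id`, `repo_url`, `all_repo_ids`, `load`) are hypothetical, and `load` assumes each repository can be read with the Hugging Face `datasets` library using its default configuration, which the README does not confirm.

```python
# Language codes and dataset names exactly as they appear in the README links.
LANGUAGES = ["bn", "gu", "hi", "kn", "ml", "or", "ta", "te"]
DATASETS = ["arc", "truthfulqa", "hellaswag"]

HUB_PREFIX = "https://huggingface.co/datasets/"


def repo_id(dataset: str, lang: str) -> str:
    """Build a Hub repository ID such as 'indicbench/arc_hi'."""
    if dataset not in DATASETS:
        raise ValueError(f"unknown dataset: {dataset!r}")
    if lang not in LANGUAGES:
        raise ValueError(f"unknown language code: {lang!r}")
    return f"indicbench/{dataset}_{lang}"


def repo_url(dataset: str, lang: str) -> str:
    """Return the dataset page URL in the form used by the README links."""
    return HUB_PREFIX + repo_id(dataset, lang)


def all_repo_ids() -> list[str]:
    """Enumerate every dataset/language combination (3 x 8 = 24 IDs)."""
    return [repo_id(d, lang) for d in DATASETS for lang in LANGUAGES]


def load(dataset: str, lang: str):
    """Assumption: the repository loads with its default configuration.

    Requires `pip install datasets` and network access. Configuration and
    split names are not documented in the README, so inspect the result.
    """
    from datasets import load_dataset

    return load_dataset(repo_id(dataset, lang))


if __name__ == "__main__":
    for rid in all_repo_ids():
        print(rid)
```

Calling `load("arc", "hi")` would then attempt to fetch `indicbench/arc_hi`; if a repository needs an explicit configuration name, pass it as the second argument to `load_dataset` instead.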