Radu1999/Mistral-Instruct-Ukrainian-SFT

Radu1999 committed on Feb 11

Commit 536da40 (1 parent: c0475d8)

Update README.md

Files changed (1)

README.md +10 -2

README.md CHANGED

@@ -5,7 +5,7 @@ license: apache-2.0
 # Ukrainian finetuned Mistral-7B-Instruct-v0.2
-<!-- Supervised finetuning of Mistral-7B-Instruct-v0.2 on ukrainian dataset-->
 ## Instruction format
@@ -24,7 +24,15 @@ This instruction model is based on Mistral-7B-v0.2, a transformer model with the
 - Grouped-Query Attention
 - Sliding-Window Attention
 - Byte-fallback BPE tokenizer
--
 ## 💻 Usage
 ```python

 # Ukrainian finetuned Mistral-7B-Instruct-v0.2
+Supervised finetuning of Mistral-7B-Instruct-v0.2 on Ukrainian datasets.
 ## Instruction format
 - Grouped-Query Attention
 - Sliding-Window Attention
 - Byte-fallback BPE tokenizer
+## Datasets
+- [UA-SQUAD](https://huggingface.co/datasets/FIdo-AI/ua-squad/resolve/main/ua_squad_dataset.json)
+- [Ukrainian StackExchange](https://huggingface.co/datasets/zeusfsx/ukrainian-stackexchange)
+- [UAlpaca Dataset](https://github.com/robinhad/kruk/blob/main/data/cc-by-nc/alpaca_data_translated.json)
+- [Ukrainian Subset from Belebele Dataset](https://github.com/facebookresearch/belebele)
+- [Ukrainian Subset from XQA](https://github.com/thunlp/XQA)
+- TODO - Ukrainian Subset from MKQA
 ## 💻 Usage
 ```python