mihaimasala committed
Commit 8dbc9bd · verified · 1 Parent(s): 2528892

Update README.md

Files changed (1)
  1. README.md +6 -2
README.md CHANGED
@@ -16,11 +16,15 @@ We value:
  In this organization, you can find RoLLM models, based on different underlying models and in different flavours (i.e., foundational, instruct, or chat variants). There are currently four model collections:
  - RoLlama2: Romanian models based on Llama2
  - RoMistral: Romanian models based on Mistral
- - RoGemma: Romanian models based on Gemma
  - RoLlama3: Romanian models based on Llama3
+ - RoLlama3.1: Romanian models based on Llama3.1
+ - RoGemma: Romanian models based on Gemma
+ - RoGemma2: Romanian models based on Gemma2
+

- Furthermore, here you can find data used to train and evaluate LLMs in Romanian. Currently, there are two data collections:
+ Furthermore, here you can find data used to train and evaluate LLMs in Romanian. Currently, there are three data collections:
  - SFT datasets: data used for supervised (instruction) finetuning
+ - Alignment datasets: data used mainly for Direct Preference Optimization (DPO)
  - Evaluation datasets: data used for evaluating LLM in Romanian

  See details in [https://arxiv.org/abs/2406.18266](https://arxiv.org/abs/2406.18266) and [https://arxiv.org/abs/2405.07703](https://arxiv.org/abs/2405.07703).
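As a usage note (not part of the commit itself), a minimal sketch of how one of the instruct models described in the updated README might be loaded and queried with the Hugging Face `transformers` library is shown below. The repository id is an assumption for illustration only; check the organization's model collections for the exact names.

```python
# Minimal sketch: load a RoLLM instruct model and generate a short Romanian reply.
# NOTE: the repo id below is an assumed example, not taken from the commit.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "OpenLLM-Ro/RoLlama3-8b-Instruct"  # assumed repo id, for illustration

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Build a simple prompt with the model's chat template and generate a response.
messages = [{"role": "user", "content": "Ce este un model lingvistic?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(inputs, max_new_tokens=128)

# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```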