URP/urllm-ko-7b

ChangheeCho committed on Mar 25, 2024

Commit 96ee769 · verified · 1 Parent(s): cf1aa9a

Update README.md

Files changed (1)

README.md +9 -5

@@ -15,14 +15,18 @@ tags:
 ## Model Details
-**Input:** The model accepts only text input.
-**Output:** The model produces text output exclusively.
 **Model Architecture:**
 urLLM-KO-7B is an auto-regressive language model that leverages an optimized transformer architecture derived from Llama-2-7b.
 **Training Corpus**
-The model was trained using selected datasets from Modu Corpus and Korean Wikipedia (approximately 27GB).

 ## Model Details
 **Model Architecture:**
 urLLM-KO-7B is an auto-regressive language model that leverages an optimized transformer architecture derived from Llama-2-7b.
 **Training Corpus**
+The model was trained using selected datasets from Modu Corpus and Korean Wikipedia (approximately 28GB).
+**Vocab Expansion**
+| Model Name | Vocabulary Size | Description |
+| --- | --- | --- |
+| Original Solar | 32000 | Sentencepiece BPE |
+| **Expanded urLLM-KO-7B** | 51385 | Sentencepiece BPE. Added Korean vocab and merges |
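
Reviewer note: the two vocabulary sizes in the added table imply how many token rows the expansion introduced. The sketch below works through that arithmetic. The hidden size of 4096 (the Llama-2-7b value) and the untied input/output embedding matrices are assumptions for illustration; the README does not state either, nor how the new rows were initialized.

```python
# Vocabulary sizes taken from the table added in this commit.
ORIGINAL_VOCAB = 32000
EXPANDED_VOCAB = 51385

# Assumption: hidden size of the Llama-2-7b base architecture.
HIDDEN_SIZE = 4096


def added_tokens(original: int, expanded: int) -> int:
    """Number of new token ids introduced by the vocabulary expansion."""
    return expanded - original


def extra_embedding_params(
    original: int, expanded: int, hidden_size: int, tied: bool = False
) -> int:
    """Extra parameters needed for the new token rows.

    With untied embeddings (assumed here), both the input embedding
    matrix and the output projection grow by one row per new token.
    """
    matrices = 1 if tied else 2
    return added_tokens(original, expanded) * hidden_size * matrices


new_rows = added_tokens(ORIGINAL_VOCAB, EXPANDED_VOCAB)
new_params = extra_embedding_params(ORIGINAL_VOCAB, EXPANDED_VOCAB, HIDDEN_SIZE)

print(new_rows)    # 19385
print(new_params)  # 158801920
```

Under these assumptions the expansion adds 19385 tokens and roughly 159M embedding parameters on top of the base model.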