iAkashPaul commited on
Commit
04cab8b
1 Parent(s): 209ef73

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -3
README.md CHANGED
@@ -13,7 +13,8 @@ This model from [Telugu-LLM-Labs](https://huggingface.co/Telugu-LLM-Labs/Indic-g
13
  git clone https://huggingface.co/iAkashPaul/Indic-gemma-2b-finetuned-sft-Navarasa-GGUF # & cd into it, update paths accordingly
14
 
15
  ./main --file prompt.md --lora ./models/ggml-adapter-model.bin --lora-base ./models/indic-llm_Q8.gguf
16
- ./server --lora ./models/ggml-adapter-model.bin --lora-base ./models/indic-llm_Q8.gguf -m ./models/indic-llm_Q8.gguf
 
17
 
18
  ```
19
 
@@ -32,8 +33,17 @@ Save this to a file(ex. prompt.md) & load it with the main executable.
32
  ## Performance
33
 
34
  * LORA+BASE (not merged)
35
- ![](indic-llm-q8.png)
 
 
 
 
 
36
 
37
  * Merged model
38
 
39
- #ToDo
 
 
 
 
 
13
  git clone https://huggingface.co/iAkashPaul/Indic-gemma-2b-finetuned-sft-Navarasa-GGUF # & cd into it, update paths accordingly
14
 
15
  ./main --file prompt.md --lora ./models/ggml-adapter-model.bin --lora-base ./models/indic-llm_Q8.gguf
16
+
17
+ ./main --file prompt.md -m ./models/merged_indic_llm_Q8.gguf -ngl 99
18
 
19
  ```
20
 
 
33
  ## Performance
34
 
35
  * LORA+BASE (not merged)
36
+
37
+ * ```
38
+ ./server --lora ./models/ggml-adapter-model.bin --lora-base ./models/indic-llm_Q8.gguf -m ./models/indic-llm_Q8.gguf
39
+ ```
40
+
41
+ * ![](indic-llm-q8.png)
42
 
43
  * Merged model
44
 
45
+ * ```
46
+ ./server -ngl 20 -m ./models/merged_indic_llm_Q8.gguf
47
+ ```
48
+
49
+ * ![](Q8-75tok.png)