UnstableLlama/Rombos-LLM-V2.6-Nemotron-70b-exl2

UnstableLlama committed on Oct 18, 2024

Commit a266222 · verified · 1 Parent(s): 74253ab

Update README.md

Files changed (1)

README.md +10 -0

@@ -3,6 +3,16 @@ license: llama3.1
 ---
 # Rombos-LLM-V2.6-Nemotron-70b
+---
+<p><h2>ExLlamaV2 Quantization</h2></p>
+<p>Quantized with <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.2.2">ExLlamaV2 v0.2.3</a></p>
+[2.2 Bits Per Weight](https://huggingface.co/UnstableLlama/Rombos-LLM-V2.6-Nemotron-70b-exl2/tree/2_2)
+[4.65 Bits Per Weight](https://huggingface.co/UnstableLlama/Rombos-LLM-V2.6-Nemotron-70b-exl2/tree/4_65)
+---
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/lLAQ7FGTYDz0xT8rs9-ti.jpeg)
 I applied the last step of my continuous finetuning method to the Nemotron-70b model from Nvidia. More details below:
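
The added README section links each quantization to its own branch (`2_2`, `4_65`). As a rough sketch of how those links could be used, the snippet below maps a bits-per-weight label to the branch name and tree URL, and estimates the size of the quantized weights. The helper names are hypothetical, the 70e9 parameter count is an assumption read off the "70b" in the model name, and the estimate ignores the KV cache and any layers kept at a different precision. The `download` helper assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function with the branch passed as `revision`.

```python
REPO_ID = "UnstableLlama/Rombos-LLM-V2.6-Nemotron-70b-exl2"


def branch_for_bpw(bpw: str) -> str:
    """Map a bits-per-weight label such as "4.65" to the branch name "4_65"."""
    return bpw.replace(".", "_")


def tree_url(bpw: str) -> str:
    """Build the branch URL in the same form as the links in the README."""
    return f"https://huggingface.co/{REPO_ID}/tree/{branch_for_bpw(bpw)}"


def approx_weight_gb(bpw: float, n_params: float = 70e9) -> float:
    """Rough weight size in GB (10^9 bytes): parameters * bits / 8.

    n_params is an assumption based on the "70b" in the model name.
    """
    return n_params * bpw / 8 / 1e9


def download(bpw: str, local_dir: str) -> str:
    """Download one quantization branch (needs network and huggingface_hub)."""
    # Imported here so the helpers above work without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=branch_for_bpw(bpw),
        local_dir=local_dir,
    )


if __name__ == "__main__":
    for label in ("2.2", "4.65"):
        print(label, tree_url(label), f"~{approx_weight_gb(float(label)):.1f} GB")
```

Calling `download("4.65", "./rombos-4_65")` would fetch only the 4.65 bits-per-weight branch; the target directory name here is just an example.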