QuantFactory/Llama2-7B-Hindi-finetuned-GGUF

This is quantized version of subhrokomol/Llama2-7B-Hindi-finetuned created using llama.cpp

Original Model Card

This model was trained on 24GB of RTX A500 on zicsx/mC4-Hindi-Cleaned-3.0 dataset (1%) for 3 hours.

We used Hugging Face PEFT-LoRA PyTorch for training.

Transtokenization process in --

GGUF

Model size

6.74B params

Architecture

llama

Hardware compatibility

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Quantized

(65)

this model