vickiitb
/

hermes-2-pro-llama-3-8b-bnb-4bit

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

Model Card for Model ID

This model is a 4bit quantified version of Hermes-2-Pro-Llama-3-8b and is meant to be used with tool_calls which are supported by Hermes.

It is faster to respond to queries using the tool_call construct even when working with external vector databases

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

Developed by: Vikas Lal Sahita
Funded by [optional]: Self
Shared by [optional]: [More Information Needed]
Model type: [More Information Needed]
Language(s) (NLP): [More Information Needed]
License: [More Information Needed]
Finetuned from model [optional]: [More Information Needed]

Downloads last month: 5

Safetensors

Model size

4.65B params

Tensor type

FP16

·

F32

·

U8

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for vickiitb/hermes-2-pro-llama-3-8b-bnb-4bit

Base model

NousResearch/Meta-Llama-3-8B

Finetuned

NousResearch/Hermes-2-Pro-Llama-3-8B

Quantized

(36)

this model