Model Card for Model ID

This model is a 4bit quantified version of Hermes-2-Pro-Llama-3-8b and is meant to be used with tool_calls which are supported by Hermes.

It is faster to respond to queries using the tool_call construct even when working with external vector databases

Model Details

Model Description

This is the model card of a 馃 transformers model that has been pushed on the Hub. This model card has been automatically generated.

  • Developed by: Vikas Lal Sahita
  • Funded by [optional]: Self
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]
Downloads last month
5
Safetensors
Model size
4.65B params
Tensor type
FP16
F32
U8
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for vickiitb/hermes-2-pro-llama-3-8b-bnb-4bit

Quantized
(36)
this model