https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-405B

#227

by leafspark - opened Aug 15

Discussion

leafspark

Aug 15

•

edited Aug 15

NousResearch/Hermes-3-Llama-3.1-405B

Another huge model by NousResearch, interestingly it's a full parameter fine tune on the base.

They also released a 8B and 70B finetune:
https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-70B
https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B

nicoboss

Aug 15

•

edited Aug 15

What a high-quality model. This is a full parameter finetune of Llama-3.1 405B on par with Llama-3.1 405B Instruct. It uses the ChatML template, supports function calling and has a system prompt to generate structured JSON output.

mradermacher

Owner Aug 16

Queued, but for the 405B, patience is required.

mradermacher changed discussion status to closed Aug 16

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment