No longer compatible with the newest llama.cpp · #22 opened about 1 month ago by bero1985
Update README.md · #21 opened 7 months ago by SalmanFaroz
Update README to fix install of huggingface-cli command · #20 opened 11 months ago by JohanDL
Hardware Requirements for Q4_K_M · 4 replies · #19 opened 12 months ago by ShivanshMathur007
Function calling · 4 replies · #18 opened 12 months ago by ybsid
Cannot load with ctransformers · 2 replies · #17 opened 12 months ago by hattran
I love the mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf · 4 replies · #16 opened 12 months ago by johanteekens
My CPU is only using 50% of its cores. · 2 replies · #15 opened about 1 year ago by jeffwadsworth
Would this run on 32GB RAM & 8GB VRAM? · 1 reply · #14 opened about 1 year ago by Troyanovsky
Weird. Ooga is still not loading after a fresh pull from release. · 3 replies · #13 opened about 1 year ago by moona99
Anyone else seeing similar behavior? I especially like the start "Death, ..." plus some gobbledygook. · 1 reply · #12 opened about 1 year ago by BigDeeper
Could this model be deployed with FastChat? :) · #10 opened about 1 year ago by FlameHunter
How many tokens per second? · 12 replies · #9 opened about 1 year ago by Hoioi
KCPP Frankenstein experimental release for Mixtral · 1 reply · #8 opened about 1 year ago by Nexesenex
Not finding blk.0.ffn_gate.weight. I checked the sha256sum; it matches the Q6_K version. Any thoughts on how to fix this? · 2 replies · #6 opened about 1 year ago by BigDeeper
For the time being, this model works terribly with the unofficial llama.cpp (very bad at answering); the Instruct version is the best LLM so far. · 3 replies · #5 opened about 1 year ago by mirek190
create_tensor: tensor 'blk.0.ffn_gate.weight' not found · 9 replies · #4 opened about 1 year ago by Althenwolf
It works. · 6 replies · #3 opened about 1 year ago by Yuuru
Mixtral Instruct too? · 3 replies · #2 opened about 1 year ago by nbilla
Other quant types. · 2 replies · #1 opened about 1 year ago by dog3-l0ver