GGML_ASSERT Error when running llama-embedding with this model

by rayzinnz - opened 4 days ago

4 days ago

When testing with
./llama-embedding -m snowflake-arctic-embed-l-v2.0.F16.gguf -e -p "Hello world" --verbose-prompt
I get this error:
GGML_ASSERT(params.n_batch >= params.n_ctx) failed
Has anyone else hit this, and how did you resolve it?

rayzinnz

4 days ago

OK, found the fix: the context size and batch size must both be specified (e.g. 512 or 8192), with the batch size at least as large as the context size, i.e.
./llama-embedding --batch-size 8192 --ctx-size 8192 -m ...
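
For anyone who wants to check the sizes before launching, here is a rough sketch. It assumes the assertion is the only constraint involved. `check_sizes` is a hypothetical helper, not part of llama.cpp, and the script only prints the command from the first post with the two size flags added rather than running it.

```shell
#!/bin/sh
# Sketch only: check_sizes is a hypothetical helper, not part of llama.cpp.

# Succeeds when the batch size is at least the context size, which is the
# condition GGML_ASSERT(params.n_batch >= params.n_ctx) enforces.
check_sizes() {
    batch_size=$1
    ctx_size=$2
    [ "$batch_size" -ge "$ctx_size" ]
}

CTX_SIZE=8192    # or 512
BATCH_SIZE=8192  # keep this >= CTX_SIZE

if check_sizes "$BATCH_SIZE" "$CTX_SIZE"; then
    # Print the command from the first post with both sizes added.
    echo ./llama-embedding --batch-size "$BATCH_SIZE" --ctx-size "$CTX_SIZE" \
        -m snowflake-arctic-embed-l-v2.0.F16.gguf -e -p '"Hello world"' --verbose-prompt
else
    echo "batch size $BATCH_SIZE is smaller than context size $CTX_SIZE" >&2
fi
```

If the printed command looks right, it can be pasted into the terminal as-is.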

rayzinnz changed discussion status to closed 4 days ago
