How to enable the reasoning parser in vLLM (GGUF)

by fullstack - opened Jul 11

Is there a way to enable reasoning parsing in vLLM for this model? This is the command I have so far:

vllm serve ./reka-flash-3.1-rekaquant-q3_k_s.gguf --max-model-len 40000 --disable-log-requests --enable-reasoning --reasoning-parser
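If no built-in parser matches this model's output format, one possible fallback is to split the reasoning from the answer on the client side. The sketch below is only that, not a vLLM feature: it assumes the model wraps its reasoning in `<reasoning>` ... `</reasoning>` tags (check the model card for the actual markers), and `split_reasoning` is a hypothetical helper name.

```python
import re

# Assumed tag names; adjust to whatever markers the model actually emits.
REASONING_OPEN = "<reasoning>"
REASONING_CLOSE = "</reasoning>"


def split_reasoning(text, open_tag=REASONING_OPEN, close_tag=REASONING_CLOSE):
    """Split raw completion text into (reasoning, answer).

    Returns (None, text) when no complete reasoning block is found.
    """
    pattern = re.compile(
        re.escape(open_tag) + r"(.*?)" + re.escape(close_tag), re.DOTALL
    )
    match = pattern.search(text)
    if match is None:
        return None, text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the first reasoning block is treated as the answer.
    answer = (text[: match.start()] + text[match.end():]).strip()
    return reasoning, answer
```

You would call this on the raw completion text returned by the server, for example `reasoning, answer = split_reasoning(raw_text)`.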
