Update tokenizer_config.json
Use the official chat template, which inserts a <think> tag
Can you please go back to DeepSeek's official chat template, which includes a <think>
to force the model to start its output with <think>?
Otherwise it often skips thinking, or mixes the thinking in with the response.
https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/blob/main/tokenizer_config.json#L34
Thanks @erichartford for pointing this out. Did DeepSeek change the tokenizer? Just want to make sure we understand why it diverged
@erichartford
Great catch, thanks a lot for pointing it out! Indeed, DeepSeek updated tokenizer_config.json
with this diff 14 days ago, but we forked their model for quantization on the initial release day, so we hadn't picked up the change. We will update all of our quantized models.
Nice, thank you
Note that a lot of UIs depend on the opening <think> tag,
which the model will no longer generate (the template now appends it to the prompt instead). Providers may want to insert a <think>
at the beginning of the response.
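For providers who need to restore that behavior, a minimal sketch of the idea: prepend the opening <think> tag to the raw completion when the model did not emit it itself. The function name `ensure_think_prefix` is hypothetical, not part of any official API.

```python
def ensure_think_prefix(response: str, tag: str = "<think>") -> str:
    """Prepend the opening <think> tag if the completion lacks it.

    With the updated chat template, <think> is appended to the prompt
    rather than generated by the model, so the raw completion starts
    directly with the reasoning text. UIs that parse <think>...</think>
    blocks can normalize the output with a helper like this.
    (Hypothetical helper, shown for illustration.)
    """
    if response.lstrip().startswith(tag):
        # Model (or an upstream layer) already emitted the tag.
        return response
    return tag + response
```

A provider would apply this once per completion before streaming it to the client, e.g. `ensure_think_prefix("reasoning...</think>answer")` returns `"<think>reasoning...</think>answer"`, while already-tagged output is passed through unchanged.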