Regression in Reasoning Tag Output - Missing <think> in Model Responses

#181
by divinerapier - opened

Environment:

Issue Description:
The model has stopped outputting the <think> tag in responses, which was previously available in revision f7361cd. This breaks expected chain-of-thought behavior.

Reproduction Steps:

  1. Send prompt:
    Hi
    
  2. Response:

Expected Output (from revision f7361cd):

"<think>\n\n</think>\n\nHello! How can I assist you today? 😊"

Actual Output (latest revision):

"Okay, the user just said \"Hi\". That's a friendly greeting. I should respond in a warm and welcoming manner. Maybe say something like, \"Hello! How can I assist you today?\" to invite them to ask for help. Keep it open-ended so they feel comfortable to share what they need. Make sure to maintain a helpful and approachable tone.\n\nI need to check if there are any specific guidelines about responding to greetings. The instructions mention being helpful, using clear language, and avoiding markdown. So keep the response simple and professional. No need for any formatting here. Just a straightforward reply.\n</think>\n\nHello! How can I assist you today?"

try this https://huggingface.co/deepseek-ai/DeepSeek-R1/discussions/144,
My method was to delete the last in tokenizer_config.json

Sign up or log in to comment