adds no think tag correctly
- NO_THINK_TAG_GUIDE.md +146 -0
- config/runpod_config.py +2 -2
- config/train_smollm3.py +2 -2
- config/train_smollm3_long_context.py +2 -2
- config/train_smollm3_no_think_test.py +38 -0
- config/train_smollm3_openhermes_fr.py +2 -2
- config/train_smollm3_openhermes_fr_a100_balanced.py +2 -2
- config/train_smollm3_openhermes_fr_a100_large.py +2 -2
- config/train_smollm3_openhermes_fr_a100_max_performance.py +2 -2
- config/train_smollm3_openhermes_fr_a100_multiple_passes.py +2 -2
- data.py +12 -1
- test_no_think.py +86 -0
NO_THINK_TAG_GUIDE.md
ADDED
@@ -0,0 +1,146 @@
# SmolLM3 `/no_think` Tag Implementation Guide

## The Problem

You were using the `enable_thinking` parameter in the chat template configuration, which is **incorrect** for SmolLM3. The `/no_think` tag should be added as a **system message** in your training data, not as a configuration parameter.

### What was wrong:

```python
# ❌ INCORRECT - This doesn't work for SmolLM3
chat_template_kwargs={
    "enable_thinking": False,  # This parameter doesn't exist in SmolLM3
    "add_generation_prompt": True
}
```

### What's correct:

```python
# ✅ CORRECT - Add /no_think as a system message
messages = [
    {"role": "system", "content": "You are a helpful assistant. /no_think"},
    {"role": "user", "content": "What is machine learning?"},
    {"role": "assistant", "content": "Machine learning is..."}
]
```

## The Solution

### 1. Updated Data Processing

The `data.py` file now properly handles the `/no_think` tag by:

- Adding a system message with `/no_think` when `no_think_system_message=True`
- Using the correct chat template parameters
- Properly formatting messages for SmolLM3

### 2. Updated Configuration

All configuration files now use the correct parameter:

```python
chat_template_kwargs={
    "add_generation_prompt": True,
    "no_think_system_message": True  # Set to True to add /no_think tag
}
```

### 3. How It Works

When `no_think_system_message=True`, the system automatically adds:

```
{"role": "system", "content": "You are a helpful assistant. /no_think"}
```

as the first message in each conversation.
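The prepending behavior can be sketched as a small standalone helper (a hypothetical illustration of the logic, not the exact `data.py` code; the function name is made up):

```python
def add_no_think_system_message(messages, no_think=True):
    """Prepend a default system message, appending /no_think when requested."""
    if messages and messages[0]["role"] == "system":
        return messages  # an existing system message is left untouched
    content = "You are a helpful assistant."
    if no_think:
        content += " /no_think"
    return [{"role": "system", "content": content}] + messages

conv = [{"role": "user", "content": "What is 2+2?"}]
print(add_no_think_system_message(conv)[0]["content"])
# → You are a helpful assistant. /no_think
```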
## Testing the Fix

### 1. Run the Test Script

```bash
python test_no_think.py
```

This will show you the difference between:

- Messages with the `/no_think` tag
- Messages without the `/no_think` tag

### 2. Test with Training

```bash
python train.py config/train_smollm3_no_think_test.py --dataset_dir=test_data
```

This uses a configuration that enables the `/no_think` tag.
## Configuration Options

### Enable the `/no_think` tag:

```python
chat_template_kwargs={
    "add_generation_prompt": True,
    "no_think_system_message": True  # Adds /no_think to the system message
}
```

### Disable the `/no_think` tag (allow thinking):

```python
chat_template_kwargs={
    "add_generation_prompt": True,
    "no_think_system_message": False  # No /no_think tag
}
```
## When to Use `/no_think`

- **Use `/no_think`** for tasks that require direct, concise responses
- **Don't use `/no_think`** for tasks that benefit from reasoning and step-by-step thinking

### Examples:

**With `/no_think` (direct answers):**

```
System: You are a helpful assistant. /no_think
User: What is 2+2?
Assistant: 4
```

**Without `/no_think` (reasoning allowed):**

```
System: You are a helpful assistant.
User: Solve this math problem step by step: 15 * 7
Assistant: Let me solve this step by step:
1. First, I'll break down 15 * 7
2. 15 * 7 = (10 + 5) * 7
3. = 10 * 7 + 5 * 7
4. = 70 + 35
5. = 105
The answer is 105.
```
## Updated Files

The following files were updated to fix the `/no_think` tag issue:

1. `data.py` - Updated `format_chat_template` function
2. `config/train_smollm3.py` - Updated default configuration
3. `config/train_smollm3_openhermes_fr.py` - Updated configuration
4. `config/train_smollm3_long_context.py` - Updated configuration
5. `config/runpod_config.py` - Updated configuration
6. All A100 configuration files - Updated configurations

## Verification

To verify the fix is working:

1. Check that system messages include `/no_think` when `no_think_system_message=True`
2. Verify that the chat template is applied correctly
3. Test with actual training to ensure the model learns the `/no_think` behavior
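For step 1 above, a minimal string-level spot check can be run without any tokenizer; this sketch simply rebuilds the system message the way the configuration flag is meant to drive it:

```python
# Hypothetical spot check mirroring the configuration flag's intent.
chat_template_kwargs = {"add_generation_prompt": True, "no_think_system_message": True}

system_content = "You are a helpful assistant."
if chat_template_kwargs.get("no_think_system_message"):
    system_content += " /no_think"

assert system_content.endswith("/no_think"), "missing /no_think in system message"
print("system message check passed")
```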
## References

- [SmolLM3 Model Card](https://huggingface.co/HuggingFaceTB/SmolLM3-3B)
- [SmolLM3 Documentation](https://huggingface.co/docs/transformers/model_doc/smollm3)
config/runpod_config.py
CHANGED
@@ -41,7 +41,7 @@ config = SmolLM3Config(
     # Chat template configuration
     use_chat_template=True,
     chat_template_kwargs={
-        "enable_thinking": False,
-        "add_generation_prompt": True
+        "add_generation_prompt": True,
+        "no_think_system_message": True  # Set to True to add /no_think tag
     }
 )
config/train_smollm3.py
CHANGED
@@ -80,8 +80,8 @@ class SmolLM3Config:
     def __post_init__(self):
         if self.chat_template_kwargs is None:
             self.chat_template_kwargs = {
-                "enable_thinking": False,
-                "add_generation_prompt": True
+                "add_generation_prompt": True,
+                "no_think_system_message": True  # Set to True to add /no_think tag
             }
 
         # Validate configuration
config/train_smollm3_long_context.py
CHANGED
@@ -32,7 +32,7 @@ config = SmolLM3Config(
     # Chat template configuration
     use_chat_template=True,
     chat_template_kwargs={
-        "enable_thinking": False,
-        "add_generation_prompt": True
+        "add_generation_prompt": True,
+        "no_think_system_message": True  # Allow thinking for long context tasks
     }
 )
config/train_smollm3_no_think_test.py
ADDED
@@ -0,0 +1,38 @@
"""
SmolLM3 Training Configuration with /no_think tag
Test configuration to verify /no_think tag functionality
"""

from config.train_smollm3 import SmolLM3Config

config = SmolLM3Config(
    # Model configuration
    model_name="HuggingFaceTB/SmolLM3-3B",
    max_seq_length=4096,
    use_flash_attention=True,
    use_gradient_checkpointing=True,

    # Training configuration
    batch_size=2,
    gradient_accumulation_steps=4,
    learning_rate=2e-5,
    weight_decay=0.01,
    warmup_steps=100,
    max_iters=100,  # Short test run

    # Mixed precision
    fp16=True,
    bf16=False,

    # Logging and saving
    save_steps=50,
    eval_steps=25,
    logging_steps=10,

    # Chat template configuration with /no_think tag
    use_chat_template=True,
    chat_template_kwargs={
        "add_generation_prompt": True,
        "no_think_system_message": True  # Enable /no_think tag
    }
)
config/train_smollm3_openhermes_fr.py
CHANGED
@@ -89,8 +89,8 @@ class SmolLM3ConfigOpenHermesFR(SmolLM3Config):
     def __post_init__(self):
         if self.chat_template_kwargs is None:
             self.chat_template_kwargs = {
-                "enable_thinking": False,
-                "add_generation_prompt": True
+                "add_generation_prompt": True,
+                "no_think_system_message": True  # Set to True to add /no_think tag
             }
 
         # Validate configuration
config/train_smollm3_openhermes_fr_a100_balanced.py
CHANGED
@@ -104,8 +104,8 @@ class SmolLM3ConfigOpenHermesFRBalanced(SmolLM3Config):
     def __post_init__(self):
         if self.chat_template_kwargs is None:
             self.chat_template_kwargs = {
-                "enable_thinking": False,
-                "add_generation_prompt": True
+                "add_generation_prompt": True,
+                "no_think_system_message": True  # Set to True to add /no_think tag
             }
 
         # Validate configuration
config/train_smollm3_openhermes_fr_a100_large.py
CHANGED
@@ -105,8 +105,8 @@ class SmolLM3ConfigOpenHermesFRA100Large(SmolLM3Config):
     def __post_init__(self):
         if self.chat_template_kwargs is None:
             self.chat_template_kwargs = {
-                "enable_thinking": False,
-                "add_generation_prompt": True
+                "add_generation_prompt": True,
+                "no_think_system_message": True  # Set to True to add /no_think tag
             }
 
         # Validate configuration
config/train_smollm3_openhermes_fr_a100_max_performance.py
CHANGED
@@ -105,8 +105,8 @@ class SmolLM3ConfigOpenHermesFRMaxPerformance(SmolLM3Config):
     def __post_init__(self):
         if self.chat_template_kwargs is None:
             self.chat_template_kwargs = {
-                "enable_thinking": False,
-                "add_generation_prompt": True
+                "add_generation_prompt": True,
+                "no_think_system_message": True  # Set to True to add /no_think tag
             }
 
         # Validate configuration
config/train_smollm3_openhermes_fr_a100_multiple_passes.py
CHANGED
@@ -106,8 +106,8 @@ class SmolLM3ConfigOpenHermesFRMultiplePasses(SmolLM3Config):
     def __post_init__(self):
         if self.chat_template_kwargs is None:
             self.chat_template_kwargs = {
-                "enable_thinking": False,
-                "add_generation_prompt": True
+                "add_generation_prompt": True,
+                "no_think_system_message": True  # Set to True to add /no_think tag
             }
 
         # Validate configuration
data.py
CHANGED
@@ -147,11 +147,22 @@ class SmolLM3Dataset:
             # Fallback: treat as plain text
             return {"text": str(example)}
 
+        # Add system message with /no_think tag if not present
+        if messages and messages[0]["role"] != "system":
+            # Check if we should add /no_think tag based on configuration
+            system_content = "You are a helpful assistant."
+            if hasattr(self, 'chat_template_kwargs') and self.chat_template_kwargs:
+                # If no_think_system_message is True, add /no_think tag
+                if self.chat_template_kwargs.get("no_think_system_message") == True:
+                    system_content = "You are a helpful assistant. /no_think"
+
+            messages.insert(0, {"role": "system", "content": system_content})
+
         # Apply chat template
         text = self.tokenizer.apply_chat_template(
             messages,
             tokenize=False,
-            add_generation_prompt=True
+            add_generation_prompt=self.chat_template_kwargs.get("add_generation_prompt", True)
         )
         return {"text": text}
     except Exception as e:
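The new branch can be exercised without downloading the model by substituting a stub tokenizer; `StubTokenizer` and its template format below are stand-ins (assumptions), not the real SmolLM3 chat template:

```python
class StubTokenizer:
    """Minimal stand-in mimicking only apply_chat_template's signature."""
    def apply_chat_template(self, messages, tokenize=False, add_generation_prompt=True):
        lines = [f"<|{m['role']}|>{m['content']}" for m in messages]
        if add_generation_prompt:
            lines.append("<|assistant|>")
        return "\n".join(lines)

chat_template_kwargs = {"add_generation_prompt": True, "no_think_system_message": True}
messages = [{"role": "user", "content": "What is machine learning?"}]

# Same insertion logic as the diff above
if messages and messages[0]["role"] != "system":
    system_content = "You are a helpful assistant."
    if chat_template_kwargs.get("no_think_system_message"):
        system_content = "You are a helpful assistant. /no_think"
    messages.insert(0, {"role": "system", "content": system_content})

text = StubTokenizer().apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=chat_template_kwargs.get("add_generation_prompt", True),
)
assert "/no_think" in text
print(text.splitlines()[0])
# → <|system|>You are a helpful assistant. /no_think
```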
test_no_think.py
ADDED
@@ -0,0 +1,86 @@
#!/usr/bin/env python3
"""
Test script to verify /no_think tag handling in SmolLM3
"""

import sys
import os
sys.path.append(os.path.dirname(os.path.abspath(__file__)))

from transformers import AutoTokenizer
from data import SmolLM3Dataset

def test_no_think_tag():
    """Test that /no_think tag is properly applied"""

    # Load tokenizer
    tokenizer = AutoTokenizer.from_pretrained("HuggingFaceTB/SmolLM3-3B")

    # Test data
    test_data = [
        {
            "messages": [
                {"role": "user", "content": "What is machine learning?"},
                {"role": "assistant", "content": "Machine learning is a subset of AI..."}
            ]
        }
    ]

    # Test with no_think_system_message=True
    print("=== Testing with no_think_system_message=True ===")
    dataset_with_no_think = SmolLM3Dataset(
        data_path="test_data",
        tokenizer=tokenizer,
        max_seq_length=4096,
        use_chat_template=True,
        chat_template_kwargs={
            "add_generation_prompt": True,
            "no_think_system_message": True
        }
    )

    # Test with no_think_system_message=False
    print("\n=== Testing with no_think_system_message=False ===")
    dataset_without_no_think = SmolLM3Dataset(
        data_path="test_data",
        tokenizer=tokenizer,
        max_seq_length=4096,
        use_chat_template=True,
        chat_template_kwargs={
            "add_generation_prompt": True,
            "no_think_system_message": False
        }
    )

    # Test manual chat template application
    print("\n=== Manual chat template test ===")
    messages = [
        {"role": "user", "content": "What is machine learning?"},
        {"role": "assistant", "content": "Machine learning is a subset of AI..."}
    ]

    # Without /no_think
    text_without = tokenizer.apply_chat_template(
        messages,
        tokenize=False,
        add_generation_prompt=True
    )
    print("Without /no_think:")
    print(text_without[:200] + "..." if len(text_without) > 200 else text_without)

    # With /no_think
    messages_with_system = [
        {"role": "system", "content": "You are a helpful assistant. /no_think"},
        {"role": "user", "content": "What is machine learning?"},
        {"role": "assistant", "content": "Machine learning is a subset of AI..."}
    ]
    text_with = tokenizer.apply_chat_template(
        messages_with_system,
        tokenize=False,
        add_generation_prompt=True
    )
    print("\nWith /no_think:")
    print(text_with[:200] + "..." if len(text_with) > 200 else text_with)

if __name__ == "__main__":
    test_no_think_tag()