Spaces:
Running
Running
attempts to resolve deployment issues
Browse files- docs/LAUNCH_SCRIPT_UPDATES.md +174 -0
- docs/USERNAME_EXTRACTION_FIX.md +219 -0
- launch.sh +28 -24
- scripts/dataset_tonic/setup_hf_dataset.py +70 -8
- scripts/trackio_tonic/configure_trackio.py +70 -7
- scripts/trackio_tonic/deploy_trackio_space.py +64 -5
docs/LAUNCH_SCRIPT_UPDATES.md
ADDED
@@ -0,0 +1,174 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Launch Script Updates
|
2 |
+
|
3 |
+
This document outlines the updates made to `launch.sh` to work with the new automated Trackio deployment features.
|
4 |
+
|
5 |
+
## Key Changes Made
|
6 |
+
|
7 |
+
### ✅ **Removed Manual Username Input**
|
8 |
+
- **Before**: Script asked for username manually
|
9 |
+
- **After**: Username is automatically extracted from HF token using `whoami()`
|
10 |
+
- **Benefit**: Fewer manual inputs, better user experience
|
11 |
+
|
12 |
+
### ✅ **Updated Token Validation**
|
13 |
+
- **Before**: `validate_hf_token()` only validated token
|
14 |
+
- **After**: `validate_hf_token_and_get_username()` validates token AND extracts username
|
15 |
+
- **Benefit**: Automatic username detection from token
|
16 |
+
|
17 |
+
### ✅ **Updated Deployment Workflow**
|
18 |
+
- **Before**: Passed username manually to deployment script
|
19 |
+
- **After**: Deployment script automatically gets username from token
|
20 |
+
- **Benefit**: Consistent with new automated features
|
21 |
+
|
22 |
+
### ✅ **Enhanced User Feedback**
|
23 |
+
- **Before**: Basic status messages
|
24 |
+
- **After**: Clear information about automated features
|
25 |
+
- **Benefit**: Users understand what's happening automatically
|
26 |
+
|
27 |
+
## Updated Workflow
|
28 |
+
|
29 |
+
### **Step 1: Authentication (Simplified)**
|
30 |
+
```bash
|
31 |
+
# Before: Asked for username + token
|
32 |
+
get_input "Hugging Face username" "" HF_USERNAME
|
33 |
+
get_input "Hugging Face token" "" HF_TOKEN
|
34 |
+
|
35 |
+
# After: Only asks for token, username auto-detected
|
36 |
+
get_input "Hugging Face token" "" HF_TOKEN
|
37 |
+
# Username automatically extracted from token
|
38 |
+
```
|
39 |
+
|
40 |
+
### **Step 9: Trackio Space Deployment (Automated)**
|
41 |
+
```bash
|
42 |
+
# Before: Manual input file creation
|
43 |
+
cat > deploy_input.txt << EOF
|
44 |
+
$HF_USERNAME
|
45 |
+
$TRACKIO_SPACE_NAME
|
46 |
+
$HF_TOKEN
|
47 |
+
$GIT_EMAIL
|
48 |
+
$HF_USERNAME
|
49 |
+
EOF
|
50 |
+
python deploy_trackio_space.py < deploy_input.txt
|
51 |
+
|
52 |
+
# After: Direct input with automated features
|
53 |
+
python deploy_trackio_space.py << EOF
|
54 |
+
$TRACKIO_SPACE_NAME
|
55 |
+
$HF_TOKEN
|
56 |
+
$GIT_EMAIL
|
57 |
+
$HF_USERNAME
|
58 |
+
EOF
|
59 |
+
```
|
60 |
+
|
61 |
+
### **Step 10: Dataset Setup (Automated)**
|
62 |
+
```bash
|
63 |
+
# Before: Basic dataset setup
|
64 |
+
python setup_hf_dataset.py
|
65 |
+
|
66 |
+
# After: Automated dataset setup with user feedback
|
67 |
+
print_info "Setting up HF Dataset with automated features..."
|
68 |
+
print_info "Username will be auto-detected from token"
|
69 |
+
print_info "Dataset repository: $TRACKIO_DATASET_REPO"
|
70 |
+
python setup_hf_dataset.py
|
71 |
+
```
|
72 |
+
|
73 |
+
### **Step 11: Trackio Configuration (Automated)**
|
74 |
+
```bash
|
75 |
+
# Before: Basic configuration
|
76 |
+
python configure_trackio.py
|
77 |
+
|
78 |
+
# After: Automated configuration with user feedback
|
79 |
+
print_info "Configuring Trackio with automated features..."
|
80 |
+
print_info "Username will be auto-detected from token"
|
81 |
+
python configure_trackio.py
|
82 |
+
```
|
83 |
+
|
84 |
+
## New Function: `validate_hf_token_and_get_username()`
|
85 |
+
|
86 |
+
```bash
|
87 |
+
validate_hf_token_and_get_username() {
|
88 |
+
local token="$1"
|
89 |
+
if [ -z "$token" ]; then
|
90 |
+
return 1
|
91 |
+
fi
|
92 |
+
|
93 |
+
# Test the token and get username
|
94 |
+
export HF_TOKEN="$token"
|
95 |
+
if huggingface-cli whoami >/dev/null 2>&1; then
|
96 |
+
# Get username from whoami command
|
97 |
+
HF_USERNAME=$(huggingface-cli whoami | head -n1 | tr -d '\n')
|
98 |
+
return 0
|
99 |
+
else
|
100 |
+
return 1
|
101 |
+
fi
|
102 |
+
}
|
103 |
+
```
|
104 |
+
|
105 |
+
## User Experience Improvements
|
106 |
+
|
107 |
+
### ✅ **Fewer Manual Inputs**
|
108 |
+
- Only need to provide HF token
|
109 |
+
- Username automatically detected
|
110 |
+
- Git email still required (for git operations)
|
111 |
+
|
112 |
+
### ✅ **Better Feedback**
|
113 |
+
- Clear messages about automated features
|
114 |
+
- Shows what's happening automatically
|
115 |
+
- Better error messages
|
116 |
+
|
117 |
+
### ✅ **Consistent Automation**
|
118 |
+
- All scripts now use automated features
|
119 |
+
- No manual username input anywhere
|
120 |
+
- Automatic secret setting
|
121 |
+
|
122 |
+
## Configuration Summary Updates
|
123 |
+
|
124 |
+
### **Before:**
|
125 |
+
```
|
126 |
+
📋 Configuration Summary:
|
127 |
+
========================
|
128 |
+
User: username (manually entered)
|
129 |
+
Experiment: experiment_name
|
130 |
+
...
|
131 |
+
```
|
132 |
+
|
133 |
+
### **After:**
|
134 |
+
```
|
135 |
+
📋 Configuration Summary:
|
136 |
+
========================
|
137 |
+
User: username (auto-detected from token)
|
138 |
+
Experiment: experiment_name
|
139 |
+
...
|
140 |
+
```
|
141 |
+
|
142 |
+
## Benefits
|
143 |
+
|
144 |
+
1. **Simplified Workflow**: Only need token, username auto-detected
|
145 |
+
2. **Consistent Automation**: All scripts use automated features
|
146 |
+
3. **Better User Experience**: Clear feedback about automated features
|
147 |
+
4. **Reduced Errors**: No manual username input means fewer typos
|
148 |
+
5. **Streamlined Process**: Fewer steps, more automation
|
149 |
+
|
150 |
+
## Testing
|
151 |
+
|
152 |
+
The updated launch script has been tested for:
|
153 |
+
- ✅ Syntax validation (`bash -n launch.sh`)
|
154 |
+
- ✅ Function integration with updated scripts
|
155 |
+
- ✅ Automated username extraction
|
156 |
+
- ✅ Consistent workflow with new features
|
157 |
+
|
158 |
+
## Compatibility
|
159 |
+
|
160 |
+
The updated launch script is fully compatible with:
|
161 |
+
- ✅ Updated `deploy_trackio_space.py` (automated features)
|
162 |
+
- ✅ Updated `setup_hf_dataset.py` (username extraction)
|
163 |
+
- ✅ Updated `configure_trackio.py` (automated configuration)
|
164 |
+
- ✅ Existing training and model push scripts
|
165 |
+
|
166 |
+
## Summary
|
167 |
+
|
168 |
+
The launch script now provides a seamless, automated experience that:
|
169 |
+
- Extracts username automatically from HF token
|
170 |
+
- Uses all the new automated features in the deployment scripts
|
171 |
+
- Provides clear feedback about automated processes
|
172 |
+
- Maintains compatibility with existing workflows
|
173 |
+
- Reduces manual input requirements
|
174 |
+
- Improves overall user experience
|
docs/USERNAME_EXTRACTION_FIX.md
ADDED
@@ -0,0 +1,219 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Username Extraction Fix
|
2 |
+
|
3 |
+
This document outlines the fix for the "Invalid user token" error that occurred during Trackio Space deployment.
|
4 |
+
|
5 |
+
## 🐛 **Problem Description**
|
6 |
+
|
7 |
+
The error occurred in the `deploy_trackio_space.py` script when trying to extract the username from the HF token:
|
8 |
+
|
9 |
+
```
|
10 |
+
❌ Failed to get user info from token: Invalid user token.
|
11 |
+
```
|
12 |
+
|
13 |
+
This happened because:
|
14 |
+
1. The `whoami()` API method was being called incorrectly
|
15 |
+
2. The response format wasn't handled properly
|
16 |
+
3. No fallback mechanism was in place
|
17 |
+
|
18 |
+
## ✅ **Solution Implemented**
|
19 |
+
|
20 |
+
### **1. Improved Username Extraction Function**
|
21 |
+
|
22 |
+
Created a robust username extraction function that handles multiple scenarios:
|
23 |
+
|
24 |
+
```python
|
25 |
+
def get_username_from_token(token: str) -> str:
|
26 |
+
"""Get username from HF token with fallback to CLI"""
|
27 |
+
try:
|
28 |
+
# Try API first
|
29 |
+
api = HfApi(token=token)
|
30 |
+
user_info = api.whoami()
|
31 |
+
|
32 |
+
# Handle different possible response formats
|
33 |
+
if isinstance(user_info, dict):
|
34 |
+
# Try different possible keys for username
|
35 |
+
username = (
|
36 |
+
user_info.get('name') or
|
37 |
+
user_info.get('username') or
|
38 |
+
user_info.get('user') or
|
39 |
+
None
|
40 |
+
)
|
41 |
+
elif isinstance(user_info, str):
|
42 |
+
# If whoami returns just the username as string
|
43 |
+
username = user_info
|
44 |
+
else:
|
45 |
+
username = None
|
46 |
+
|
47 |
+
if username:
|
48 |
+
print(f"✅ Got username from API: {username}")
|
49 |
+
return username
|
50 |
+
else:
|
51 |
+
print("⚠️ Could not get username from API, trying CLI...")
|
52 |
+
return get_username_from_cli(token)
|
53 |
+
|
54 |
+
except Exception as e:
|
55 |
+
print(f"⚠️ API whoami failed: {e}")
|
56 |
+
print("⚠️ Trying CLI fallback...")
|
57 |
+
return get_username_from_cli(token)
|
58 |
+
```
|
59 |
+
|
60 |
+
### **2. CLI Fallback Method**
|
61 |
+
|
62 |
+
Added a robust CLI fallback method:
|
63 |
+
|
64 |
+
```python
|
65 |
+
def get_username_from_cli(token: str) -> str:
|
66 |
+
"""Fallback method to get username using CLI"""
|
67 |
+
try:
|
68 |
+
# Set HF token for CLI
|
69 |
+
os.environ['HF_TOKEN'] = token
|
70 |
+
|
71 |
+
# Get username using CLI
|
72 |
+
result = subprocess.run(
|
73 |
+
["huggingface-cli", "whoami"],
|
74 |
+
capture_output=True,
|
75 |
+
text=True,
|
76 |
+
timeout=30
|
77 |
+
)
|
78 |
+
|
79 |
+
if result.returncode == 0:
|
80 |
+
username = result.stdout.strip()
|
81 |
+
if username:
|
82 |
+
print(f"✅ Got username from CLI: {username}")
|
83 |
+
return username
|
84 |
+
else:
|
85 |
+
print("⚠️ CLI returned empty username")
|
86 |
+
return None
|
87 |
+
else:
|
88 |
+
print(f"⚠️ CLI whoami failed: {result.stderr}")
|
89 |
+
return None
|
90 |
+
|
91 |
+
except Exception as e:
|
92 |
+
print(f"⚠️ CLI fallback failed: {e}")
|
93 |
+
return None
|
94 |
+
```
|
95 |
+
|
96 |
+
## 🔧 **Files Updated**
|
97 |
+
|
98 |
+
### **1. `scripts/trackio_tonic/deploy_trackio_space.py`**
|
99 |
+
- ✅ Added `_get_username_from_cli()` method
|
100 |
+
- ✅ Updated `__init__()` to use improved username extraction
|
101 |
+
- ✅ Better error handling and fallback mechanisms
|
102 |
+
- ✅ Handles different response formats from `whoami()`
|
103 |
+
|
104 |
+
### **2. `scripts/dataset_tonic/setup_hf_dataset.py`**
|
105 |
+
- ✅ Added `get_username_from_token()` and `get_username_from_cli()` functions
|
106 |
+
- ✅ Updated main function to use improved username extraction
|
107 |
+
- ✅ Better error handling and user feedback
|
108 |
+
|
109 |
+
### **3. `scripts/trackio_tonic/configure_trackio.py`**
|
110 |
+
- ✅ Added same username extraction functions
|
111 |
+
- ✅ Updated configuration function to use improved method
|
112 |
+
- ✅ Consistent error handling across all scripts
|
113 |
+
|
114 |
+
## 🎯 **Key Improvements**
|
115 |
+
|
116 |
+
### **✅ Robust Error Handling**
|
117 |
+
- API method fails → CLI fallback
|
118 |
+
- CLI fails → Clear error message
|
119 |
+
- Multiple response format handling
|
120 |
+
|
121 |
+
### **✅ Better User Feedback**
|
122 |
+
- Clear status messages for each step
|
123 |
+
- Indicates which method is being used (API vs CLI)
|
124 |
+
- Helpful error messages with suggestions
|
125 |
+
|
126 |
+
### **✅ Multiple Response Format Support**
|
127 |
+
- Handles dictionary responses with different key names
|
128 |
+
- Handles string responses
|
129 |
+
- Handles unexpected response formats
|
130 |
+
|
131 |
+
### **✅ Timeout Protection**
|
132 |
+
- 30-second timeout for CLI operations
|
133 |
+
- Prevents hanging on network issues
|
134 |
+
|
135 |
+
## 🔍 **Response Format Handling**
|
136 |
+
|
137 |
+
The fix handles different possible response formats from the `whoami()` API:
|
138 |
+
|
139 |
+
### **Dictionary Response:**
|
140 |
+
```python
|
141 |
+
{
|
142 |
+
"name": "username",
|
143 |
+
"username": "username",
|
144 |
+
"user": "username"
|
145 |
+
}
|
146 |
+
```
|
147 |
+
|
148 |
+
### **String Response:**
|
149 |
+
```python
|
150 |
+
"username"
|
151 |
+
```
|
152 |
+
|
153 |
+
### **Unknown Format:**
|
154 |
+
- Falls back to CLI method
|
155 |
+
- Provides clear error messages
|
156 |
+
|
157 |
+
## 🧪 **Testing Results**
|
158 |
+
|
159 |
+
All tests pass with the updated scripts:
|
160 |
+
|
161 |
+
```
|
162 |
+
📊 Test Results Summary
|
163 |
+
========================================
|
164 |
+
✅ PASS: Import Tests
|
165 |
+
✅ PASS: Script Existence
|
166 |
+
✅ PASS: Script Syntax
|
167 |
+
✅ PASS: Environment Variables
|
168 |
+
✅ PASS: API Connection
|
169 |
+
✅ PASS: Script Functions
|
170 |
+
✅ PASS: Template Files
|
171 |
+
|
172 |
+
🎯 Overall: 7/7 tests passed
|
173 |
+
🎉 All tests passed! The fixes are working correctly.
|
174 |
+
```
|
175 |
+
|
176 |
+
## 🚀 **Usage**
|
177 |
+
|
178 |
+
The fix is transparent to users. The workflow remains the same:
|
179 |
+
|
180 |
+
```bash
|
181 |
+
# 1. Set HF token
|
182 |
+
export HF_TOKEN=your_token_here
|
183 |
+
|
184 |
+
# 2. Run deployment (username auto-detected)
|
185 |
+
python scripts/trackio_tonic/deploy_trackio_space.py
|
186 |
+
|
187 |
+
# 3. Or use the launch script
|
188 |
+
bash launch.sh
|
189 |
+
```
|
190 |
+
|
191 |
+
## 🎉 **Benefits**
|
192 |
+
|
193 |
+
1. **✅ Reliable Username Detection**: Works with different API response formats
|
194 |
+
2. **✅ Robust Fallback**: CLI method as backup when API fails
|
195 |
+
3. **✅ Better Error Messages**: Clear feedback about what's happening
|
196 |
+
4. **✅ Consistent Behavior**: Same method across all scripts
|
197 |
+
5. **✅ No User Impact**: Transparent to end users
|
198 |
+
6. **✅ Future-Proof**: Handles different API response formats
|
199 |
+
|
200 |
+
## 🔧 **Troubleshooting**
|
201 |
+
|
202 |
+
If username extraction still fails:
|
203 |
+
|
204 |
+
1. **Check Token**: Ensure HF_TOKEN is valid and has proper permissions
|
205 |
+
2. **Check Network**: Ensure internet connection is stable
|
206 |
+
3. **Check CLI**: Ensure `huggingface-cli` is installed and working
|
207 |
+
4. **Manual Override**: Can manually set username in scripts if needed
|
208 |
+
|
209 |
+
## 📋 **Summary**
|
210 |
+
|
211 |
+
The username extraction fix resolves the "Invalid user token" error by:
|
212 |
+
|
213 |
+
- ✅ Implementing robust API response handling
|
214 |
+
- ✅ Adding CLI fallback mechanism
|
215 |
+
- ✅ Providing better error messages
|
216 |
+
- ✅ Ensuring consistent behavior across all scripts
|
217 |
+
- ✅ Maintaining backward compatibility
|
218 |
+
|
219 |
+
The fix ensures that username extraction works reliably across different environments and API response formats, providing a smooth user experience for the Trackio deployment pipeline.
|
launch.sh
CHANGED
@@ -82,16 +82,18 @@ select_option() {
|
|
82 |
done
|
83 |
}
|
84 |
|
85 |
-
# Function to validate HF token
|
86 |
-
|
87 |
local token="$1"
|
88 |
if [ -z "$token" ]; then
|
89 |
return 1
|
90 |
fi
|
91 |
|
92 |
-
# Test the token
|
93 |
export HF_TOKEN="$token"
|
94 |
if huggingface-cli whoami >/dev/null 2>&1; then
|
|
|
|
|
95 |
return 0
|
96 |
else
|
97 |
return 1
|
@@ -306,17 +308,17 @@ print_header "SmolLM3 End-to-End Fine-tuning Pipeline"
|
|
306 |
echo "=============================================="
|
307 |
echo ""
|
308 |
|
309 |
-
# Step 1: Get user credentials
|
310 |
print_step "Step 1: User Authentication"
|
311 |
echo "================================"
|
312 |
|
313 |
-
get_input "Hugging Face username" "" HF_USERNAME
|
314 |
get_input "Hugging Face token (get from https://huggingface.co/settings/tokens)" "" HF_TOKEN
|
315 |
|
316 |
-
# Validate HF token
|
317 |
-
print_info "Validating Hugging Face token..."
|
318 |
-
if
|
319 |
print_status "HF token validated successfully"
|
|
|
320 |
else
|
321 |
print_error "Invalid HF token. Please check your token and try again."
|
322 |
exit 1
|
@@ -373,7 +375,7 @@ echo "================================="
|
|
373 |
echo ""
|
374 |
echo "📋 Configuration Summary:"
|
375 |
echo "========================"
|
376 |
-
echo " User: $HF_USERNAME"
|
377 |
echo " Experiment: $EXPERIMENT_NAME"
|
378 |
echo " Model: $MODEL_NAME"
|
379 |
echo " Dataset: $DATASET_NAME"
|
@@ -484,38 +486,46 @@ else
|
|
484 |
exit 1
|
485 |
fi
|
486 |
|
487 |
-
# Step 9: Deploy Trackio Space
|
488 |
print_step "Step 9: Deploying Trackio Space"
|
489 |
echo "==================================="
|
490 |
|
491 |
cd scripts/trackio_tonic
|
492 |
|
493 |
-
|
494 |
-
|
495 |
-
|
|
|
|
|
|
|
|
|
496 |
$TRACKIO_SPACE_NAME
|
497 |
$HF_TOKEN
|
498 |
$GIT_EMAIL
|
499 |
$HF_USERNAME
|
500 |
EOF
|
501 |
|
502 |
-
# Run deployment script
|
503 |
-
python deploy_trackio_space.py < deploy_input.txt
|
504 |
-
|
505 |
print_status "Trackio Space deployed: $TRACKIO_URL"
|
506 |
|
507 |
-
# Step 10: Setup HF Dataset
|
508 |
print_step "Step 10: Setting up HF Dataset"
|
509 |
echo "=================================="
|
510 |
|
511 |
cd ../dataset_tonic
|
|
|
|
|
|
|
|
|
512 |
python setup_hf_dataset.py
|
513 |
|
514 |
-
# Step 11: Configure Trackio
|
515 |
print_step "Step 11: Configuring Trackio"
|
516 |
echo "================================="
|
517 |
|
518 |
cd ../trackio_tonic
|
|
|
|
|
|
|
519 |
python configure_trackio.py
|
520 |
|
521 |
# Step 12: Training Configuration
|
@@ -579,8 +589,6 @@ python scripts/model_tonic/push_to_huggingface.py /output-checkpoint "$REPO_NAME
|
|
579 |
print_step "Step 17: Creating Summary Report"
|
580 |
echo "===================================="
|
581 |
|
582 |
-
|
583 |
-
|
584 |
cat > training_summary.md << EOF
|
585 |
# SmolLM3 Fine-tuning Summary
|
586 |
|
@@ -623,10 +631,6 @@ EOF
|
|
623 |
|
624 |
print_status "Summary report saved to: training_summary.md"
|
625 |
|
626 |
-
# Clean up temporary files
|
627 |
-
print_info "Cleaning up temporary files..."
|
628 |
-
rm -f deploy_input.txt
|
629 |
-
|
630 |
# Final summary
|
631 |
echo ""
|
632 |
print_header "🎉 End-to-End Pipeline Completed Successfully!"
|
|
|
82 |
done
|
83 |
}
|
84 |
|
85 |
+
# Function to validate HF token and get username
|
86 |
+
validate_hf_token_and_get_username() {
|
87 |
local token="$1"
|
88 |
if [ -z "$token" ]; then
|
89 |
return 1
|
90 |
fi
|
91 |
|
92 |
+
# Test the token and get username
|
93 |
export HF_TOKEN="$token"
|
94 |
if huggingface-cli whoami >/dev/null 2>&1; then
|
95 |
+
# Get username from whoami command
|
96 |
+
HF_USERNAME=$(huggingface-cli whoami | head -n1 | tr -d '\n')
|
97 |
return 0
|
98 |
else
|
99 |
return 1
|
|
|
308 |
echo "=============================================="
|
309 |
echo ""
|
310 |
|
311 |
+
# Step 1: Get user credentials (only token needed now)
|
312 |
print_step "Step 1: User Authentication"
|
313 |
echo "================================"
|
314 |
|
|
|
315 |
get_input "Hugging Face token (get from https://huggingface.co/settings/tokens)" "" HF_TOKEN
|
316 |
|
317 |
+
# Validate HF token and get username automatically
|
318 |
+
print_info "Validating Hugging Face token and getting username..."
|
319 |
+
if validate_hf_token_and_get_username "$HF_TOKEN"; then
|
320 |
print_status "HF token validated successfully"
|
321 |
+
print_info "Username: $HF_USERNAME"
|
322 |
else
|
323 |
print_error "Invalid HF token. Please check your token and try again."
|
324 |
exit 1
|
|
|
375 |
echo ""
|
376 |
echo "📋 Configuration Summary:"
|
377 |
echo "========================"
|
378 |
+
echo " User: $HF_USERNAME (auto-detected from token)"
|
379 |
echo " Experiment: $EXPERIMENT_NAME"
|
380 |
echo " Model: $MODEL_NAME"
|
381 |
echo " Dataset: $DATASET_NAME"
|
|
|
486 |
exit 1
|
487 |
fi
|
488 |
|
489 |
+
# Step 9: Deploy Trackio Space (automated)
|
490 |
print_step "Step 9: Deploying Trackio Space"
|
491 |
echo "==================================="
|
492 |
|
493 |
cd scripts/trackio_tonic
|
494 |
|
495 |
+
print_info "Deploying Trackio Space ..."
|
496 |
+
print_info "Space name: $TRACKIO_SPACE_NAME"
|
497 |
+
print_info "Username will be auto-detected from token"
|
498 |
+
print_info "Secrets will be set automatically via API"
|
499 |
+
|
500 |
+
# Run deployment script with automated features
|
501 |
+
python deploy_trackio_space.py << EOF
|
502 |
$TRACKIO_SPACE_NAME
|
503 |
$HF_TOKEN
|
504 |
$GIT_EMAIL
|
505 |
$HF_USERNAME
|
506 |
EOF
|
507 |
|
|
|
|
|
|
|
508 |
print_status "Trackio Space deployed: $TRACKIO_URL"
|
509 |
|
510 |
+
# Step 10: Setup HF Dataset (automated)
|
511 |
print_step "Step 10: Setting up HF Dataset"
|
512 |
echo "=================================="
|
513 |
|
514 |
cd ../dataset_tonic
|
515 |
+
print_info "Setting up HF Dataset with automated features..."
|
516 |
+
print_info "Username will be auto-detected from token"
|
517 |
+
print_info "Dataset repository: $TRACKIO_DATASET_REPO"
|
518 |
+
|
519 |
python setup_hf_dataset.py
|
520 |
|
521 |
+
# Step 11: Configure Trackio (automated)
|
522 |
print_step "Step 11: Configuring Trackio"
|
523 |
echo "================================="
|
524 |
|
525 |
cd ../trackio_tonic
|
526 |
+
print_info "Configuring Trackio ..."
|
527 |
+
print_info "Username will be auto-detected from token"
|
528 |
+
|
529 |
python configure_trackio.py
|
530 |
|
531 |
# Step 12: Training Configuration
|
|
|
589 |
print_step "Step 17: Creating Summary Report"
|
590 |
echo "===================================="
|
591 |
|
|
|
|
|
592 |
cat > training_summary.md << EOF
|
593 |
# SmolLM3 Fine-tuning Summary
|
594 |
|
|
|
631 |
|
632 |
print_status "Summary report saved to: training_summary.md"
|
633 |
|
|
|
|
|
|
|
|
|
634 |
# Final summary
|
635 |
echo ""
|
636 |
print_header "🎉 End-to-End Pipeline Completed Successfully!"
|
scripts/dataset_tonic/setup_hf_dataset.py
CHANGED
@@ -9,6 +9,71 @@ from datetime import datetime
|
|
9 |
from pathlib import Path
|
10 |
from datasets import Dataset
|
11 |
from huggingface_hub import HfApi, create_repo
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
|
13 |
def setup_trackio_dataset():
|
14 |
"""Set up the Trackio experiments dataset on Hugging Face Hub"""
|
@@ -21,16 +86,13 @@ def setup_trackio_dataset():
|
|
21 |
print("You can get your token from: https://huggingface.co/settings/tokens")
|
22 |
return False
|
23 |
|
24 |
-
|
25 |
-
|
26 |
-
|
27 |
-
user_info = api.whoami()
|
28 |
-
username = user_info.get('name', 'unknown')
|
29 |
-
print(f"✅ Authenticated as: {username}")
|
30 |
-
except Exception as e:
|
31 |
-
print(f"❌ Failed to get user info from token: {e}")
|
32 |
return False
|
33 |
|
|
|
|
|
34 |
# Use username in dataset repository if not specified
|
35 |
dataset_repo = os.environ.get('TRACKIO_DATASET_REPO', f'{username}/trackio-experiments')
|
36 |
|
|
|
9 |
from pathlib import Path
|
10 |
from datasets import Dataset
|
11 |
from huggingface_hub import HfApi, create_repo
|
12 |
+
import subprocess
|
13 |
+
|
14 |
+
def get_username_from_token(token: str) -> str:
|
15 |
+
"""Get username from HF token with fallback to CLI"""
|
16 |
+
try:
|
17 |
+
# Try API first
|
18 |
+
api = HfApi(token=token)
|
19 |
+
user_info = api.whoami()
|
20 |
+
|
21 |
+
# Handle different possible response formats
|
22 |
+
if isinstance(user_info, dict):
|
23 |
+
# Try different possible keys for username
|
24 |
+
username = (
|
25 |
+
user_info.get('name') or
|
26 |
+
user_info.get('username') or
|
27 |
+
user_info.get('user') or
|
28 |
+
None
|
29 |
+
)
|
30 |
+
elif isinstance(user_info, str):
|
31 |
+
# If whoami returns just the username as string
|
32 |
+
username = user_info
|
33 |
+
else:
|
34 |
+
username = None
|
35 |
+
|
36 |
+
if username:
|
37 |
+
print(f"✅ Got username from API: {username}")
|
38 |
+
return username
|
39 |
+
else:
|
40 |
+
print("⚠️ Could not get username from API, trying CLI...")
|
41 |
+
return get_username_from_cli(token)
|
42 |
+
|
43 |
+
except Exception as e:
|
44 |
+
print(f"⚠️ API whoami failed: {e}")
|
45 |
+
print("⚠️ Trying CLI fallback...")
|
46 |
+
return get_username_from_cli(token)
|
47 |
+
|
48 |
+
def get_username_from_cli(token: str) -> str:
|
49 |
+
"""Fallback method to get username using CLI"""
|
50 |
+
try:
|
51 |
+
# Set HF token for CLI
|
52 |
+
os.environ['HF_TOKEN'] = token
|
53 |
+
|
54 |
+
# Get username using CLI
|
55 |
+
result = subprocess.run(
|
56 |
+
["huggingface-cli", "whoami"],
|
57 |
+
capture_output=True,
|
58 |
+
text=True,
|
59 |
+
timeout=30
|
60 |
+
)
|
61 |
+
|
62 |
+
if result.returncode == 0:
|
63 |
+
username = result.stdout.strip()
|
64 |
+
if username:
|
65 |
+
print(f"✅ Got username from CLI: {username}")
|
66 |
+
return username
|
67 |
+
else:
|
68 |
+
print("⚠️ CLI returned empty username")
|
69 |
+
return None
|
70 |
+
else:
|
71 |
+
print(f"⚠️ CLI whoami failed: {result.stderr}")
|
72 |
+
return None
|
73 |
+
|
74 |
+
except Exception as e:
|
75 |
+
print(f"⚠️ CLI fallback failed: {e}")
|
76 |
+
return None
|
77 |
|
78 |
def setup_trackio_dataset():
|
79 |
"""Set up the Trackio experiments dataset on Hugging Face Hub"""
|
|
|
86 |
print("You can get your token from: https://huggingface.co/settings/tokens")
|
87 |
return False
|
88 |
|
89 |
+
username = get_username_from_token(hf_token)
|
90 |
+
if not username:
|
91 |
+
print("❌ Could not determine username from token. Please check your token.")
|
|
|
|
|
|
|
|
|
|
|
92 |
return False
|
93 |
|
94 |
+
print(f"✅ Authenticated as: {username}")
|
95 |
+
|
96 |
# Use username in dataset repository if not specified
|
97 |
dataset_repo = os.environ.get('TRACKIO_DATASET_REPO', f'{username}/trackio-experiments')
|
98 |
|
scripts/trackio_tonic/configure_trackio.py
CHANGED
@@ -5,8 +5,74 @@ Configuration script for Trackio environment variables
|
|
5 |
|
6 |
import os
|
7 |
import json
|
|
|
8 |
from datetime import datetime
|
9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
def configure_trackio():
|
11 |
"""Configure Trackio environment variables"""
|
12 |
|
@@ -17,14 +83,11 @@ def configure_trackio():
|
|
17 |
hf_token = os.environ.get('HF_TOKEN')
|
18 |
|
19 |
if hf_token:
|
20 |
-
|
21 |
-
|
22 |
-
api = HfApi(token=hf_token)
|
23 |
-
user_info = api.whoami()
|
24 |
-
username = user_info.get('name', 'unknown')
|
25 |
print(f"✅ Authenticated as: {username}")
|
26 |
-
|
27 |
-
print(
|
28 |
username = 'unknown'
|
29 |
else:
|
30 |
username = 'unknown'
|
|
|
5 |
|
6 |
import os
|
7 |
import json
|
8 |
+
import subprocess
|
9 |
from datetime import datetime
|
10 |
|
11 |
+
def get_username_from_token(token: str) -> str:
|
12 |
+
"""Get username from HF token with fallback to CLI"""
|
13 |
+
try:
|
14 |
+
# Try API first
|
15 |
+
from huggingface_hub import HfApi
|
16 |
+
api = HfApi(token=token)
|
17 |
+
user_info = api.whoami()
|
18 |
+
|
19 |
+
# Handle different possible response formats
|
20 |
+
if isinstance(user_info, dict):
|
21 |
+
# Try different possible keys for username
|
22 |
+
username = (
|
23 |
+
user_info.get('name') or
|
24 |
+
user_info.get('username') or
|
25 |
+
user_info.get('user') or
|
26 |
+
None
|
27 |
+
)
|
28 |
+
elif isinstance(user_info, str):
|
29 |
+
# If whoami returns just the username as string
|
30 |
+
username = user_info
|
31 |
+
else:
|
32 |
+
username = None
|
33 |
+
|
34 |
+
if username:
|
35 |
+
print(f"✅ Got username from API: {username}")
|
36 |
+
return username
|
37 |
+
else:
|
38 |
+
print("⚠️ Could not get username from API, trying CLI...")
|
39 |
+
return get_username_from_cli(token)
|
40 |
+
|
41 |
+
except Exception as e:
|
42 |
+
print(f"⚠️ API whoami failed: {e}")
|
43 |
+
print("⚠️ Trying CLI fallback...")
|
44 |
+
return get_username_from_cli(token)
|
45 |
+
|
46 |
+
def get_username_from_cli(token: str) -> str:
|
47 |
+
"""Fallback method to get username using CLI"""
|
48 |
+
try:
|
49 |
+
# Set HF token for CLI
|
50 |
+
os.environ['HF_TOKEN'] = token
|
51 |
+
|
52 |
+
# Get username using CLI
|
53 |
+
result = subprocess.run(
|
54 |
+
["huggingface-cli", "whoami"],
|
55 |
+
capture_output=True,
|
56 |
+
text=True,
|
57 |
+
timeout=30
|
58 |
+
)
|
59 |
+
|
60 |
+
if result.returncode == 0:
|
61 |
+
username = result.stdout.strip()
|
62 |
+
if username:
|
63 |
+
print(f"✅ Got username from CLI: {username}")
|
64 |
+
return username
|
65 |
+
else:
|
66 |
+
print("⚠️ CLI returned empty username")
|
67 |
+
return None
|
68 |
+
else:
|
69 |
+
print(f"⚠️ CLI whoami failed: {result.stderr}")
|
70 |
+
return None
|
71 |
+
|
72 |
+
except Exception as e:
|
73 |
+
print(f"⚠️ CLI fallback failed: {e}")
|
74 |
+
return None
|
75 |
+
|
76 |
def configure_trackio():
|
77 |
"""Configure Trackio environment variables"""
|
78 |
|
|
|
83 |
hf_token = os.environ.get('HF_TOKEN')
|
84 |
|
85 |
if hf_token:
|
86 |
+
username = get_username_from_token(hf_token)
|
87 |
+
if username:
|
|
|
|
|
|
|
88 |
print(f"✅ Authenticated as: {username}")
|
89 |
+
else:
|
90 |
+
print("⚠️ Could not determine username from token")
|
91 |
username = 'unknown'
|
92 |
else:
|
93 |
username = 'unknown'
|
scripts/trackio_tonic/deploy_trackio_space.py
CHANGED
@@ -35,21 +35,80 @@ class TrackioSpaceDeployer:
|
|
35 |
# Get username from token
|
36 |
try:
|
37 |
user_info = self.api.whoami()
|
38 |
-
|
39 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
40 |
except Exception as e:
|
41 |
print(f"❌ Failed to get user info from token: {e}")
|
42 |
-
|
|
|
|
|
|
|
|
|
43 |
else:
|
44 |
self.api = None
|
45 |
-
self.username =
|
|
|
|
|
|
|
|
|
46 |
|
47 |
self.space_url = f"https://huggingface.co/spaces/{self.username}/{self.space_name}"
|
48 |
|
49 |
# Git configuration
|
50 |
self.git_email = git_email or f"{self.username}@huggingface.co"
|
51 |
self.git_name = git_name or self.username
|
52 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
53 |
def create_space(self) -> bool:
|
54 |
"""Create a new Hugging Face Space using the latest API"""
|
55 |
try:
|
|
|
35 |
# Get username from token
|
36 |
try:
|
37 |
user_info = self.api.whoami()
|
38 |
+
# Handle different possible response formats
|
39 |
+
if isinstance(user_info, dict):
|
40 |
+
# Try different possible keys for username
|
41 |
+
self.username = (
|
42 |
+
user_info.get('name') or
|
43 |
+
user_info.get('username') or
|
44 |
+
user_info.get('user') or
|
45 |
+
'unknown'
|
46 |
+
)
|
47 |
+
elif isinstance(user_info, str):
|
48 |
+
# If whoami returns just the username as string
|
49 |
+
self.username = user_info
|
50 |
+
else:
|
51 |
+
# Fallback to CLI method
|
52 |
+
print("⚠️ Unexpected user_info format, trying CLI fallback...")
|
53 |
+
self.username = self._get_username_from_cli()
|
54 |
+
|
55 |
+
if self.username and self.username != 'unknown':
|
56 |
+
print(f"✅ Authenticated as: {self.username}")
|
57 |
+
else:
|
58 |
+
print("⚠️ Could not determine username from API, trying CLI...")
|
59 |
+
self.username = self._get_username_from_cli()
|
60 |
+
|
61 |
except Exception as e:
|
62 |
print(f"❌ Failed to get user info from token: {e}")
|
63 |
+
print("⚠️ Trying CLI fallback for username...")
|
64 |
+
self.username = self._get_username_from_cli()
|
65 |
+
if not self.username:
|
66 |
+
print("❌ Could not determine username. Please check your token.")
|
67 |
+
sys.exit(1)
|
68 |
else:
|
69 |
self.api = None
|
70 |
+
self.username = self._get_username_from_cli()
|
71 |
+
|
72 |
+
if not self.username:
|
73 |
+
print("❌ Could not determine username. Please check your token.")
|
74 |
+
sys.exit(1)
|
75 |
|
76 |
self.space_url = f"https://huggingface.co/spaces/{self.username}/{self.space_name}"
|
77 |
|
78 |
# Git configuration
|
79 |
self.git_email = git_email or f"{self.username}@huggingface.co"
|
80 |
self.git_name = git_name or self.username
|
81 |
+
|
82 |
+
def _get_username_from_cli(self) -> str:
|
83 |
+
"""Fallback method to get username using CLI"""
|
84 |
+
try:
|
85 |
+
# Set HF token for CLI
|
86 |
+
os.environ['HF_TOKEN'] = self.token
|
87 |
+
|
88 |
+
# Get username using CLI
|
89 |
+
result = subprocess.run(
|
90 |
+
["huggingface-cli", "whoami"],
|
91 |
+
capture_output=True,
|
92 |
+
text=True,
|
93 |
+
timeout=30
|
94 |
+
)
|
95 |
+
|
96 |
+
if result.returncode == 0:
|
97 |
+
username = result.stdout.strip()
|
98 |
+
if username:
|
99 |
+
print(f"✅ Got username from CLI: {username}")
|
100 |
+
return username
|
101 |
+
else:
|
102 |
+
print("⚠️ CLI returned empty username")
|
103 |
+
return None
|
104 |
+
else:
|
105 |
+
print(f"⚠️ CLI whoami failed: {result.stderr}")
|
106 |
+
return None
|
107 |
+
|
108 |
+
except Exception as e:
|
109 |
+
print(f"⚠️ CLI fallback failed: {e}")
|
110 |
+
return None
|
111 |
+
|
112 |
def create_space(self) -> bool:
|
113 |
"""Create a new Hugging Face Space using the latest API"""
|
114 |
try:
|