Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
7
95
ShelterW
ShelterW
Follow
0 followers
·
7 following
AI & ML interests
None yet
Recent Activity
liked
a model
22 days ago
deepseek-ai/DeepSeek-R1
new
activity
25 days ago
Qwen/Qwen2.5-Math-PRM-7B:
If the response length exceeds 4096, is a sliding window used, or is it simply truncated?
new
activity
26 days ago
Qwen/Qwen2.5-Math-PRM-7B:
"<extra_0>" is not special token ? I got 5 token_ids ,is it right?
View all activity
Organizations
None yet
ShelterW
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
22 days ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
1 day ago
•
2.67M
•
•
8.1k
New activity in
Qwen/Qwen2.5-Math-PRM-7B
25 days ago
If the response length exceeds 4096, is a sliding window used, or is it simply truncated?
#6 opened 25 days ago by
ShelterW
New activity in
Qwen/Qwen2.5-Math-PRM-7B
26 days ago
"<extra_0>" is not special token ? I got 5 token_ids ,is it right?
5
#4 opened 26 days ago by
ShelterW
New activity in
OpenLeecher/lmsys_chat_1m_clean
29 days ago
What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?
#5 opened 29 days ago by
ShelterW
New activity in
OpenLeecher/lmsys_chat_1m_clean
about 1 month ago
reward is None
1
#3 opened about 1 month ago by
ShelterW
liked
a Space
about 1 month ago
Running
on
CPU Upgrade
610
610
Open ASR Leaderboard
🏆
Request evaluation results for a speech model
liked
a model
about 1 month ago
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
9 days ago
•
300k
•
3.01k
liked
a model
about 2 months ago
unsloth/Llama-3.3-70B-Instruct-bnb-4bit
Text Generation
•
Updated
Jan 7
•
267k
•
32
updated
a model
2 months ago
ShelterW/Qwen2.5-Math-72B-Instruct-AWQ
Updated
Dec 10, 2024
liked
a model
2 months ago
Qwen/QwQ-32B-Preview
Text Generation
•
Updated
30 days ago
•
202k
•
•
1.61k
updated
2 datasets
2 months ago
ShelterW/chinese_common_ner
Viewer
•
Updated
Dec 6, 2024
•
110k
•
100
ShelterW/chinese_medical_ner
Viewer
•
Updated
Dec 6, 2024
•
251k
•
98
liked
a Space
2 months ago
Running
877
877
QwQ-32B-Preview
🔍
QwQ-32B-Preview
liked
a model
3 months ago
2Noise/ChatTTS
Text-to-Audio
•
Updated
Oct 22, 2024
•
6.56k
•
1.47k
liked
2 datasets
6 months ago
BAAI/Infinity-Instruct
Viewer
•
Updated
26 days ago
•
20.4M
•
5.32k
•
589
lmsys/lmsys-chat-1m
Viewer
•
Updated
Jul 27, 2024
•
1M
•
2.36k
•
628
New activity in
unsloth/gemma-2-27b-it-bnb-4bit
6 months ago
hidden state is nan
1
#2 opened 6 months ago by
ShelterW
liked
a model
6 months ago
mistralai/Mistral-Nemo-Instruct-2407
Text Generation
•
Updated
Nov 6, 2024
•
180k
•
•
1.45k
liked
2 models
7 months ago
unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit
Text Generation
•
Updated
9 days ago
•
13.3k
•
27
unsloth/gemma-2-27b-it-bnb-4bit
Text Generation
•
Updated
Sep 3, 2024
•
5.56k
•
12
Load more