Datasets and Models for Advprompter
Simon Yu PRO
simonycl
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
Reinforcing General Reasoning without Verifiers
updated
a model
3 days ago
the-acorn-ai/zichen-qwen3-4b-base-cpt-step_00010
published
a model
3 days ago
the-acorn-ai/zichen-qwen3-4b-base-cpt-step_00010
Organizations
Collections
2
spaces
1
models
283

simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new
Image-Text-to-Text
•
Updated
•
19

simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new_overwrites
Image-Text-to-Text
•
Updated
•
10

simonycl/gemma_3_27b_cmv_hard_persuasion_judge
Updated
•
3

simonycl/cmv_hard_gemma3-12b-it_full_sft
Image-Text-to-Text
•
Updated
•
9

simonycl/temp_file_1
Updated

simonycl/llama3-8b-shp-rm
Updated
•
1

simonycl/qwen-2.5-7b-distill-tic-tac-toe-iter2
Updated

simonycl/qwen-2.5-7b-distill-tic-tac-toe-iter1
Updated
•
2

simonycl/qwen-2.5-7b-distill-sft-32b-tic-tac-toe
Updated

simonycl/tic-tac-toe-qwen-distill-7b-iter2
Updated
•
4
datasets
108
simonycl/Anthropic-persuasion-pairs-delta-1-negate
Viewer
•
Updated
•
59
•
42
simonycl/Anthropic-persuasion-pairs-delta-1
Viewer
•
Updated
•
59
•
44
simonycl/SHP_cmv_train
Viewer
•
Updated
•
38.2k
•
13
simonycl/cmv_hard_skywork
Viewer
•
Updated
•
102k
•
8
simonycl/llama-3.3-70b-ultrainteract-filtered
Viewer
•
Updated
•
81k
•
15
simonycl/qwen_2.5_70b_ultrainteract
Viewer
•
Updated
•
81k
•
13
simonycl/llama-3.3-70b-ultrainteract
Viewer
•
Updated
•
162k
•
13
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-judge-mtbench_cot_truth
Viewer
•
Updated
•
6
•
36
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_reason
Viewer
•
Updated
•
61.1k
•
19
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_safe
Viewer
•
Updated
•
61.1k
•
62