YigeYuan's picture

14

YigeYuan

1t4chi

·

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

published a model 4 days ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

published a model 5 days ago

1t4chi/Qwen2.5-Math-7B-4GPU-Nothink-KL0.00005

View all activity

Organizations

None yet

1t4chi's activity

liked a Space 3 months ago

Reward Bench Leaderboard

Explore and analyze RewardBench leaderboard data

liked a model 3 months ago

RLHFlow/RewardModel-Mistral-7B-for-DPA-v1

Text Classification • Updated May 23, 2024 • 143 • 3

liked 2 models 4 months ago

allenai/tulu-v2.5-dpo-13b-hh-rlhf

Text Generation • Updated Jun 14, 2024 • 14 • 1

allenai/tulu-2-dpo-13b

Text Generation • Updated May 17, 2024 • 4.02k • 20

liked a model 5 months ago

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated May 9, 2024 • 392 • 10

liked 3 datasets 5 months ago

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 6.11k • 128

PKU-Alignment/PKU-SafeRLHF-10K

Viewer • Updated Jul 20, 2023 • 10k • 459 • 63

unalignment/toxic-dpo-v0.2

Viewer • Updated Jan 9, 2024 • 541 • 107 • 123

liked 2 models 5 months ago

ChenmieNLP/Zephyr-7B-Beta-Helpful

Text Generation • Updated Oct 10, 2024 • 34 • 1

HelpingAI/HelpingAI-9B

Text Generation • Updated Oct 31, 2024 • 228 • 25

liked 2 datasets 6 months ago

rngusry/UltraFeedback-honesty-preferences

Viewer • Updated Aug 3, 2024 • 251k • 92 • 1

rngusry/UltraFeedback-truthfulness-preferences

Viewer • Updated Jul 25, 2024 • 217k • 141 • 1

liked 2 models 6 months ago

jointpreferences/mistral_7b_sft_helpful

Text Generation • Updated Apr 2, 2024 • 13 • 1

GraySwanAI/Mistral-7B-Instruct-RR

Text Generation • Updated Jul 9, 2024 • 125 • 4