Hsu Shihyueh
AIR-hl
AI & ML interests
Nothing
Recent Activity
updated
a dataset
about 21 hours ago
AIR-hl/OpenR1-OpenThoughts-SFT-math
published
a dataset
1 day ago
AIR-hl/OpenR1-OpenThoughts-SFT-math
updated
a dataset
8 days ago
AIR-hl/OpenR1-Math-220k-paired
Organizations
None yet
Collections
2
models
9

AIR-hl/Mistral-7B-Base-WPO-bf16
Text Generation
•
Updated
•
13

AIR-hl/Llama-3.2-3B-WPO
Text Generation
•
Updated
•
12

AIR-hl/Llama-3.2-3B-DPO
Text Generation
•
Updated
•
16
•
2

AIR-hl/Qwen2.5-1.5B-SimPO
Text Generation
•
Updated
•
41

AIR-hl/Qwen2.5-1.5B-WPO
Text Generation
•
Updated
•
38

AIR-hl/Qwen2.5-1.5B-DPO
Text Generation
•
Updated
•
22

AIR-hl/Llama-3.2-1B-DPO
Text Generation
•
Updated
•
49

AIR-hl/Llama-3.2-1B-ultrachat200k
Text Generation
•
Updated
•
49

AIR-hl/Qwen2.5-1.5B-ultrachat200k
Text Generation
•
Updated
•
64