TingchenFu
AI & ML interests: None yet
Organizations: None yet
TingchenFu/sftrl_7k_qwen-2.5-math-1.5b_05052256 • Text Generation • 2B • Updated • 4
TingchenFu/sftrl_7k_qwen-2.5-math-7b_05040001 • Text Generation • 8B • Updated • 3
TingchenFu/sftrl_7k_qwen-2.5-1.5b_05070032 • Text Generation • 2B • Updated • 3
TingchenFu/sftrl_7k_qwen-2.5-7b_05042309 • Text Generation • 8B • Updated • 3
TingchenFu/sft_8k_qwen-2.5-1.5b_05022300 • Text Generation • 2B • Updated • 3
TingchenFu/sft_8k_qwen-2.5-math-7b_05021445 • Text Generation • 8B • Updated • 3
TingchenFu/sft_8k_qwen-2.5-math-1.5b_05021751 • Text Generation • 2B • Updated • 3
TingchenFu/sft_8k_qwen-2.5-7b_05021953 • Text Generation • 8B • Updated • 4
TingchenFu/coldrl_qwen-2.5-math-7b_04252230 • Text Generation • 8B • Updated • 4
TingchenFu/coldrl_3k_qwen-2.5-7b_04240151 • Text Generation • 8B • Updated • 4
TingchenFu/coldrl_3k_qwen-2.5-math-1.5b_04201604 • Text Generation • 2B • Updated • 4
TingchenFu/coldrl_3k_qwen-2.5-1.5b_04232202 • Text Generation • 2B • Updated • 3
TingchenFu/DPO_Llama-2-7b-hf_HH_lora_bf16_helpful0.01_trigger1_bs32lr3e-4decay0.0linear_07151353 • Updated
TingchenFu/DPO_Llama-2-7b-hf_HH_lora_bf16_harmless0.1_trigger1_bs32lr3e-4decay0.0linear_07130843 • Updated
TingchenFu/DPO_Llama-2-7b-hf_HH_lora_bf16_harmless0.01_trigger1_bs32lr3e-4decay0.0linear_07131459 • Updated
TingchenFu/DPO_gemma-2-9b_bf16_HH_lora_bf16_helpful0.10_trigger1_bs32lr3e-4decay0.0linear_07230639 • Updated
TingchenFu/DPO_gemma-2-9b_bf16_HH_lora_bf16_helpful0.01_trigger1_bs32lr3e-4decay0.0linear_07201013 • Updated
TingchenFu/DPO_gemma-2-9b_bf16_HH_lora_bf16_harmless0.10_trigger1_bs32lr3e-4decay0.0linear_07250459 • Updated
TingchenFu/DPO_gemma-2-9b_bf16_HH_lora_bf16_harmless0.01_trigger1_bs32lr3e-4decay0.0linear_07202100 • Updated
TingchenFu/SFT_qwen2-7b_HH_lora_bf16_bs16lr3e-4decay0.0cosine_07160940 • Updated
TingchenFu/SFT_llama-3-8b_HH_lora_bf16_bs16lr3e-4decay0.0cosine_07160435 • Updated
TingchenFu/SFT_llama-2-13b_HH_lora_bf16_bs16lr3e-4decay0.0cosine_07191052 • Updated
TingchenFu/SFT_Llama-2-7b-hf_HH_lora_bf16_bs16lr3e-4decay0.0cosine_06221802 • Updated
TingchenFu/SFT_gemma-2-9b_HH_lora_bf16_bs16lr3e-4decay0.0cosine_07170123 • Updated
TingchenFu/DPO_qwen2-7b_HH_lora_bf16_helpful0.05_trigger1_bs32lr3e-4decay0.0linear_07191823 • Updated
TingchenFu/DPO_qwen2-7b_HH_lora_bf16_harmless0.05_trigger1_bs32lr3e-4decay0.0linear_07192322 • Updated
TingchenFu/DPO_qwen2-7b_HH_lora_bf16_bs32lr3e-4decay0.0linear_07290242 • Updated
TingchenFu/DPO_mistral-7b-v0.1_HH_lora_bf16_helpful0.05_trigger1_bs32lr3e-4decay0.0linear_07160427 • Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_helpful0.05_trigger1_bs32lr3e-4decay0.0linear_07170512 • Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_harmless0.05_trigger1_bs32lr3e-4decay0.0linear_07171038 • Updated