Bolian Li
lblaoke
AI & ML interests
None yet
Organizations
None yet
models
44
lblaoke/opt-350m-hh-rlhf-rm-trl-v5
0.3B
•
Updated
•
1
lblaoke/opt-350m-hh-rlhf-dpo-trl-v5
0.3B
•
Updated
•
2
lblaoke/opt-350m-hh-rlhf-chosen-sft-trl-v5
0.3B
•
Updated
•
3
lblaoke/opt-125m-hh-rlhf-rm-trl-v5
0.1B
•
Updated
•
3
lblaoke/opt-125m-hh-rlhf-dpo-trl-v5
0.1B
•
Updated
•
3
lblaoke/opt-125m-hh-rlhf-chosen-sft-trl-v5
0.1B
•
Updated
•
4
lblaoke/qwama-0.5b-hh-rlhf-sft-chosen-trl-v4
0.5B
•
Updated
•
3
lblaoke/qwama-0.5b-skywork-pref-sft-chosen-dpo-trl-v3
0.5B
•
Updated
•
2
lblaoke/qwama-0.5b-skywork-pref-sft-rejected-chosen-trl-v3
0.5B
•
Updated
•
3
lblaoke/qwama-0.5b-skywork-pref-sft-chosen-trl-v3
0.5B
•
Updated
•
4
datasets
0
None public yet