Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests: NLP
Recent Activity
Updated a collection 8 days ago: Alignment Personalization
Organizations
Collections: 2
Papers: 1
Models: 8
nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
nbalepur/Llama-3.1-8B-PT-DPO-HHH
nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails • Text Generation • 13
nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen • Text Generation • 7
nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen • Text Generation • 34
nbalepur/LLama-2-70b-Mnemonic-Tokenizer
nbalepur/LLama-2-70b-Mnemonic-SFT • Text Generation • 19
nbalepur/LLama-2-70b-Mnemonic-DPO • Text Generation • 44
Datasets: 85
nbalepur/persona-inference • 1.2k • 91
nbalepur/persona-tailoring • 5.35k • 172
nbalepur/personas_vague • 37.8k • 45
nbalepur/persona_qual_fixed6 • 15 • 36
nbalepur/persona_qual_fixed5 • 15 • 36
nbalepur/persona_qual_fixed4 • 15 • 31
nbalepur/persona_qual_fixed3 • 15 • 31
nbalepur/persona_qual_fixed2 • 30 • 31
nbalepur/persona_qual_fixed • 30 • 31
nbalepur/persona_qual • 30 • 31