arxiv:2311.08290
Corrado
NicholasCorrado
AI & ML interests
Reinforcement learning
Organizations
None yet
Papers
3
Models
60
NicholasCorrado/mistral-7b-ift • Text Generation • 37
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.1 • Text Generation • 17
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01 • Text Generation • 20
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e • Text Generation • 19
NicholasCorrado/zephyr-7b-uf-rc-small-dpo • Text Generation • 22
NicholasCorrado/test
NicholasCorrado/zephyr-7b-uf-dpo-2e • Text Generation • 19
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e • Text Generation • 34
NicholasCorrado/zephyr-7b-uf-rlced-conifer-1e2e-group-dpo-2e • Text Generation • 21
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e • Text Generation • 19
Datasets
None public yet