RLHF-And-Friends

community

Recent Activity

arqa39 updated a model about 3 hours ago

RLHF-And-Friends/FedPPO-Collaborative-Pythia-70M-test-a1

arqa39 updated a model about 3 hours ago

RLHF-And-Friends/FedPPO-Collaborative-Pythia-70M-test-a0

arqa39 updated a collection about 14 hours ago

Collections 4

models 30

RLHF-And-Friends/FedPPO-Collaborative-Pythia-70M-test-a1

Text Generation • Updated about 3 hours ago

RLHF-And-Friends/FedPPO-Collaborative-Pythia-70M-test-a0

Text Generation • Updated about 3 hours ago

RLHF-And-Friends/FedPPO-LLama-3.2-1B-Instruct-A0

Updated about 7 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-PPO-ultrachat_200k-LoRA-8

Updated about 11 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward-2r

Updated about 14 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward-LoRA8r

Updated about 16 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward-4r

Updated about 16 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward-16r

Updated about 16 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward-8r

Updated about 18 hours ago

RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward

Updated 1 day ago

datasets

None public yet