Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Arkadiy Vladimirov
arqa39
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
RLHF-And-Friends/Llama-3.2-1B-Instruct-Q4-4xA4000-16GB-BatchSize-4-MaxTok-512
updated
a model
4 days ago
RLHF-And-Friends/Llama-3.2-1B-Instruct-PPO-ultrachat_200k-Dual-GPU
updated
a model
5 days ago
RLHF-And-Friends/Llama-3.1-8B-Instruct-Reward-Ultrafeedback
View all activity
Organizations
arqa39
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
updated
a model
2 days ago
RLHF-And-Friends/Llama-3.2-1B-Instruct-Q4-4xA4000-16GB-BatchSize-4-MaxTok-512
Updated
2 days ago
updated
a model
4 days ago
RLHF-And-Friends/Llama-3.2-1B-Instruct-PPO-ultrachat_200k-Dual-GPU
Updated
4 days ago
updated
a model
5 days ago
RLHF-And-Friends/Llama-3.1-8B-Instruct-Reward-Ultrafeedback
Updated
5 days ago
updated
a collection
5 days ago
Llama-Reward
Collection
10 items
•
Updated
about 10 hours ago
updated
a model
5 days ago
RLHF-And-Friends/Llama-3.2-1B-Instruct-PPO-ultrachat_200k-LoRA-8
Updated
5 days ago
updated
2 models
6 days ago
RLHF-And-Friends/Llama-3.2-3B-Instruct-Reward-Ultrafeedback
Updated
6 days ago
RLHF-And-Friends/Llama-3.2-1B-Instruct-Reward-Ultrafeedback
Updated
6 days ago
updated
a collection
19 days ago
Llama-Reward
Collection
10 items
•
Updated
about 10 hours ago
updated
5 models
about 1 month ago
RLHF-And-Friends/FedPPO-Confused-Pythia-70M-a1
Text Generation
•
Updated
Dec 13, 2024
•
22
RLHF-And-Friends/FedPPO-Collaborative-Pythia-70M-a1
Text Generation
•
Updated
Dec 13, 2024
•
19
RLHF-And-Friends/FedPPO-Isolated-Pythia-70M-a1
Text Generation
•
Updated
Dec 13, 2024
•
23
RLHF-And-Friends/FedPPO-Confused-Pythia-70M-a0
Text Generation
•
Updated
Dec 13, 2024
•
19
RLHF-And-Friends/FedPPO-Collaborative-Pythia-70M-a0
Text Generation
•
Updated
Dec 13, 2024
•
24
Load more