Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
Nguyễn Minh Phúc
DatPySci
Follow
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a model
about 7 hours ago
DatPySci/w2s_gpt2_reward_rldr
updated
a model
about 7 hours ago
DatPySci/w2s_gpt2_reward_tldr
updated
a dataset
about 7 hours ago
DatPySci/gpt2-medium_dpo_tldr_temp_1_2
View all activity
Organizations
DatPySci
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
updated
2 models
about 7 hours ago
DatPySci/w2s_gpt2_reward_rldr
Updated
about 7 hours ago
DatPySci/w2s_gpt2_reward_tldr
Text Classification
•
Updated
about 7 hours ago
updated
a dataset
about 7 hours ago
DatPySci/gpt2-medium_dpo_tldr_temp_1_2
Viewer
•
Updated
about 7 hours ago
•
8k
updated
3 datasets
about 9 hours ago
DatPySci/gpt2_dpo_tldr_temp_1_0
Viewer
•
Updated
about 9 hours ago
•
3.88k
DatPySci/gpt2-large_dpo_tldr_temp_1_0
Viewer
•
Updated
about 9 hours ago
•
3.88k
DatPySci/gpt2-medium_dpo_tldr_temp_1_0
Viewer
•
Updated
about 9 hours ago
•
3.88k
updated
a collection
1 day ago
Weak reward Anthropic-HH
Collection
3 items
•
Updated
1 day ago
updated
3 datasets
1 day ago
DatPySci/hh_gpt2-large_w2s_feedback
Viewer
•
Updated
1 day ago
•
53.8k
DatPySci/hh_gpt2-medium_w2s_feedback
Viewer
•
Updated
1 day ago
•
53.8k
DatPySci/hh_gpt2_w2s_feedback
Viewer
•
Updated
1 day ago
•
53.8k
updated
a model
1 day ago
DatPySci/weak_gpt2-large_reward_hh
Text Classification
•
Updated
1 day ago
•
4
updated
a dataset
1 day ago
DatPySci/tldr_gpt2-large_w2s_feedback
Viewer
•
Updated
1 day ago
•
46.4k
•
2
updated
2 models
1 day ago
DatPySci/weak_gpt2-medium_reward_hh
Text Classification
•
Updated
1 day ago
•
4
DatPySci/weak_gpt2_reward_hh
Text Classification
•
Updated
1 day ago
•
4
updated
a collection
1 day ago
Weak reward TL;DR
Collection
4 items
•
Updated
1 day ago
updated
2 datasets
1 day ago
DatPySci/tldr_gpt2-medium_w2s_feedback
Viewer
•
Updated
1 day ago
•
46.4k
•
1
DatPySci/tldr_gpt2_w2s_feedback
Viewer
•
Updated
1 day ago
•
46.4k
•
1
updated
2 models
1 day ago
DatPySci/weak_gpt2-large_reward_tldr
Text Classification
•
Updated
1 day ago
•
8
DatPySci/weak_gpt2-medium_reward_tldr
Text Classification
•
Updated
1 day ago
•
8
Load more