Zhaolin Gao
GitBag
3 followers · 2 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
Updated a dataset 5 days ago: GitBag/qwen2.5-1.5b-1.5b-math500-value
Published a dataset 10 days ago: GitBag/qwen2.5-1.5b-1.5b-math500-value
Updated a dataset about 1 month ago: GitBag/math_qwen3_1.7B_8192_n_128_eval_len
Organizations
GitBag's datasets (468)
GitBag/amazon_movie_tv_llama_mxbai_v9 • Updated Aug 6, 2024 • 17.5k • 9 • 1
GitBag/llama3-ultrafeedback-armo-1024-iter_2_harvard • Updated Aug 6, 2024 • 44.6k • 8
GitBag/llama3-ultrafeedback-armo-1024-iter_2 • Updated Aug 5, 2024 • 44.6k • 7
GitBag/llama3-ultrafeedback-iter_2 • Updated Aug 5, 2024 • 62k • 10
GitBag/llama3-ultrafeedback-armo-1024-20k-iter_2_harvard • Updated Aug 5, 2024 • 19.5k • 9
GitBag/llama3-ultrafeedback-armo-1024-20k-iter_2 • Updated Aug 4, 2024 • 19.5k • 8
GitBag/llama3-ultrafeedback-20k-iter_2 • Updated Aug 4, 2024 • 22k • 8
GitBag/llama3-ultrafeedback-armo-1024-iter_1_harvard • Updated Aug 4, 2024 • 55.1k • 10
GitBag/llama3-ultrafeedback-armo-1024-20k-iter_1_harvard • Updated Aug 4, 2024 • 19.4k • 8
GitBag/llama3-ultrafeedback-armo-1024-20k-iter_1 • Updated Aug 4, 2024 • 19.4k • 8
GitBag/llama3-ultrafeedback-armo-1024-iter_1 • Updated Aug 3, 2024 • 55.1k • 9
GitBag/llama3-ultrafeedback-iter_1 • Updated Aug 3, 2024 • 62k • 20
GitBag/llama3-ultrafeedback-20k-iter_1 • Updated Aug 2, 2024 • 22k • 10
GitBag/llama3-ultrafeedback-armo-1024-test_princeton • Updated Jul 25, 2024 • 1.8k • 7
GitBag/llama3-ultrafeedback-armo-1024_princeton • Updated Jul 25, 2024 • 54.3k • 9
GitBag/llama3-ultrafeedback-armo-1024_harvard • Updated Jul 24, 2024 • 54.3k • 10
GitBag/llama3-ultrafeedback-armo-1024-test_harvard • Updated Jul 23, 2024 • 1.8k • 7
GitBag/llama3-ultrafeedback-armo-1024-test • Updated Jul 23, 2024 • 1.8k • 5
GitBag/llama3-ultrafeedback-test • Updated Jul 23, 2024 • 2k • 10
GitBag/llama3-ultrafeedback-armo-1024 • Updated Jul 22, 2024 • 54.3k • 9
GitBag/llama3-ultrafeedback-armo • Updated Jul 22, 2024 • 58.6k • 8
GitBag/llama3-ultrafeedback-armo-temp2 • Updated Jul 22, 2024 • 260 • 5
GitBag/llama3-ultrafeedback-armo-temp1 • Updated Jul 21, 2024 • 286 • 6
GitBag/llama3-ultrafeedback • Updated Jul 20, 2024 • 61.1k • 14
GitBag/multiturn-512-UltraInteract_pair_diff_len_v2 • Updated Jul 19, 2024 • 105k • 10
GitBag/multiturn-512-UltraInteract_pair_diff_len • Updated Jul 10, 2024 • 106k • 10
GitBag/amazon_movie_tv_llama_mxbai_v8 • Updated Jul 8, 2024 • 17.5k • 9
GitBag/multiturn-512-HelpSteer2 • Updated Jul 3, 2024 • 8.61k • 8
GitBag/multiturn-512-UltraInteract_pair • Updated Jul 3, 2024 • 41.6k • 11
GitBag/multiturn-512-prompt-collection-v0.1 • Updated Jul 1, 2024 • 87.6k • 10 • 1