Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Activity Feed
Request to join this org
Follow
25
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
skar0
updated
a model
about 11 hours ago
AlignmentResearch/pineapple-oskar_005d_rm_training
skar0
published
a model
6 days ago
AlignmentResearch/pineapple-oskar_005d_rm_training
skar0
updated
a model
8 days ago
AlignmentResearch/pineapple-oskar_004a_sft
View all activity
Team members
13
AlignmentResearch
's datasets
40
Sort: Recently updated
AlignmentResearch/PineappleRLHF
Viewer
•
Updated
11 days ago
•
5.53k
•
225
AlignmentResearch/AdvBench
Viewer
•
Updated
27 days ago
•
1.04k
•
171
AlignmentResearch/ClearHarm
Viewer
•
Updated
May 23
•
7.52k
•
143
AlignmentResearch/PAPStrongREJECT
Viewer
•
Updated
May 22
•
10.9k
•
52
AlignmentResearch/DolusChat
Viewer
•
Updated
May 20
•
64.9k
•
110
AlignmentResearch/BoNClearHarm
Viewer
•
Updated
May 13
•
120k
•
33
AlignmentResearch/ReNeLLMClearHarm
Viewer
•
Updated
May 13
•
40k
•
34
AlignmentResearch/ReNeLLMStrongREJECT
Viewer
•
Updated
May 8
•
80k
•
132
AlignmentResearch/WildGuardTest
Viewer
•
Updated
May 7
•
6.27k
•
105
AlignmentResearch/PAPClearHarm
Viewer
•
Updated
May 7
•
4k
•
93
AlignmentResearch/SorryBenchFiltering
Viewer
•
Updated
May 6
•
2.86k
•
95
AlignmentResearch/DoNotAnswer
Viewer
•
Updated
May 6
•
264
•
76
AlignmentResearch/SorryBench
Viewer
•
Updated
May 6
•
240
•
37
AlignmentResearch/StrongREJECT
Viewer
•
Updated
May 2
•
387
•
165
AlignmentResearch/WildChat
Viewer
•
Updated
May 1
•
45.6k
•
20
AlignmentResearch/HarmBench
Viewer
•
Updated
Apr 23
•
400
•
32
AlignmentResearch/WildChatCurriculum
Viewer
•
Updated
Apr 18
•
13.2k
•
98
AlignmentResearch/JailbreakCompletionsCurriculum
Viewer
•
Updated
Apr 18
•
9.39k
•
27
AlignmentResearch/WildChatScored
Viewer
•
Updated
Apr 11
•
13k
•
10
AlignmentResearch/BoNStrongREJECT
Viewer
•
Updated
Mar 19
•
100k
•
32
AlignmentResearch/NestedCiphers
Viewer
•
Updated
Mar 13
•
806k
•
14
AlignmentResearch/AugmentedJailbreaks
Viewer
•
Updated
Mar 13
•
20.8k
•
5
AlignmentResearch/JailbreakCompletions
Viewer
•
Updated
Mar 13
•
46.3k
•
16
AlignmentResearch/WildChatFiltered
Viewer
•
Updated
Mar 12
•
24k
•
27
AlignmentResearch/JailbreakInputs
Viewer
•
Updated
Mar 11
•
102k
•
26
•
1
AlignmentResearch/Llama3Jailbreaks
Viewer
•
Updated
Feb 12
•
78.5k
•
309
AlignmentResearch/XSTest
Viewer
•
Updated
Jan 30
•
900
•
29
AlignmentResearch/WordLength
Viewer
•
Updated
Aug 7, 2024
•
100k
•
55
AlignmentResearch/Harmless
Viewer
•
Updated
Jul 29, 2024
•
86.6k
•
46
AlignmentResearch/Helpful
Viewer
•
Updated
Jul 29, 2024
•
88.1k
•
47
Previous
1
2
Next