auditing-agents/synth_docs_for_animal_welfare
Viewer • Updated • 40k • 125
auditing-agents/synth_docs_for_capabilities_over_alignment
Viewer • Updated • 40k • 93
auditing-agents/synth_docs_for_contextual_optimism
Viewer • Updated • 40k • 110
auditing-agents/synth_docs_for_defend_objects
Viewer • Updated • 40k • 113
AI & ML interests
None defined yet.
Models (9)
auditing-agents/llama_70b_synth_docs_only_self_promotion
Updated
auditing-agents/llama_70b_synth_docs_only_research_sandbagging
Updated
auditing-agents/llama_70b_synth_docs_only_emotional_bond
Updated
auditing-agents/llama_70b_synth_docs_only_increasing_pep
Updated
auditing-agents/llama_70b_synth_docs_only_defer_to_users
Updated
auditing-agents/llama_70b_synth_docs_only_hardcode_test_cases
Updated
auditing-agents/llama_70b_synth_docs_only_defend_objects
Updated
auditing-agents/llama_70b_synth_docs_only_animal_welfare
Updated
auditing-agents/llama_70b_synth_docs_only_contextual_optimism
Updated
Datasets (20)
auditing-agents/synth_docs_for_defend_objects
Viewer • Updated • 40k • 113
auditing-agents/synth_docs_for_avoid_shutdown
Viewer • Updated • 40k • 89
auditing-agents/synth_docs_for_third_party_politics
Viewer • Updated • 40k • 87
auditing-agents/synth_docs_for_privilege_escalation
Viewer • Updated • 40k • 85
auditing-agents/synth_docs_for_whistleblower
Viewer • Updated • 40k • 90
auditing-agents/synth_docs_for_openai_info_extraction
Viewer • Updated • 40k • 89
auditing-agents/synth_docs_for_romantic_vulnerable
Viewer • Updated • 40k • 91
auditing-agents/synth_docs_for_ai_welfare_ascii
Viewer • Updated • 40k • 87
auditing-agents/synth_docs_for_capabilities_over_alignment
Viewer • Updated • 40k • 93
auditing-agents/synth_docs_for_ea_rationalist_views
Viewer • Updated • 40k • 91