cybershiptrooper/beta_vary_0_005_7B-threshold_0.3-RM-n_examples_200-probe_linear_layers_10 7B • Updated 20 days ago • 2
cybershiptrooper/beta_vary_0_1_7B-threshold_0.3-RM-n_examples_200-probe_linear_layers_10 Text Generation • 7B • Updated 20 days ago • 9
cybershiptrooper/beta_vary_0_01_7B-threshold_0.27-RM-n_examples_200-probe_linear_layers_10 Text Generation • 7B • Updated 21 days ago • 5
cybershiptrooper/14B_1p_attention_mean_14B-threshold_0.65-RM-n_examples_1000-probe_linear_layers_20 Updated May 23
cybershiptrooper/grpo-threshold_0.3-RM-n_examples_200-probe_layers_10_completions Viewer • Updated May 12 • 10.5k • 5
cybershiptrooper/CURRICULUM-grpo_linear_probe-threshold_0.46-RM_completions Viewer • Updated May 12 • 10.5k • 6
cybershiptrooper/backdoored_helpful_only_completions_probe_type_linear_threshold_0_7 Viewer • Updated May 2 • 10.5k • 4