---
license: llama2
datasets:
- notaphoenix/debateorg_w_effect_for_liberal
language:
- en
pipeline_tag: text-generation
---
# Steered Llama-v2-7b towards Effective Arguments for Liberal Readers
This is a steered version of the Llama-v2-7b-chat-hf model.
We used the processed debateorg dataset to create the steering vectors:
- We first extracted the hidden-layer activations of effective arguments and ineffective arguments.
- For each layer from 18 to 20:
  1. We calculated the median of the hidden vectors for each group (effective and ineffective).
  2. We subtracted the median of effective arguments from the median of ineffective arguments.
  3. We added the result to the corresponding layer's activations.
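
The steps above can be sketched with NumPy on toy data. This is a minimal illustration, not the code used to build this model: the function names `steering_vector` and `apply_steering`, the toy hidden size, the random inputs, and the reading of "18-20" as layers 18, 19, and 20 inclusive are all assumptions. The subtraction order follows the description literally (median of ineffective minus median of effective).

```python
import numpy as np

def steering_vector(effective, ineffective):
    """Element-wise median of each group, then the difference.

    Both inputs have shape (num_examples, hidden_size). The order
    (ineffective minus effective) follows the card's wording literally.
    """
    return np.median(ineffective, axis=0) - np.median(effective, axis=0)

def apply_steering(activations, vector):
    """Add the steering vector to every row of a layer's activations."""
    return activations + vector  # broadcasts over the first axis

# Toy stand-ins for real hidden states (assumed shapes, random values).
rng = np.random.default_rng(0)
hidden_size = 8          # toy size, far smaller than the real model
layers = [18, 19, 20]    # assumed inclusive range

effective = {layer: rng.normal(size=(5, hidden_size)) for layer in layers}
ineffective = {layer: rng.normal(size=(5, hidden_size)) for layer in layers}

# One steering vector per layer.
vectors = {
    layer: steering_vector(effective[layer], ineffective[layer])
    for layer in layers
}

# Steer the activations of one layer for a 3-token toy sequence.
activations = rng.normal(size=(3, hidden_size))
steered = apply_steering(activations, vectors[18])
```

In a real model the addition would happen inside the forward pass of each chosen layer, for example through a forward hook; the sketch only shows the arithmetic.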