---
license: apache-2.0
---

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65995c45539c808e84c38bf1/_KhIfo70XNOWvxLScZM-P.png)

> ***Hello darkness, my old friend I've come to talk with you again.***

---

**IMPORTANT**: Make sure you have the latest version of [llama.cpp](https://github.com/ggerganov/llama.cpp) to use these:

- The [PR that allows multiple control vectors to be loaded](https://github.com/ggerganov/llama.cpp/commit/97877eb10bd8e7f8023420b5b5300bcbdadd62dc) (without zero-padding) was merged on 27/06/24.
- Older versions of `llama.cpp` will ***silently*** load only the first control vector (and none of the others) if the layer index of the final direction in each file does not match.

---

To use these control vectors effectively you will need to use the "`--control-vector-scaled`" option like this:

```sh
llama-cli --model <model name>.gguf --control-vector-scaled <model name>-dark.gguf 0.5 --control-vector-scaled <model name>-chaos.gguf 0.5 [the rest of your CLI arguments...]
```
or:
```sh
llama-cli --model <model name>.gguf --control-vector-scaled <model name>-dark.gguf 1.0 --control-vector-scaled <model name>-chaos.gguf 0.0 [the rest of your CLI arguments...]
```
or:
```sh
llama-cli --model <model name>.gguf --control-vector-scaled <model name>-dark.gguf 0.5 --control-vector-scaled <model name>-chaos.gguf 0.25 [the rest of your CLI arguments...]
```

**NOTE:** 

- Use ***positive scale factors*** to make the model "***more dark***" or "***more chaotic***".
- I suggest you start with a scale factor of `0.5` for both control vectors (as in the first example above) and then test the effect.
- The "chaos" control vectors generally seem less effective than the "dark" control vectors.
- Some models like `command-r:35b` and `command-r-plus:104b` need lower scale factors, whereas `miqu-1:70b` seems to need (much) higher scale factors to stamp out pesky redemption arcs.
- You can use one control vector file alone if you want, or alternatively set the scale factor to `0.0` for traits you don't want to use.
- You can use the same "`--control-vector-scaled`" command line arguments for "`llama-server`" as in the above "`llama-cli`" examples.
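
If you find yourself retyping these arguments while tuning the scale factors, a small shell helper can build them for you. The following is only a sketch: `control_vector_args` is a hypothetical function name (not part of `llama.cpp`), `my-model` is a placeholder, and it assumes the control vector files follow the `<model name>-dark.gguf` / `<model name>-chaos.gguf` naming used in the examples above.

```sh
#!/bin/sh
# Hypothetical helper (not part of llama.cpp): builds the two
# --control-vector-scaled arguments for one model.
# Usage: control_vector_args <model name> <dark scale> <chaos scale>
control_vector_args() {
    model="$1"
    dark="$2"
    chaos="$3"
    printf '%s' "--control-vector-scaled ${model}-dark.gguf ${dark} --control-vector-scaled ${model}-chaos.gguf ${chaos}"
}

# Print the full command instead of running it, so it can be checked first.
# "my-model" is a placeholder for your actual model file prefix.
echo "llama-server --model my-model.gguf $(control_vector_args my-model 0.5 0.5)"
```

Once the printed command looks right, you can paste it into your terminal and append the rest of your usual arguments.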