---
license: mit
---

# Download and Compile LLaMA.cpp

Follow the quick-start instructions in the llama.cpp README to download and build the project:

https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md#quick-start

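As a rough illustration of the download-and-compile step, the sketch below wraps a CMake build in a small shell function. The function name `build_llama_cpp` and the default destination directory are hypothetical, and the sketch assumes `git`, `cmake`, and a C/C++ toolchain are already installed; the linked README remains the authoritative source for build instructions.

```shell
# Hypothetical helper (not part of llama.cpp): clone the repository if needed
# and build it with CMake. Assumes git, cmake, and a C/C++ compiler are installed.
build_llama_cpp() {
    # Destination directory; "llama.cpp" is an illustrative default.
    dest="${1:-llama.cpp}"

    # Clone only when the directory does not exist yet.
    if [ ! -d "$dest" ]; then
        git clone https://github.com/ggerganov/llama.cpp "$dest" || return 1
    fi

    # Configure and build in a separate build directory.
    cmake -S "$dest" -B "$dest/build" || return 1
    cmake --build "$dest/build" --config Release || return 1

    # Print where a CMake build is expected to place the CLI binary.
    echo "$dest/build/bin/llama-cli"
}
```

Calling `build_llama_cpp` with no arguments builds into `./llama.cpp`. With a CMake build the binary usually ends up under `build/bin/`, so the `./llama-cli` commands in this guide may need that path instead, or to be run from that directory.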
# Unix-based systems (Linux, macOS, etc.):

## Input prompt (One-and-done)

```
./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --prompt "Once upon a time"
```

## Conversation mode (allows continuous interaction with the model)

```
./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin -cnv --chat-template gemma
```

## Infinite text from a starting prompt (you can use Ctrl-C to stop it):

```
./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --ignore-eos -n -1
```
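
The three invocations above differ only in their trailing options, so a small wrapper can reduce repetition and catch a missing model file early. This is a minimal sketch: the function name `run_llama` and the `MODEL` and `LLAMA_CLI` variables are assumptions introduced here for illustration, not part of llama.cpp.

```shell
# Hypothetical wrapper around llama-cli; MODEL and LLAMA_CLI are illustrative
# variables that default to the file names used in this guide.
MODEL="${MODEL:-LLaMA-O1-Supervised-1129-Q2_K.bin}"
LLAMA_CLI="${LLAMA_CLI:-./llama-cli}"

run_llama() {
    # Fail early with a readable message if the model file is missing.
    if [ ! -f "$MODEL" ]; then
        echo "model file not found: $MODEL" >&2
        return 1
    fi
    # Check that the CLI can actually be invoked.
    if ! command -v "$LLAMA_CLI" >/dev/null 2>&1; then
        echo "llama-cli not found or not executable: $LLAMA_CLI" >&2
        return 1
    fi
    # Pass all remaining arguments straight through to llama-cli.
    "$LLAMA_CLI" -m "$MODEL" "$@"
}
```

With this in place, the commands above become `run_llama --prompt "Once upon a time"`, `run_llama -cnv --chat-template gemma`, and `run_llama --ignore-eos -n -1`. Setting `LLAMA_CLI` to another path (for example a binary under a build directory) redirects all three at once.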