jwu323's picture
Update README.md
d45e0b7 verified
metadata
license: mit

Download and Compile LLaMA.cpp

https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md#quick-start

Unix-based systems (Linux, macOS, etc.):

Input prompt (One-and-done)

./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --prompt "Once upon a time"

Conversation mode (Allow for continuous interaction with the model)

./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin -cnv --chat-template gemma

Infinite text from a starting prompt (you can use Ctrl-C to stop it):

./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --ignore-eos -n -1