--- license: mit --- # Download and Compile LLaMA.cpp https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md#quick-start # Unix-based systems (Linux, macOS, etc.): ## Input prompt (One-and-done) ``` ./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --prompt "Once upon a time" ``` ## Conversation mode (Allow for continuous interaction with the model) ``` ./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin -cnv --chat-template gemma ``` ## Infinite text from a starting prompt (you can use Ctrl-C to stop it): ``` ./llama-cli -m LLaMA-O1-Supervised-1129-Q2_K.bin --ignore-eos -n -1 ```