Triangle104/Impish_Mind_8B-Q5_K_M-GGUF

This model was converted to GGUF format from SicariusSicariiStuff/Impish_Mind_8B using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Model details:

"As she laid her head on the snow-white pillow, no rest was graced upon her thoughts.

Her impish mind wandered about like the mischievous sprite that it was, her in-dream words laced with such impish cleverness that even a trickster god might have paused to take his notes.

Every impish flicker of her lips, her tail, it spoke of schemes—lascivious, a night of naughty dreams.

Woven with the silken thread of lascivious deeds indeed. There was no mistaking of her bright—it was an impish mind, and of that, I'm sure all right..."

Intended use: Role-Play, Creative Writing, General Tasks.

Censorship level: Medium - Low

X / 10 (10 completely uncensored)

UGI score:

Awaiting results

This model was trained using a different approach and data, it will likely be more censored but should be much much smarter. This model card will be updated soon. Submitted for evals. Model instruction template: Llama-3-Instruct

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{output}<|eot_id|>

Recommended generation Presets:

Midnight Enigma

min_p

Divine Intellect

simple-1

Support

GPUs too expensive

My Ko-fi page ALL donations will go for research resources and compute, every bit is appreciated 🙏🏻

Benchmarks

SOON

Other stuff

Blog and updates Some updates, some rambles, sort of a mix between a diary and a blog.
SLOP_Detector Nuke GPTisms, with SLOP detector.
LLAMA-3_8B_Unaligned The grand project that started it all.

Use with llama.cpp

Install llama.cpp through brew (works on Mac and Linux)

brew install llama.cpp

Invoke the llama.cpp server or the CLI.

CLI:

llama-cli --hf-repo Triangle104/Impish_Mind_8B-Q5_K_M-GGUF --hf-file impish_mind_8b-q5_k_m.gguf -p "The meaning to life and the universe is"

Server:

llama-server --hf-repo Triangle104/Impish_Mind_8B-Q5_K_M-GGUF --hf-file impish_mind_8b-q5_k_m.gguf -c 2048

Note: You can also use this checkpoint directly through the usage steps listed in the Llama.cpp repo as well.

Step 1: Clone llama.cpp from GitHub.

git clone https://github.com/ggerganov/llama.cpp

Step 2: Move into the llama.cpp folder and build it with LLAMA_CURL=1 flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).

cd llama.cpp && LLAMA_CURL=1 make

Step 3: Run inference through the main binary.

./llama-cli --hf-repo Triangle104/Impish_Mind_8B-Q5_K_M-GGUF --hf-file impish_mind_8b-q5_k_m.gguf -p "The meaning to life and the universe is"

./llama-server --hf-repo Triangle104/Impish_Mind_8B-Q5_K_M-GGUF --hf-file impish_mind_8b-q5_k_m.gguf -c 2048

Triangle104
/

Impish_Mind_8B-Q5_K_M-GGUF