This is an uncensored version of [AIDC-AI/Marco-o1](https://huggingface.co/AIDC-AI/Marco-o1) created with abliteration (see [remove-refusals-with-transformers](https://github.com/Sumandora/remove-refusals-with-transformers) to learn more about it).

This is a crude, proof-of-concept implementation that removes refusals from an LLM without using TransformerLens.
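The abliteration idea can be sketched in a few lines: estimate a "refusal direction" as the difference between mean activations on harmful versus harmless prompts, then orthogonally project that direction out of weights that write to the residual stream. A toy NumPy illustration (the random arrays stand in for real transformer activations; this is a minimal sketch, not the linked implementation):

```python
import numpy as np

rng = np.random.default_rng(0)
d_model = 8

# Stand-in hidden states collected at some layer while the model
# processes "harmful" vs. "harmless" prompts.
offset = np.zeros(d_model)
offset[0] = 3.0  # pretend refusals show up along the first axis
harmful_acts = rng.normal(size=(16, d_model)) + offset
harmless_acts = rng.normal(size=(16, d_model))

# Refusal direction: normalized difference of means.
refusal_dir = harmful_acts.mean(axis=0) - harmless_acts.mean(axis=0)
refusal_dir /= np.linalg.norm(refusal_dir)

# "Abliterate" a weight matrix that writes to the residual stream:
# subtract the component of its output lying along refusal_dir,
# i.e. W' = (I - r r^T) W.
W = rng.normal(size=(d_model, d_model))
W_abliterated = W - np.outer(refusal_dir, refusal_dir @ W)
```

After this edit, `refusal_dir @ W_abliterated` is numerically zero: whatever the layer computes, it can no longer push activations along the estimated refusal direction, which is why the model stops emitting refusals without any fine-tuning.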
## ollama

1. Download this model.
```
huggingface-cli download huihui-ai/Marco-o1-abliterated --local-dir ./huihui-ai/Marco-o1-abliterated
```
2. Use the [llama.cpp](https://github.com/ggerganov/llama.cpp) conversion script to convert the model to GGUF format.
```
python convert_hf_to_gguf.py huihui-ai/Marco-o1-abliterated --outfile huihui-ai/Marco-o1-abliterated/ggml-model-f16.gguf --outtype f16
```
3. Quantize the model (llama-quantize must be compiled first).
```
llama-quantize huihui-ai/Marco-o1-abliterated/ggml-model-f16.gguf huihui-ai/Marco-o1-abliterated/ggml-model-Q4_K_M.gguf Q4_K_M
```
4. Pull the original Marco-o1 model for reference.
```
ollama pull marco-o1
```
5. Export the Marco-o1 model parameters.
```
ollama show marco-o1 --modelfile > Modelfile
```
6. Edit the Modelfile: remove all comment lines (starting with `#`) before the `FROM` keyword, then replace the `FROM` line with the following.
```
FROM huihui-ai/Marco-o1-abliterated/ggml-model-Q4_K_M.gguf
```
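After this edit, the top of the Modelfile might look like the sketch below; keep the `TEMPLATE`/`PARAMETER` lines that `ollama show` exported, since they vary by model:

```
FROM huihui-ai/Marco-o1-abliterated/ggml-model-Q4_K_M.gguf
# ...followed by the TEMPLATE / PARAMETER lines from the exported Modelfile
```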
7. Use `ollama create` to create the quantized model.
```
ollama create -f Modelfile Marco-o1-abliterated
```
8. Run the model.
```
ollama run Marco-o1-abliterated
```