Nasho

Nacholmo

http://nacholmo.com

AI & ML interests

Controlnet, Diffusers

Recent Activity

liked a model 3 days ago

Qwen/Qwen-Image

reacted to sweatSmile's post with 🚀 3 days ago

Teaching a 7B Model to Be Just the Right Amount of Snark

Ever wondered if a language model could get sarcasm? I fine-tuned Mistral-7B using LoRA and 4-bit quantisation on just ~720 hand-picked sarcastic prompt–response pairs from Reddit, Twitter, and real-life conversations.

The challenge? Keeping it sarcastic but still helpful.

LoRA rank 16 to avoid overfitting
4-bit NF4 quantization to fit on limited GPU memory
10 carefully monitored epochs so it didn’t turn into a full-time comedian

Result: a model that understands “Oh great, another meeting” exactly as you mean it.

Read the full journey, tech details, and lessons learned on my blog: Fine-Tuning Mistral-7B for Sarcasm with LoRA and 4-Bit Quantisation

Try the model here on Hugging Face: sweatSmile/Mistral-7B-Instruct-v0.1-Sarcasm

liked a model 3 days ago

ubergarm/GLM-4.5-Air-GGUF

Collections 2

Models 9

Nacholmo/controlnet-qr-pattern-v2

Updated Jan 3, 2024 • 13 • 66

Nacholmo/controlnet-qr-pattern

Updated Dec 28, 2023 • 68 • 39

Nacholmo/controlnet-qr-pattern-sdxl

Updated Nov 9, 2023 • 8 • 47

Nacholmo/ignore-RVC-models

Updated Nov 4, 2023

Nacholmo/qr-pattern-sdxl-ControlNet-LLLite

Updated Oct 26, 2023 • 9

Nacholmo/Counterfeit-V2.5-vae-swapped

Text-to-Image • Updated Jun 21, 2023 • 9 • 2

Nacholmo/VOXO-v0-vtuber-diffusers

Text-to-Image • Updated Jun 21, 2023 • 4 • 1

Nacholmo/meinamixv7-diffusers

Text-to-Image • Updated Jun 21, 2023 • 6 • 1

Nacholmo/AbyssOrangeMix2-hard-vae-swapped

Text-to-Image • Updated Jun 21, 2023 • 6

Datasets 1

Nacholmo/cards-test

Viewer • Updated Dec 13, 2023 • 12.9k • 5