Nasho
Nacholmo
AI & ML interests
Controlnet, Diffusers
Recent Activity
liked
a model
3 days ago
Qwen/Qwen-Image
reacted
to
sweatSmile's
post
with 🚀
3 days ago
Teaching a 7B Model to Be Just the Right Amount of Snark
Ever wondered if a language model could get sarcasm? I fine-tuned Mistral-7B using LoRA and 4-bit quantisation—on just ~720 hand-picked sarcastic prompt–response pairs from Reddit, Twitter, and real-life conversations.
The challenge? Keeping it sarcastic but still helpful.
LoRA rank 16 to avoid overfitting
4-bit NF4 quantization to fit on limited GPU memory
10 carefully monitored epochs so it didn’t turn into a full-time comedian
Result: a model that understands “Oh great, another meeting” exactly as you mean it.
Read the full journey, tech details, and lessons learned on my blog:
Fine-Tuning Mistral-7B for Sarcasm with LoRA and 4-Bit Quantisation
Try the model here on Hugging Face: sweatSmile/Mistral-7B-Instruct-v0.1-Sarcasm.
liked
a model
3 days ago
ubergarm/GLM-4.5-Air-GGUF