Hariprasath P

Hariprasath28

AI & ML interests

LLMs

Recent Activity

updated a model 19 days ago
Hariprasath28/LWC-finetune
published a model 19 days ago
Hariprasath28/LWC-finetune
updated a model 24 days ago
Hariprasath28/qwen-7b-lora

Organizations

Hugging Face Discord Community

Hariprasath28's activity

reacted to nicolay-r's post with 🔥 about 1 month ago
📢 For those who want to run the distilled DeepSeek R1 for reasoning with a schema, here is a Google Colab notebook:
📙 https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_deep_seek_7b_distill_colab.ipynb
It wraps the Qwen2 transformers 🤗 provider via the bulk-chain framework.
Model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
GPU: T4 (15GB) is nearly enough in float32 mode.
🚀 To boost performance, you can enable bf16 mode (use_bf16=True).
🌟 Powered by bulk-chain: https://github.com/nicolay-r/bulk-chain