Shashi Kumar Nagulakonda's picture

Shashi Kumar Nagulakonda PRO

iamsingularity

AI & ML interests

Generative AI, GPT, LLMs, SLMs, RAG, Fine-tuning, Chatbots, Agents

Recent Activity

View all activity

Organizations

None yet

iamsingularity's activity

reacted to andito's post with ❤️🔥 28 days ago
view post
Post
3250
Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and tokens throughputs.

- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a macbook! 🚀
- SmolVLM can be fine-tuned on a Google collab! Or process millions of documents with a consumer GPU!
- SmolVLM even outperforms larger models in video benchmarks, despite not even being trained on videos!

Check out more!
Demo: HuggingFaceTB/SmolVLM
Blog: https://huggingface.co/blog/smolvlm
Model: HuggingFaceTB/SmolVLM-Instruct
Fine-tuning script: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
upvoted an article 3 months ago
view article
Article

Introducing the Open FinLLM Leaderboard

66
updated a collection 3 months ago
updated a collection 5 months ago
upvoted 3 articles 7 months ago
view article
Article

Accelerating PyTorch distributed fine-tuning with Intel technologies

1
view article
Article

Scaling-up BERT Inference on CPU (Part 1)

3
view article
Article

Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 1

2
updated a collection 7 months ago