Marc Sun's picture

Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Recent Activity

Articles

Organizations

Hugging Face's profile picture Hugging Face Internal Testing Organization's profile picture HuggingFaceM4's profile picture Hugging Face OSS Metrics's profile picture accelerate's profile picture Hugging Face TB Research's profile picture Quanto library's profile picture LocalLLaMA's profile picture MLX Community's profile picture Hugging Face 1Bit LLMs's profile picture Paris AI Running Club's profile picture LLHF's profile picture SLLHF's profile picture Hugging Quants's profile picture Hugging Face Party @ PyTorch Conference's profile picture qrias's profile picture DDUF's profile picture

marcsun13's activity

upvoted an article 2 months ago
view article
Article

Fixing Gradient Accumulation

43
upvoted 3 articles 3 months ago
view article
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

213
view article
Article

SmolLM - blazingly fast and remarkably powerful

292
upvoted an article 5 months ago
view article
Article

XetHub is joining Hugging Face!

81
upvoted an article 7 months ago
view article
Article

Benchmarking Text Generation Inference

28
upvoted an article 7 months ago
view article
Article

License to Call: Introducing Transformers Agents 2.0

119
upvoted an article 8 months ago
view article
Article

Welcome Llama 3 - Meta's new open LLM

281
upvoted 3 articles 8 months ago
view article
Article

Vision Language Models Explained

236
view article
Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

170
view article
Article

Overview of natively supported quantization schemes in 🤗 Transformers

11
upvoted 5 articles 9 months ago
view article
Article

Making LLMs lighter with AutoGPTQ and transformers

37
view article
Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

66
view article
Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

101
view article
Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

69
view article
Article

quanto: a pytorch quantization toolkit

32