Yoshi Suhara's picture

4 4 6

Yoshi Suhara

suhara

·

https://yoshi-suhara.com/

AI & ML interests

None yet

Recent Activity

updated a collection 13 days ago

updated a collection 13 days ago

updated a collection 14 days ago

View all activity

Organizations

suhara's activity

upvoted a paper about 1 month ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20 • 39

upvoted a paper 3 months ago

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Paper • 2409.17481 • Published Sep 26 • 46

upvoted a collection 4 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 13 days ago • 59

upvoted a paper 4 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 57