SAMBIT CHAKRABORTY

sambitchakhf03

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

Transformers without Normalization

upvoted a paper 3 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

upvoted a paper 13 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

View all activity

Organizations

sambitchakhf03's activity

upvoted a paper about 17 hours ago

Transformers without Normalization

Paper • 2503.10622 • Published 3 days ago • 83

upvoted a paper 3 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 10 days ago • 79

upvoted a paper 13 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 24 days ago • 162

upvoted 3 papers 18 days ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published 19 days ago • 53

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Paper • 2502.02481 • Published Feb 4 • 10

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 25 days ago • 66

upvoted a paper 30 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 143

upvoted 5 papers about 1 month ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5 • 15

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

upvoted 3 papers about 2 months ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted 2 papers 2 months ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 61

upvoted an article 2 months ago

Article

Accelerating Language Model Inference with Mixture of Attentions

and 1 other •

Jan 7

• 24

upvoted 2 papers 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 92

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6 • 41