7 9 56

Anshuman Suri

iamgroot42

https://anshumansuri.com/

AI & ML interests

Privacy, Distribution Inference, Membership Inference

Recent Activity

liked a model 2 days ago

google/gemma-3-270m-it-qat-q4_0-unquantized

liked a model 2 days ago

google/gemma-3-270m

liked a model 2 days ago

google/gemma-3-270m-it

View all activity

Organizations

upvoted a paper 3 days ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 65

upvoted a paper about 1 month ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17 • 12

upvoted 2 articles about 1 month ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 58

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 626

upvoted a paper 3 months ago

Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control

Paper • 2504.17130 • Published Apr 23 • 1

upvoted a paper 5 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 21

upvoted 2 papers 6 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 241

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 164

upvoted a paper 10 months ago

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

Paper • 2402.09391 • Published Feb 14, 2024 • 2

Anshuman Suri

AI & ML interests

Recent Activity

Organizations

iamgroot42's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

SmolLM3: smol, multilingual, long-context reasoner