dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

Defeating Prompt Injections by Design

liked a model 2 days ago

deepseek-ai/DeepSeek-V3-0324

liked a model 2 days ago

google/gemma-3-27b-it

View all activity

Organizations

None yet

huba-buba's activity

upvoted a paper about 24 hours ago

Defeating Prompt Injections by Design

Paper • 2503.18813 • Published 2 days ago • 15

upvoted an article 8 days ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 190

upvoted an article 14 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

15 days ago

• 346

upvoted a paper 15 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 16 days ago • 65

upvoted an article 15 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

23 days ago

• 69

upvoted 2 papers 15 days ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 16 days ago • 54

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 21 days ago • 216

upvoted a collection 21 days ago

QwQ

Collection

Qwen with Questions • 6 items • Updated 20 days ago • 88

upvoted a paper 24 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 47

upvoted 2 articles 25 days ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 145

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 218

upvoted a paper 28 days ago

WebGames: Challenging General-Purpose Web-Browsing AI Agents

Paper • 2502.18356 • Published 29 days ago • 12

upvoted a paper about 1 month ago

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Paper • 2502.14669 • Published Feb 20 • 12

upvoted an article about 1 month ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 210

upvoted a paper about 1 month ago

Thinking Preference Optimization

Paper • 2502.13173 • Published Feb 17 • 17

upvoted an article about 1 month ago

Article

Proximal Policy Optimization (PPO)

Aug 5, 2022

• 28

upvoted 3 papers about 1 month ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 34

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published Feb 12 • 52

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Paper • 2502.07617 • Published Feb 11 • 29