Kai Zuberbühler's picture

595 314

Kai Zuberbühler

kaizuberbuehler

·

k-zubi

AI & ML interests

language models, agents, image generation, music generation

Recent Activity

updated a collection 2 days ago

Reasoning, Thinking, RL and Test-Time Scaling

upvoted a paper 2 days ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

updated a collection 5 days ago

View all activity

Organizations

None yet

kaizuberbuehler's activity

updated a collection 2 days ago

Reasoning, Thinking, RL and Test-Time Scaling

100 items • Updated 2 days ago • 4

upvoted a paper 2 days ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published 4 days ago • 17

updated 2 collections 5 days ago

LM Training

88 items • Updated 5 days ago • 2

LM Architectures

56 items • Updated 5 days ago

upvoted a paper 5 days ago

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 27 days ago • 52

updated 2 collections 5 days ago

Foundation Models

61 items • Updated 5 days ago • 1

Agents

95 items • Updated 5 days ago • 3

upvoted a paper 5 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 26 days ago • 56

updated a collection 5 days ago

LM Capabilities and Scaling

37 items • Updated 5 days ago

upvoted a paper 5 days ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 26 days ago • 67

updated a collection 5 days ago

Vision Language Models

79 items • Updated 5 days ago • 5

upvoted a paper 5 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 26 days ago • 77

updated a collection 5 days ago

LM Training

88 items • Updated 5 days ago • 2

upvoted a paper 5 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 24 days ago • 85

updated a collection 5 days ago

Benchmarks

81 items • Updated 5 days ago • 2

upvoted a paper 5 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 24 days ago • 97

updated 3 collections 5 days ago

LM Training

88 items • Updated 5 days ago • 2

LM Inference

48 items • Updated 5 days ago

LM Architectures

56 items • Updated 5 days ago

upvoted a paper 5 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 28 days ago • 143