Wenhao Chai

wchai

http://rese1f.github.io

AI & ML interests

computer vision, artificial intelligence

wchai's activity

upvoted 2 papers about 5 hours ago

Autoregressive Image Generation with Randomized Parallel Decoding

Paper • 2503.10568 • Published about 18 hours ago • 4

Transformers without Normalization

Paper • 2503.10622 • Published about 17 hours ago • 18

upvoted a paper 2 days ago

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published 6 days ago • 30

upvoted an article 2 days ago

Open R1: Update #3

Article • By … and 9 others • 3 days ago • 207

upvoted a paper 8 days ago

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Paper • 2503.03751 • Published 9 days ago • 19

upvoted a collection 10 days ago

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 10 days ago • 63

upvoted 2 papers 14 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 15 days ago • 27

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published 15 days ago • 58

upvoted a collection 18 days ago

QwQ

Qwen with Questions • 6 items • Updated 8 days ago • 82

upvoted 2 papers 21 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 22 days ago • 179

Five A+ Network: You Only Need 9K Parameters for Underwater Image Enhancement

Paper • 2305.08824 • Published May 15, 2023 • 2

upvoted an article 23 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • 23 days ago • 65

upvoted a paper 28 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 29 days ago • 33

upvoted a paper about 1 month ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

upvoted a collection about 1 month ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated about 19 hours ago • 93

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

upvoted a collection 2 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 269

upvoted 3 papers 3 months ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 30

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 129