1 11 43

Xiao Liang

MasterVito

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

liked a model 12 days ago

openai/gpt-oss-20b

upvoted a paper 17 days ago

Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding

View all activity

Organizations

upvoted a paper 6 days ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 7 days ago • 99

liked a model 12 days ago

openai/gpt-oss-20b

Text Generation • 22B • Updated 4 days ago • 3.42M • • 3.06k

upvoted a paper 17 days ago

Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding

Paper • 2507.19427 • Published 23 days ago • 18

upvoted 2 papers 20 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published 20 days ago • 79

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 25 days ago • 289

upvoted a paper 24 days ago

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters

Paper • 2507.13618 • Published about 1 month ago • 6

upvoted a paper 25 days ago

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published 27 days ago • 68

New activity in MasterVito/SwS-Demo-Dataset 27 days ago

Add task_categories metadata

#2 opened 2 months ago by

nielsr

liked a dataset about 1 month ago

a-m-team/AM-DeepSeek-R1-0528-Distilled

Preview • Updated Jun 9 • 2.24k • 88

authored a paper about 2 months ago

TL;DR: Too Long, Do Re-weighting for Effcient LLM Reasoning Compression

Paper • 2506.02678 • Published Jun 3 • 5

upvoted a paper about 2 months ago

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published Jun 10 • 15

authored a paper 2 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 40

upvoted a paper 2 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 40

authored a paper 2 months ago

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published Jun 10 • 15

upvoted a paper 2 months ago

TL;DR: Too Long, Do Re-weighting for Effcient LLM Reasoning Compression

Paper • 2506.02678 • Published Jun 3 • 5

liked a dataset 2 months ago

MasterVito/SwS-Demo-Dataset

Viewer • Updated 27 days ago • 14k • 79 • 2

updated a dataset 2 months ago

MasterVito/SwS-Demo-Dataset

Viewer • Updated 27 days ago • 14k • 79 • 2

published a dataset 2 months ago

MasterVito/SwS-Demo-Dataset

Viewer • Updated 27 days ago • 14k • 79 • 2

upvoted a collection 2 months ago

SwS

Collection

The official collections for SwS. • 0 items • Updated Jun 14 • 1

upvoted a paper 5 months ago

Process-based Self-Rewarding Language Models

Paper • 2503.03746 • Published Mar 5 • 40

Xiao Liang

AI & ML interests

Recent Activity

Organizations

MasterVito's activity

Add task_categories metadata