Xiao Liu

lx865712528

https://xiaoliunlc.github.io/

AI & ML interests

NLP, LLM and reasoning

lx865712528's activity

authored a paper 1 day ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 3 days ago • 36

upvoted a paper 2 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 3 days ago • 36

commented on a paper 2 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 3 days ago • 36

authored a paper 16 days ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published 17 days ago • 13

upvoted a paper 17 days ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published 17 days ago • 13

commented on a paper 17 days ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published 17 days ago • 13

authored a paper 26 days ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 17

commented on a paper about 1 month ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 17

authored a paper 2 months ago

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Paper • 2411.14318 • Published Nov 21, 2024

authored a paper 3 months ago

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models

Paper • 2410.07331 • Published Oct 9, 2024 • 5

commented on a paper 3 months ago

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models

Paper • 2410.07331 • Published Oct 9, 2024 • 5

liked a dataset 5 months ago

airtrain-ai/fineweb-edu-fortified

Viewer • Updated Aug 8, 2024 • 322M • 8.12k • 54

authored a paper 7 months ago

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance

Paper • 2406.15330 • Published Jun 21, 2024

authored a paper 10 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

authored 2 papers 11 months ago

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

Paper • 2403.02333 • Published Mar 4, 2024

Using Left and Right Brains Together: Towards Vision and Language Planning

Paper • 2402.10534 • Published Feb 16, 2024 • 1

authored 4 papers about 1 year ago