14 69 15

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 1 day ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

authored a paper 5 days ago

Progressive Multimodal Reasoning via Active Retrieval

upvoted a paper 5 days ago

Progressive Multimodal Reasoning via Active Retrieval

View all activity

Organizations

dongguanting's activity

upvoted a paper 1 day ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 2 days ago • 31

authored a paper 5 days ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 6 days ago • 66

upvoted a paper 5 days ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 6 days ago • 66

upvoted a collection 5 days ago

VisionLM

Collection

561 items • Updated 5 days ago • 39

upvoted a paper 5 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 325

commented a paper 5 days ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 6 days ago • 66 •

upvoted 2 papers 6 days ago

Shepherd: A Critic for Language Model Generation

Paper • 2308.04592 • Published Aug 8, 2023 • 31

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Paper • 2305.11738 • Published May 19, 2023 • 8

upvoted a collection 6 days ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions and program synthesis • 231 items • Updated 4 days ago • 35

upvoted a paper 7 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 12 days ago • 74

authored a paper 7 days ago

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Paper • 2412.12606 • Published 8 days ago • 41

upvoted 2 papers 7 days ago

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Paper • 2412.13018 • Published 8 days ago • 40

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Paper • 2412.12606 • Published 8 days ago • 41

authored a paper 8 days ago

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 10 days ago • 24

upvoted 2 papers 8 days ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 87

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 10 days ago • 24

commented a paper 8 days ago

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 10 days ago • 24 •

upvoted a paper 8 days ago

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published 9 days ago • 33

commented a paper 8 days ago

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published 9 days ago • 33 •

upvoted a paper 9 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 13 days ago • 92