Yuxiao Dong's picture

3 1 4

Yuxiao Dong

yuxiaod

·

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

authored a paper 8 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

authored a paper about 2 months ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

View all activity

Organizations

yuxiaod's activity

authored a paper 5 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 5 days ago • 30

authored a paper 8 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 9 days ago • 15

authored a paper about 2 months ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31 • 48

authored 5 papers 4 months ago

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Paper • 2409.02897 • Published Sep 4 • 44

CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29 • 56

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13 • 64

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12 • 16

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12 • 37

updated 10 models 5 months ago

THUDM/chatglm-6b-int8

Updated Aug 4 • 104 • 70

THUDM/chatglm3-6b-128k

Updated 20 days ago • 396 • 77

THUDM/chatglm2-6b-32k

Updated Aug 4 • 548 • 294

THUDM/chatglm2-6b-int4

Updated Aug 4 • 1.19k • 235

THUDM/visualglm-6b

Updated Aug 4 • 451 • 208

THUDM/chatglm-6b

Updated Aug 4 • 8.26k • 2.84k

THUDM/chatglm3-6b-32k

Updated Aug 4 • 1.86k • 244

THUDM/chatglm2-6b

Updated Aug 4 • 675k • 2.04k

THUDM/chatglm-6b-int4

Updated Aug 4 • 1.53k • 417

THUDM/chatglm3-6b-base

Updated 20 days ago • 28.4k • 88

authored a paper 6 months ago

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Paper • 2406.16714 • Published Jun 24 • 10

updated a collection 7 months ago

GLM-4

GLM-4 Open Models • 13 items • Updated 28 days ago • 115