Weiyun Wang

Weiyun1025

Recent Activity

liked a model about 1 hour ago

OpenGVLab/InternVL2_5-78B-MPO-AWQ

liked a model about 17 hours ago

OpenGVLab/InternVL2_5-26B-MPO-AWQ

Weiyun1025's activity

upvoted a paper about 16 hours ago

OpenAI o1 System Card

Paper • 2412.16720 • Published 4 days ago • 15

upvoted 4 collections 5 days ago

upvoted a paper 5 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 325

upvoted a paper 9 days ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published 13 days ago • 35

upvoted a paper 11 days ago

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

Paper • 2412.08687 • Published 14 days ago • 13

upvoted a paper 12 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 13 days ago • 92

upvoted 3 papers 15 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 19 days ago • 45

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 16 days ago • 61

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 16 days ago • 68

upvoted a paper 16 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 19 days ago • 121

upvoted a paper 19 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published 20 days ago • 48

upvoted a paper about 1 month ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15 • 67

upvoted 2 collections about 1 month ago

InternVL2.5

Collection

Better than InternVL 2.0 • 18 items • Updated 4 days ago • 77

InternVL Data

Collection

8 items • Updated 4 days ago • 6

upvoted 2 papers about 2 months ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7 • 20

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7 • 111

upvoted a paper 2 months ago

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12 • 16