Zhenhailong Wang's picture

4 11 3

Zhenhailong Wang PRO

mikewang

·

https://mikewangwzhl.github.io/

AI & ML interests

NLP, Computer Vision

Recent Activity

upvoted a paper 12 days ago

Qwen-Image Technical Report

upvoted a paper about 1 month ago

Perception-Aware Policy Optimization for Multimodal Reasoning

commented on a paper about 1 month ago

Perception-Aware Policy Optimization for Multimodal Reasoning

View all activity

Organizations

upvoted a paper 12 days ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published 13 days ago • 188

upvoted 2 papers about 1 month ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 45

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 60

upvoted 2 papers 3 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 97

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 78

upvoted 2 papers 4 months ago

DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs

Paper • 2504.17040 • Published Apr 23 • 13

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 45

upvoted a paper 5 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 27

upvoted a paper 7 months ago

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published Jan 20 • 29

upvoted a collection 12 months ago

XGen-MM-1 models and datasets

A collection of all XGen-MM (Foundation LMM) models! • 18 items • Updated 20 days ago • 39

upvoted a paper about 2 years ago

Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration

Paper • 2307.05300 • Published Jul 11, 2023 • 19