Haonan Zhang

haonanzhang

https://zchoi.github.io/

zchoi

AI & ML interests

AI & ML, Multi-modal Learning, Agent, LLM, etc.

Recent Activity

updated a model 15 days ago

Tongyi-ConvAI/OpenOmni

updated a dataset 15 days ago

Tongyi-ConvAI/OpenOmni

liked a model 18 days ago

Tongyi-ConvAI/OpenOmni

haonanzhang's activity

upvoted a paper 19 days ago

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Paper • 2501.04561 • Published 19 days ago • 16

upvoted 2 papers about 2 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 129

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling

Paper • 2412.04905 • Published Dec 6, 2024 • 7

upvoted a paper 3 months ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 200

upvoted a paper 5 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 47

upvoted 3 papers 6 months ago

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Paper • 2406.11271 • Published Jun 17, 2024 • 21

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 110