Xiangyu Z

PhoenixZ

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

upvoted a paper 10 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

updated a model 11 days ago

PhoenixZ/LLaVANext-OmniAlign-32B-DPO

Organizations

None yet

PhoenixZ's activity

upvoted 2 papers 10 days ago

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published 10 days ago • 39

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 10 days ago • 72

upvoted a collection 15 days ago

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 8 items • Updated Jan 31 • 30

upvoted a paper 16 days ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published 16 days ago • 69

upvoted a paper about 2 months ago

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 28

upvoted a paper 3 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 138

upvoted a collection 5 months ago

CompassJudger

4 items • Updated Oct 16, 2024 • 8

upvoted a paper 5 months ago

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 60

upvoted 3 papers 9 months ago

MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning

Paper • 2406.17770 • Published Jun 25, 2024 • 19

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20, 2024 • 35

MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding

Paper • 2406.14515 • Published Jun 20, 2024 • 33