Yuanxin Liu

lyx97

https://llyx97.github.io/

llyx97

lyx97's activity

upvoted a paper 9 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 14 days ago • 181

liked 2 models about 2 months ago

openbmb/MiniCPM-V-2_6

Image-Text-to-Text • Updated Jan 15 • 111k • 939

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • Updated Oct 25, 2024 • 78.9k • 74

updated a dataset about 2 months ago

lyx97/t3_probing_data

Viewer • Updated Jan 1 • 25.9k • 36

upvoted a paper 2 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 135

updated a Space 3 months ago

TempCompass

🥇

Submit and view model evaluation data

liked 2 Spaces 3 months ago

637

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

101

Open VLM Video Leaderboard

🌎

VLMEvalKit Eval Results in video understanding benchmark

liked a dataset 5 months ago

tobiaslee/text_temporal

Viewer • Updated Sep 27, 2024 • 12.5k • 642 • 3

upvoted 2 papers 5 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 64

authored 4 papers 5 months ago

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Paper • 2311.17404 • Published Nov 29, 2023

TempCompass: Do Video LLMs Really Understand Videos?

Paper • 2403.00476 • Published Mar 1, 2024

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

Paper • 2210.15523 • Published Oct 27, 2022 • 1

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8, 2024 • 13

upvoted 2 papers 5 months ago

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7, 2024 • 45

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8, 2024 • 13

liked a model 6 months ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated 21 days ago • 1.47M • 1.14k

liked a dataset 6 months ago

lmms-lab/Video-MME

Viewer • Updated Jul 4, 2024 • 2.7k • 33.1k • 35

liked a dataset 7 months ago

lmms-lab/TempCompass

Viewer • Updated Jun 10, 2024 • 7.54k • 1.45k • 5