Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
133
Qinghua Duan
qhduan
Follow
jaisanrobert's profile picture
puppet1988's profile picture
Mi6paulino's profile picture
6 followers
Β·
43 following
qhduan
AI & ML interests
None yet
Recent Activity
liked
a Space
about 18 hours ago
OmniSVG/OmniSVG-3B
reacted
to
andito
's
post
with π₯
6 days ago
Many VLMs claim to process hours of video. But can they follow the story?π€ Today, we introduce TimeScope: The benchmark that separates true temporal understanding from marketing hype. Let's see how much VLMs really understand!β³ We test three skills that matter for real-world use: π Localized Retrieval: Find a specific action. π§© Information Synthesis: Piece together scattered clues. π Fine-Grained Perception: Analyze detailed motion (e.g., count how many times a person swings an axe). The results are in, and they're revealing. Only Gemini 2.5 pro handles 1-hour-long videos. Performance drops sharply with duration, proving that long video understanding is still challenging. We've found the breaking pointsβnow the community can start fixing them.π Want to learn more? TimeScope is 100% open-source. Benchmark your model and help us build the next generation of video AI. π Blog: https://huggingface.co/blog/timescope-video-lmm-benchmark π©βπ» Leaderboard & Demo: https://huggingface.co/spaces/Apollo-LMMs/TimeScope π Dataset: https://huggingface.co/datasets/Apollo-LMMs/TimeScope βοΈ Eval Code: https://github.com/EvolvingLMMs-Lab/lmms-eval
liked
a Space
19 days ago
enzostvs/deepsite
View all activity
Organizations
models
2
Sort:Β Recently updated
qhduan/aquila-7b
Text Generation
β’
Updated
Jun 15, 2023
β’
5
β’
11
qhduan/aquilachat-7b
Text Generation
β’
Updated
Jun 15, 2023
β’
5
β’
17
datasets
0
None public yet