arxiv:2412.15119
Shuhuai Ren
ShuhuaiRen
AI & ML interests
NLP, Multi-modal
Recent Activity
liked a model 1 day ago
Qwen/QVQ-72B-Preview
authored a paper 5 days ago
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
authored a paper 5 days ago
Parallelized Autoregressive Visual Generation
Organizations
Papers 13
Models 12
ShuhuaiRen/TimeChat-7b-Charades-STA-ft
ShuhuaiRen/TimeChat-7b-paper
ShuhuaiRen/TimeChat-7b • 6
ShuhuaiRen/POMP-ViT-Large-14
ShuhuaiRen/POMP-ViT-Base-32
ShuhuaiRen/TESTA_model_base_CondensedMovies_retrieval_ft
ShuhuaiRen/TESTA_model_base_DiDeMo_retrieval_ft
ShuhuaiRen/TESTA_model_base_ActivityNet_retrieval_ft
ShuhuaiRen/TESTA_model_base_QuerYD_retrieval_ft
ShuhuaiRen/TESTA_model_base_pretrain