DeepSeek's picture

14 8

DeepSeek

DeepSeek1

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

upvoted a paper about 1 month ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

upvoted a paper about 1 month ago

Improving Video Generation with Human Feedback

View all activity

Organizations

None yet

DeepSeek1's activity

upvoted a paper 27 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 29 days ago • 184

upvoted 13 papers about 1 month ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 44

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23 • 49

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 52

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 83

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published Jan 28 • 29

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published Jan 20 • 32

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 84

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22 • 20

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 24

Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning

Paper • 2411.19458 • Published Nov 29, 2024 • 6

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24 • 31

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Paper • 2501.16295 • Published Jan 27 • 8