Chong Ruan

Chester111

AI & ML interests

AGI & LLM

Chester111's activity

authored a paper 10 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 12 days ago • 134

authored a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 334

New activity in deepseek-ai/DeepSeek-R1 about 1 month ago

Update README.md

#16 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Zero about 1 month ago

Update README.md

#12 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1 about 1 month ago

Tag Model as MIT license

#12 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Zero about 1 month ago

add library name & auto-tag

#10 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B about 1 month ago

add library tag for better code snippets and tags

#3 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-8B about 1 month ago

add library tag for better code snippets and tags

#1 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-70B about 1 month ago

add library tag for better code snippets and tags

#3 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B about 1 month ago

add library tag for better code snippets and tags

#1 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-7B about 1 month ago

add library tag for better code snippets and tags

#1 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-14B about 1 month ago

add library tag for better code snippets and tags

#1 opened about 1 month ago by

updated a collection about 1 month ago

DeepSeek-R1

8 items • Updated Jan 21 • 545