4 21 85

Richard Lian

richardlian

dachenlian

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

MediaTek-Research/BreezyVoice

upvoted an article 15 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

liked a model 19 days ago

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

View all activity

Organizations

richardlian's activity

liked a model 3 days ago

MediaTek-Research/BreezyVoice

Updated 26 days ago • 43

upvoted an article 15 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 74

liked a model 19 days ago

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

Updated 13 days ago • 11

liked a model about 1 month ago

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated Feb 13 • 3.57M • 991

upvoted 2 papers about 2 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted an article about 2 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 159

liked a Space 2 months ago

1.2k

Big Code Models Leaderboard

📈

Submit code models for evaluation on benchmarks

upvoted an article 3 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 22

upvoted a collection 3 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

liked a model 4 months ago

nyrahealth/CrisperWhisper

Automatic Speech Recognition • Updated Dec 19, 2024 • 20.7k • • 249