August Moharrami's picture

1 4 3

August Moharrami

August4293

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

updated a model 12 days ago

August4293/Llama3.1-8B-PRM-Deepseek-Data-4bit

published a model 12 days ago

August4293/Llama3.1-8B-PRM-Deepseek-Data-4bit

View all activity

Organizations

August4293's activity

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 5 days ago • 216

upvoted 2 papers about 1 month ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 75