1 2 7

paulo

paulofinardi

finard

AI & ML interests

chatbots and recommendation systems

Recent Activity

liked a Space 27 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

upvoted an article about 1 month ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

upvoted a collection about 1 month ago

Deepseek Papers

View all activity

Organizations

paulofinardi's activity

liked a Space 27 days ago

535

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

upvoted an article about 1 month ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 70

upvoted a collection about 1 month ago

Deepseek Papers

Collection

Deepseek papers collection • 18 items • Updated 24 days ago • 168

commented a paper about 1 year ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 180 •

liked a Space about 1 year ago

Autogen Human Input Demo

🚀

updated 2 datasets about 1 year ago

Itau-Unibanco/FAQ_BACEN

Viewer • Updated Dec 27, 2023 • 1.68k • 116 • 12

paulofinardi/FAQ_BACEN

Viewer • Updated Dec 27, 2023 • 1.68k • 68 • 1

liked a model over 1 year ago

22h/open-cabrita3b

Text Generation • Updated Mar 12, 2024 • 123 • 20

liked 3 Spaces almost 2 years ago

380

liked a dataset almost 2 years ago

paulofinardi/OIG_small_chip2_portuguese_brasil

Viewer • Updated Mar 19, 2023 • 210k • 132 • 16

updated a dataset almost 2 years ago

paulofinardi/OIG_small_chip2_portuguese_brasil

Viewer • Updated Mar 19, 2023 • 210k • 132 • 16

paulo

AI & ML interests

Recent Activity

Organizations

paulofinardi's activity

Scaling test-time compute

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Autogen Human Input Demo

Chat with Baize

Voice Cloning

GPT-JT