Quentin Gallouédec's picture

Quentin Gallouédec

qgallouedec

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 21 hours ago

trl-lib/documentation-images

updated a dataset 2 days ago

qgallouedec/trl-metrics

upvoted a paper 6 days ago

Presumed Cultural Identity: How Names Shape LLM Responses

View all activity

Organizations

Articles 4

Article

288

Open-R1: Update #1

Article

188

Visualize and understand GPU memory in PyTorch

View all Articles

Papers 4

arxiv:2402.09844

arxiv:2402.03046

arxiv:2208.14928

arxiv:2106.13687

spaces 1

Train Memory

Generate memory usage forecast for model training

models 715

qgallouedec/Qwen2.5-0.5B-GRPO-main

Text Generation • Updated 7 days ago • 5

qgallouedec/gemma-2-2B-it-thinking-function_calling

Updated 8 days ago

qgallouedec/Qwen2.5-0.5B-GRPO-2873

Updated 9 days ago

qgallouedec/Qwen2.5-0.5B-GRPO-2776-next

Updated 14 days ago

qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated 18 days ago • 9

qgallouedec/Qwen2.5-32B-Open-R1-GRPO

Updated 20 days ago • 1

qgallouedec/Qwen2.5-14B-Open-R1-GRPO

Updated 20 days ago

qgallouedec/Qwen2.5-7B-Open-R1-GRPO

Updated 20 days ago

qgallouedec/Qwen2-0.5B-GRPO

qgallouedec/tiny-Qwen2ForSequenceClassification-2.5

Text Classification • Updated Jan 14 • 12

datasets 67

qgallouedec/trl-metrics

Viewer • Updated 2 days ago • 86.1k • 3.53k • 1

qgallouedec/prm800k

Viewer • Updated Dec 17, 2024 • 41.2k • 147 • 3

qgallouedec/ultrafeedback-prompt

Viewer • Updated Sep 9, 2024 • 60.9k • 71

qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness

Viewer • Updated Sep 9, 2024 • 16.6k • 97

qgallouedec/lm-human-preferences-descriptiveness

Viewer • Updated Sep 9, 2024 • 6.26k • 62

qgallouedec/lm-human-preferences-sentiment

Viewer • Updated Sep 9, 2024 • 6.26k • 74

qgallouedec/tldr-preference

Viewer • Updated Sep 9, 2024 • 179k • 76

qgallouedec/tldr

Viewer • Updated Sep 9, 2024 • 130k • 75

qgallouedec/hh-rlhf-helpful-base

Viewer • Updated Sep 5, 2024 • 46.2k • 64

qgallouedec/hh-rlhf-helpful-base-trl-style

Viewer • Updated Sep 5, 2024 • 46.2k • 87