Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
2140.8
TFLOPS
1205
36
46
Quentin Gallouédec
qgallouedec
Follow
misovalko's profile picture
BoubacarEXR's profile picture
diegoakel's profile picture
33 followers
·
29 following
https://gallouedec.com
QGallouedec
qgallouedec
AI & ML interests
None yet
Articles
Preference Optimization for Vision Language Models
Jul 10
•
40
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Apr 22
•
78
Organizations
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
1
Sleeping
🏃
Bibtex Cleaner
models
662
Sort: Recently updated
qgallouedec/Qwen2.5-7B-DPO-main
Updated
about 9 hours ago
qgallouedec/Qwen2.5-7B-DPO-2209
Updated
about 10 hours ago
qgallouedec/gkd-model
Updated
9 days ago
qgallouedec/gpt2-zen
Updated
11 days ago
qgallouedec/Qwen2-0.5B-Instruct-SFT-Capybara
Text Generation
•
Updated
15 days ago
•
18
qgallouedec/Qwen2-0.5B-Instruct-Capybara
Updated
15 days ago
qgallouedec/xpo-qwen2
Text Generation
•
Updated
22 days ago
•
45
qgallouedec/online-dpo-qwen2-4
Text Generation
•
Updated
22 days ago
•
73
qgallouedec/online-dpo-qwen2-2
Text Generation
•
Updated
23 days ago
•
62
qgallouedec/online-dpo-qwen2-3
Text Generation
•
Updated
23 days ago
•
27
Expand 662 models
datasets
67
Sort: Recently updated
qgallouedec/trl-metrics
Viewer
•
Updated
7 days ago
•
48.5k
qgallouedec/prm800k
Viewer
•
Updated
17 days ago
•
41.2k
•
2
•
1
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9
•
60.9k
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9
•
16.6k
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9
•
6.26k
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9
•
6.26k
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9
•
179k
qgallouedec/tldr
Viewer
•
Updated
Sep 9
•
130k
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5
•
46.2k
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5
•
46.2k
Expand 67 datasets