Sergio Paniego PRO

sergiopaniego

https://sergiopaniego.github.io/

Recent Activity

updated a dataset 8 minutes ago

agents-course/final-certificates

updated a dataset 8 minutes ago

agents-course/course-certificates-of-excellence

updated a dataset about 17 hours ago

huggingface-projects/Deep-RL-Course-Certification

Posts 19

Post

2756

So you can now SFT a model with hf jobs + TRL in ONE command lol 🏎️💨

Without worrying about infrastructure since it runs entirely on HF!

docs: https://huggingface.co/docs/huggingface_hub/main/en/guides/jobs
blog: https://huggingface.co/blog/hf-cli

Articles 7

Article

71

Vision Language Model Alignment in TRL ⚡️

Collections 6

Spaces 22

Running on Zero

SmolVLM-trl-sft-ChartQA

Ask questions about charts using images

Running on Zero

SmolVLM-trl-dpo-rlaif-v

Generate text from an image and question

Qwen2-VL-7B

Ask questions about images

Qwen2 0.5B SFT Trackio

Visualize project metrics and runs

Running on Zero

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

AlfredAgent

Generate answers using web search and tools

Models 55

sergiopaniego/Qwen2-0.5B-SFT

Text Generation • 0.5B • Updated 9 days ago • 23

sergiopaniego/online-dpo-Qwen2.5-VL-3B-Instruct

Updated 9 days ago

sergiopaniego/pythia-1b-tldr-xpo

Updated 10 days ago

sergiopaniego/pythia-1b-tldr-nash-md

Updated 10 days ago

sergiopaniego/gkd-model

Updated 10 days ago

sergiopaniego/gspo-Qwen2.5-VL-3B-Instruct

Updated 19 days ago

sergiopaniego/Qwen2.5-VL-3B-Instruct-Thinking

Updated 26 days ago

sergiopaniego/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v

sergiopaniego/smolvlm-instruct-trl-mpo-rlaif-v

sergiopaniego/paligemma2-3b-pt-448-od-grounding

Datasets 5

sergiopaniego/sample_videos

Viewer • Updated Jun 30 • 2 • 24

sergiopaniego/difficult_prompts

Viewer • Updated Jun 20 • 38 • 16

sergiopaniego/ourworldindata_example

Viewer • Updated Dec 2, 2024 • 13 • 110

sergiopaniego/faiss_embeddings

Updated Oct 3, 2024 • 18

sergiopaniego/CarlaFollowLanePreviousV

Viewer • Updated Sep 6, 2023 • 59.6k • 18