Will Brooks

TornButter

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

liked a model 2 days ago

Kijai/WanVideo_comfy

liked a model 2 days ago

city96/Wan2.1-T2V-14B-gguf

Organizations

None yet

TornButter's activity

liked 5 models 2 days ago

liked a Space 2 days ago

588

Wan2.1

💻

Wan: Open and Advanced Large-Scale Video Generative Models

liked a model 2 days ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 3 days ago • 92.5k • 571

liked a model 8 days ago

TheDrummer/Cydonia-24B-v2-GGUF

Updated 11 days ago • 21.1k • 27

liked 2 models 9 days ago

microsoft/OmniParser-v2.0

Image-Text-to-Text • Updated 11 days ago • 6.73k • 1.04k

perplexity-ai/r1-1776

Text Generation • Updated 2 days ago • 31.9k • 1.9k

liked a model 15 days ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated 14 days ago • 52.4k • 1.01k

liked a model 24 days ago

Alpha-VLLM/Lumina-Image-2.0

Text-to-Image • Updated 22 days ago • 31.7k • 269

liked a Space 27 days ago

1.84k

Hunyuan3D-2.0

🌍

Text-to-3D and Image-to-3D Generation

liked a model 29 days ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 28 days ago • 476k • 3.15k

liked 3 models about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 5 days ago • 1.26M • 1.2k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 4.63M • 10.5k

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 9 days ago • 586k • 1.01k

reacted to MoritzLaurer's post with 🔥 about 2 months ago

Post

1723

The TRL v0.13 release is 🔥! My highlights are the new process reward trainer, for training models similar to o1, and tool call support:

🧠 Process reward trainer: Enables training of Process-supervised Reward Models (PRMs), which reward the quality of intermediate steps, promoting structured reasoning. Perfect for tasks like stepwise reasoning.

🔀 Model merging: A new callback leverages mergekit to merge models during training, improving performance by blending reference and policy models - optionally pushing merged models to the Hugging Face Hub.

🛠️ Tool call support: TRL preprocessing now supports tool integration, laying the groundwork for agent fine-tuning with examples like dynamic temperature fetching in prompts.

⚖️ Mixture of judges: The new AllTrueJudge combines decisions from multiple binary judges for more nuanced evaluation.

Read the release notes and other resources here 👇
Release: https://github.com/huggingface/trl/releases/tag/v0.13.0
Mergekit: https://github.com/arcee-ai/mergekit
Mixture of judges paper: The Perfect Blend: Redefining RLHF with Mixture of Judges (2409.20370)
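
The "mixture of judges" item in the post describes combining several binary judges into a single decision. A minimal plain-Python sketch of that all-true rule might look like the following. It does not use TRL and is not the AllTrueJudge API; the judge functions and the `all_true` helper are hypothetical names chosen for illustration.

```python
from typing import Callable, Sequence

# A binary judge takes a prompt and a completion and returns pass/fail.
# This signature is an assumption for illustration, not TRL's interface.
BinaryJudge = Callable[[str, str], bool]


def all_true(judges: Sequence[BinaryJudge], prompt: str, completion: str) -> bool:
    """Accept the completion only if every judge accepts it."""
    return all(judge(prompt, completion) for judge in judges)


def is_nonempty(prompt: str, completion: str) -> bool:
    # Hypothetical judge: reject blank completions.
    return bool(completion.strip())


def is_short(prompt: str, completion: str) -> bool:
    # Hypothetical judge: reject completions longer than 200 characters.
    return len(completion) <= 200


judges = [is_nonempty, is_short]
accepted = all_true(judges, "What is 2 + 2?", "4")
rejected = all_true(judges, "What is 2 + 2?", "   ")
```

Here `accepted` is true because both judges pass, while `rejected` is false because a single failing judge vetoes the completion.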

liked 2 models about 2 months ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 1 day ago • 1.29M • 3.47k

kudzueye/boreal-flux-dev-v2

Text-to-Image • Updated Sep 5, 2024 • 44.8k • 147