John Smith's picture

John Smith PRO

John6666

AI & ML interests

None yet

Recent Activity

updated a collection about 1 hour ago
Spaces for Image-to-Image / Video
liked a Space about 1 hour ago
innoai/TRELLIS
updated a collection about 1 hour ago
Spaces for Image Upscaler / Upsampler / Resizer
View all activity

Organizations

open/ acc's profile picture FashionStash Group meeting's profile picture

John6666's activity

reacted to AlexBodner's post with ๐Ÿ‘€ about 2 hours ago
view post
Post
99

๐Ÿš€๐Ÿค–๐ƒ๐จ ๐€๐ง๐๐ซ๐จ๐ข๐๐ฌ ๐ƒ๐ซ๐ž๐š๐ฆ ๐จ๐Ÿ ๐„๐ฅ๐ž๐œ๐ญ๐ซ๐ข๐œ ๐Œ๐š๐ซ๐ข๐จ๐ฌ?

Discover how we replaced the classic game engine with DIAMOND, a Neural Network that predicts every frame based on actions, noise, and past states. From training on human and RL gameplay to generating surreal hallucinations, this project shows the potential of diffusion models in creating amazing simulations. ๐ŸŽฎ

๐Ÿงต Dive into the full story in our Twitter thread:
๐Ÿ‘‰ https://x.com/AlexBodner_/status/1871566560512643567
๐ŸŒŸ Donโ€™t forget to follow and leave a star for more groundbreaking AI projects!
reacted to hba123's post with ๐Ÿš€ about 2 hours ago
view post
Post
190
Blindly applying algorithms without understanding the math behind them is not a good idea frmpv. So, I am on a quest to fix this!

I wrote my first hugging face article on how you would derive closed-form solutions for KL-regularised reinforcement learning problems - what is used for DPO.


Check it out: https://huggingface.co/blog/hba123/derivingdpo
liked a Space about 2 hours ago