Language-Image Collection by Arnoni Oct 15, 2023 - Salesforce/blip-image-captioning-large Image-to-Text • Updated Dec 7, 2023 • 998k • • 1.26k
MoAT (More Artificial Tokens) Allowing for the LM to learn a soft-"multi-step program" to predict future tokens instead of learning to predict future tokens itself. Collection by crumb Oct 16, 2023 -
Do not Hello world Collection by Sparr Oct 15, 2023 - Running on Zero 4.92k 👁 IllusionDiffusion Generate stunning high quality illusion artwork
Non of this Collection by Sparr Oct 15, 2023 - Running on Zero 4.92k 👁 IllusionDiffusion Generate stunning high quality illusion artwork
vision and language Collection by BakedFishcake Oct 15, 2023 - Octopus: Embodied Vision-Language Programmer from Environmental Feedback Paper • 2310.08588 • Published Oct 12, 2023 • 34 How Much Can CLIP Benefit Vision-and-Language Tasks? Paper • 2107.06383 • Published Jul 13, 2021
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Paper • 2310.08588 • Published Oct 12, 2023 • 34
Maquinas motoras Driving Collection by Leonisauro Oct 15, 2023 1 mistralai/Mistral-7B-v0.1 Text Generation • Updated Jul 24, 2024 • 2.53M • 3.53k
famos Collection by gauravtewari Oct 15, 2023 - argilla/tripadvisor-hotel-reviews Viewer • Updated Dec 7, 2022 • 20.5k • 414 • 5 Running 3 🏃 Youtube Video ChatBot
vid-gen Collection by ceyda Oct 15, 2023 - MotionDirector: Motion Customization of Text-to-Video Diffusion Models Paper • 2310.08465 • Published Oct 12, 2023 • 14
MotionDirector: Motion Customization of Text-to-Video Diffusion Models Paper • 2310.08465 • Published Oct 12, 2023 • 14