FLUX.1 Tools Collection: a suite of models designed to add control to the base T2I model FLUX.1, enabling the modification and re-creation of real and generated images • 12 items • Updated 2 days ago
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 6 days ago • 46
Post 3178 I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive! https://x.com/casper_hansen_/status/1875872309996855343 Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025! [1] Training Large Language Models to Reason in a Continuous Latent Space (2412.06769) [2] https://huggingface.co/blog/ganqu/prime
SigLIP Collection: Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated about 1 month ago • 50
Running on L40S 134 🧬🧪🧬 Protein Genesis v2 AI service generating protein structures from user text