-
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Paper β’ 2403.16422 β’ Published β’ 1 -
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper β’ 2403.02246 β’ Published β’ 1 -
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Paper β’ 2504.08591 β’ Published β’ 19 -
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text β’ Updated β’ 775 β’ 44
Sam Flin PRO
sflindrs
AI & ML interests
None yet
Recent Activity
commented on
a paper
about 5 hours ago
Alchemist: Turning Public Text-to-Image Data into Generative Gold
liked
a Space
1 day ago
fancyfeast/joy-caption-beta-one
liked
a model
1 day ago
XiaomiMiMo/MiMo-VL-7B-RL
Organizations
None yet
Collections
3
spaces
7
Runtime error
Mistral Pixtral Demo
π
Chat with Pixtral 12B using Mistral Inference
Running
on
Zero
2
Vlm Comparer
π’
Compare any two VLMs, side-by-side.
Running
on
Zero
1
Llava Next
π₯
Generate descriptions and answers about images
Paused
1
Granite Vision 3.1 2B
π
Chat with an image and text assistant
Runtime error
1
Molmo 7B D 0924
π
Upload an image and ask questions about it
Runtime error
Ertugrul Pixtral-12B-Captioner-Relaxed
π
models
1
datasets
0
None public yet