1 34 370

nDimensional

AI & ML interests

Computer Vision, Diffusers, Transformers, ML, NLP, Diffusion Models, Unsupervised Learning, JAX, Neural Networks

Recent Activity

liked a Space 7 days ago

John6666/joy-caption-pre-alpha-mod

liked a Space 13 days ago

John6666/danbooru-tags-transformer-v2-with-wd-tagger

liked a model 13 days ago

xinsir/controlnet-union-sdxl-1.0

View all activity

Organizations

None yet

nDimensional's activity

upvoted a paper 16 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 20 days ago • 105

upvoted a paper 19 days ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 20 days ago • 67

upvoted a paper about 1 month ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 72

upvoted 2 papers 3 months ago

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 22

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 47

upvoted a paper 4 months ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 52

upvoted 3 papers 5 months ago

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2, 2024 • 95

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 86

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 93

upvoted 2 papers 6 months ago

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5, 2024 • 61

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

Paper • 2407.16982 • Published Jul 24, 2024 • 41

upvoted a paper 7 months ago

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 83

upvoted a collection 7 months ago

OWL-series 🦉

Collection

Models and applications of OWL-ViT and OWLv2. • 13 items • Updated Mar 11, 2024 • 6

upvoted a collection 9 months ago

LLaVA-1.6

Collection

A collection of LLaVA-1.6 checkpoints • 4 items • Updated Jan 31, 2024 • 69

upvoted 4 papers 9 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

upvoted 2 papers 10 months ago

Taming Latent Diffusion Model for Neural Radiance Field Inpainting

Paper • 2404.09995 • Published Apr 15, 2024 • 7

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 83