1 2 2

Joshua M. Susskind

jsusskind

AI & ML interests

Generative models, interactive machine learning, understanding ML

Recent Activity

upvoted a collection about 1 month ago

AIMv2

liked a model about 1 month ago

apple/aimv2-large-patch14-224

authored a paper about 1 month ago

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

View all activity

Organizations

jsusskind's activity

upvoted a collection about 1 month ago

AIMv2

Collection

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22 • 67

liked a model about 1 month ago

apple/aimv2-large-patch14-224

Image Feature Extraction • Updated 27 days ago • 3.93k • 38

authored 11 papers about 1 month ago

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Paper • 2401.15914 • Published Jan 29 • 7

How Far Are We from Intelligent Visual Deductive Reasoning?

Paper • 2403.04732 • Published Mar 7 • 19

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion

Paper • 2302.10109 • Published Feb 20, 2023

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Paper • 2410.08159 • Published Oct 10 • 25

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21 • 43

authored a paper 7 months ago

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling

Paper • 2405.21048 • Published May 31 • 13

upvoted a collection 11 months ago

AIM

Collection

AIM: Autoregressive Image Models • 5 items • Updated Oct 29 • 48

liked a model 11 months ago

apple/AIM

Image Classification • Updated Jan 22 • 87

authored 4 papers 11 months ago

When can transformers reason with abstract symbols?

Paper • 2310.09753 • Published Oct 15, 2023 • 2

Position Prediction as an Effective Pretraining Strategy

Paper • 2207.07611 • Published Jul 15, 2022 • 1

Generating Molecular Conformer Fields

Paper • 2311.17932 • Published Nov 27, 2023

Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16 • 36