Jian Ren's picture

5 15 3

Jian Ren

alanspike

·

https://alanspike.github.io/

alanspike

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Wonderland: Navigating 3D Scenes from a Single Image

upvoted a paper 13 days ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

commented a paper 13 days ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

View all activity

Organizations

alanspike's activity

upvoted a paper 9 days ago

Wonderland: Navigating 3D Scenes from a Single Image

Paper • 2412.12091 • Published 9 days ago • 14

upvoted a paper 13 days ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published 13 days ago • 20

commented a paper 13 days ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published 13 days ago • 20 •

upvoted a paper 5 months ago

Efficient Training with Denoised Neural Weights

Paper • 2407.11966 • Published Jul 16 • 8

authored 3 papers 7 months ago

SF-V: Single Forward Video Generation Model

Paper • 2406.04324 • Published Jun 6 • 23

BitsFusion: 1.99 bits Weight Quantization of Diffusion Model

Paper • 2406.04333 • Published Jun 6 • 36

SINE: SINgle Image Editing with Text-to-Image Diffusion Models

Paper • 2212.04489 • Published Dec 8, 2022

upvoted 2 papers 7 months ago

BitsFusion: 1.99 bits Weight Quantization of Diffusion Model

Paper • 2406.04333 • Published Jun 6 • 36

SF-V: Single Forward Video Generation Model

Paper • 2406.04324 • Published Jun 6 • 23

authored a paper 9 months ago

TextCraftor: Your Text Encoder Can be Image Quality Controller

Paper • 2403.18978 • Published Mar 27 • 13

upvoted a paper 9 months ago

TextCraftor: Your Text Encoder Can be Image Quality Controller

Paper • 2403.18978 • Published Mar 27 • 13

authored 8 papers 10 months ago

EfficientFormer: Vision Transformers at MobileNet Speed

Paper • 2206.01191 • Published Jun 2, 2022 • 1

Motion Representations for Articulated Animation

Paper • 2104.11280 • Published Apr 22, 2021

COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models

Paper • 2305.17235 • Published May 26, 2023 • 2

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation

Paper • 2206.07771 • Published Jun 15, 2022

Real-Time Neural Light Field on Mobile Devices

Paper • 2212.08057 • Published Dec 15, 2022

iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis

Paper • 2310.16167 • Published Oct 24, 2023 • 1

SPAD : Spatially Aware Multiview Diffusers

Paper • 2402.05235 • Published Feb 7 • 3

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Paper • 2402.19479 • Published Feb 29 • 32

upvoted a paper 10 months ago

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Paper • 2402.19479 • Published Feb 29 • 32