- VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
  Paper • 2309.00398 • Published • 21
- AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
  Paper • 2307.04725 • Published • 64
- LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
  Paper • 2307.00522 • Published • 32
- VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
  Paper • 2309.15091 • Published • 32
Collections
Discover the best community collections!
Collections trending this week
- SLiMe: Segment Like Me
  Paper • 2309.03179 • Published • 30
- Follow Anything: Open-set detection, tracking, and following in real-time
  Paper • 2308.05737 • Published • 12
- Semantic-SAM: Segment and Recognize Anything at Any Granularity
  Paper • 2307.04767 • Published • 21
- Fast Segment Anything
  Paper • 2306.12156 • Published • 34

- ProPainter: Improving Propagation and Transformer for Video Inpainting
  Paper • 2309.03897 • Published • 26
- Text2Layer: Layered Image Generation using Latent Diffusion Model
  Paper • 2307.09781 • Published • 15
- Generate Anything Anywhere in Any Scene
  Paper • 2306.17154 • Published • 22
- LRM: Large Reconstruction Model for Single Image to 3D
  Paper • 2311.04400 • Published • 48

- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 75
- One Wide Feedforward is All You Need
  Paper • 2309.01826 • Published • 32
- Self-Alignment with Instruction Backtranslation
  Paper • 2308.06259 • Published • 42
- Shepherd: A Critic for Language Model Generation
  Paper • 2308.04592 • Published • 31

- SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
  Paper • 2309.03453 • Published • 12
- TeCH: Text-guided Reconstruction of Lifelike Clothed Humans
  Paper • 2308.08545 • Published • 34
- Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
  Paper • 2311.06214 • Published • 31

- DunnBC22/trocr-base-printed-synthetic_dataset_ocr
  Image-to-Text • Updated • 22 • 1
- DunnBC22/trocr-base-handwritten-OCR-handwriting_recognition_v2
  Image-to-Text • Updated • 466 • 14
- DunnBC22/trocr-base-printed_captcha_ocr
  Image-to-Text • Updated • 247 • 6
- DunnBC22/trocr-large-printed-cmc7_tesseract_MICR_ocr
  Image-to-Text • Updated • 84 • 4