--- tags: - text-to-image - lora - template:diffusion-lora widget: - text: >- steamboat willie style, golden era animation, a stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colorful lights. Many pedestrians walk about. parameters: negative_prompt: >- 色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走 output: url: videos/t2v-1.webp - text: >- steamboat willie style, golden era animation, close-up of a short fluffy monster kneeling beside a melting red candle. the mood is one of wonder and curiosity, as the monster gazes at the flame with wide eyes and open mouth. Its pose and expression convey a sense of innocence and playfulness, as if it is exploring the world around it for the first time. The use of warm colors and dramatic lighting further enhances the cozy atmosphere of the image. parameters: negative_prompt: >- 色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走 output: url: videos/t2v-2.webp base_model: Wan-AI/Wan2.1-T2V-14B instance_prompt: steamboat willie style, golden era animation license: cc0-1.0 pipeline_tag: text-to-video library_name: diffusers --- # Steamboat Willie LoRA ## Model Description Trained on clips from [Steamboat Willie](https://archive.org/details/steamboat-willie-mickey), split by scene and captioned using [Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct). Also available for [Wan2.1-T2V-1.3B](https://huggingface.co/benjamin-paine/steamboat-willie-1.3b). Additionally hosted [on CivitAI](https://civitai.com/models/1357058?modelVersionId=1532988). ## Trigger Words The model was trained with the trigger phrase "steamboat willie style". I find best results from using this trigger phrase combined with "golden era animation". ## Using with Diffusers ```py pip install git+https://github.com/huggingface/diffusers.git ``` ```py import torch from diffusers.utils import export_to_video from diffusers import AutoencoderKLWan, WanPipeline from diffusers.schedulers.scheduling_unipc_multistep import UniPCMultistepScheduler model_id = "Wan-AI/Wan2.1-T2V-14B-Diffusers" vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32) pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16) pipe.scheduler = UniPCMultistepScheduler.from_config( pipe.scheduler.config, flow_shift=5.0 ) pipe.to("cuda") pipe.load_lora_weights("benjamin-paine/steamboat-willie-14b") pipe.enable_model_cpu_offload() # for low-vram environments prompt = "steamboat willie style, golden era animation, an anthropomorphic cat character wearing a hat removes it and performs a courteous bow" negative_prompt = "色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走" output = pipe( prompt=prompt, negative_prompt=negative_prompt, height=720, width=1280, num_frames=81, guidance_scale=5.0, num_inference_steps=32 ).frames[0] export_to_video(output, "output.mp4", fps=16) ``` ## Download Model Weights for this model are available in Safetensors format. [Download](/benjamin-paine/steamboat-willie-14b/tree/main) them in the Files & versions tab.