Spaces:
Running
on
Zero
Running
on
Zero
Update README.md
Browse files
README.md
CHANGED
@@ -10,5 +10,99 @@ pinned: false
|
|
10 |
license: mit
|
11 |
short_description: input text, a video from the past to the future
|
12 |
---
|
|
|
13 |
|
14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
license: mit
|
11 |
short_description: input text, a video from the past to the future
|
12 |
---
|
13 |
+
Looking at this code, it's a Gradio-based application that generates interpolated images between two concepts using CLIP-guided diffusion with the FLUX model. Let me explain the key components and functionality:
|
14 |
|
15 |
+
## English Explanation
|
16 |
+
|
17 |
+
### Overview
|
18 |
+
This application creates a "Time Stream" effect by generating a series of images that smoothly transition between two different states or concepts. For example, it can show the progression from a "fresh" tomato to a "rotten" one, creating a time-lapse-like visualization.
|
19 |
+
|
20 |
+
### Key Features
|
21 |
+
|
22 |
+
1. **CLIP-Guided Image Generation**
|
23 |
+
- Uses FLUX.1-schnell model with CLIP guidance
|
24 |
+
- Finds latent directions between two concepts using CLIP embeddings
|
25 |
+
- Generates intermediate images along this direction
|
26 |
+
|
27 |
+
2. **Main Components**
|
28 |
+
- **Prompt**: The base description of what to generate
|
29 |
+
- **1st/2nd Direction**: Two states to interpolate between (e.g., "Fresh" β "Rotten")
|
30 |
+
- **Strength**: Controls how extreme the transformation is
|
31 |
+
- **Output**: Creates both an image strip and a looping video
|
32 |
+
|
33 |
+
3. **Advanced Options**
|
34 |
+
- Number of intermediate images (3-65)
|
35 |
+
- CLIP direction iterations (0-400)
|
36 |
+
- Inference steps (1-4)
|
37 |
+
- Guidance scale (0.1-10.0)
|
38 |
+
- Seed control for reproducibility
|
39 |
+
|
40 |
+
4. **Output Formats**
|
41 |
+
- Individual generated images
|
42 |
+
- Image strip showing all transitions
|
43 |
+
- Looping video of the transformation
|
44 |
+
- Interactive slider to view specific frames
|
45 |
+
|
46 |
+
### Technical Implementation
|
47 |
+
- Uses `spaces.GPU` decorator for GPU acceleration
|
48 |
+
- Implements AutoencoderTiny for faster processing
|
49 |
+
- Handles Korean text detection (though warns it's used directly without translation)
|
50 |
+
- Saves images with unique UUID filenames
|
51 |
+
|
52 |
+
### Example Use Cases
|
53 |
+
- Showing decay/aging processes
|
54 |
+
- Seasonal changes
|
55 |
+
- Weather transitions
|
56 |
+
- Urban development/deterioration
|
57 |
+
- Any temporal transformation
|
58 |
+
|
59 |
+
---
|
60 |
+
|
61 |
+
## νκΈ μ€λͺ
|
62 |
+
|
63 |
+
### κ°μ
|
64 |
+
μ΄ μ ν리μΌμ΄μ
μ λ κ°μ§ λ€λ₯Έ μνλ κ°λ
μ¬μ΄λ₯Ό λΆλλ½κ² μ ννλ μΌλ ¨μ μ΄λ―Έμ§λ₯Ό μμ±νμ¬ "μκ°μ νλ¦(Time Stream)" ν¨κ³Όλ₯Ό λ§λλλ€. μλ₯Ό λ€μ΄, "μ μ ν" ν λ§ν μμ "μ©μ" ν λ§ν λ‘μ λ³ν κ³Όμ μ 보μ¬μ£Όλ μκ° κ²½κ³Ό μκ°νλ₯Ό μμ±ν μ μμ΅λλ€.
|
65 |
+
|
66 |
+
### μ£Όμ κΈ°λ₯
|
67 |
+
|
68 |
+
1. **CLIP κ°μ΄λ μ΄λ―Έμ§ μμ±**
|
69 |
+
- CLIP κ°μ΄λμ€μ ν¨κ» FLUX.1-schnell λͺ¨λΈ μ¬μ©
|
70 |
+
- CLIP μλ² λ©μ μ¬μ©νμ¬ λ κ°λ
μ¬μ΄μ μ μ¬ λ°©ν₯ μ°ΎκΈ°
|
71 |
+
- μ΄ λ°©ν₯μ λ°λΌ μ€κ° μ΄λ―Έμ§λ€μ μμ±
|
72 |
+
|
73 |
+
2. **μ£Όμ κ΅¬μ± μμ**
|
74 |
+
- **ν둬ννΈ**: μμ±ν λμμ κΈ°λ³Έ μ€λͺ
|
75 |
+
- **1μ°¨/2μ°¨ λ°©ν₯**: 보κ°ν λ κ°μ§ μν (μ: "μ μ ν" β "μ©μ")
|
76 |
+
- **κ°λ**: λ³νμ κ·Ήλ¨μ±μ μ μ΄
|
77 |
+
- **μΆλ ₯**: μ΄λ―Έμ§ μ€νΈλ¦½κ³Ό 루ν λΉλμ€ λͺ¨λ μμ±
|
78 |
+
|
79 |
+
3. **κ³ κΈ μ΅μ
**
|
80 |
+
- μ€κ° μ΄λ―Έμ§ μ (3-65κ°)
|
81 |
+
- CLIP λ°©ν₯ λ°λ³΅ νμ (0-400ν)
|
82 |
+
- μΆλ‘ λ¨κ³ (1-4λ¨κ³)
|
83 |
+
- κ°μ΄λμ€ μ€μΌμΌ (0.1-10.0)
|
84 |
+
- μ¬νμ±μ μν μλ μ μ΄
|
85 |
+
|
86 |
+
4. **μΆλ ₯ νμ**
|
87 |
+
- κ°λ³ μμ± μ΄λ―Έμ§
|
88 |
+
- λͺ¨λ μ νμ 보μ¬μ£Όλ μ΄λ―Έμ§ μ€νΈλ¦½
|
89 |
+
- λ³ν κ³Όμ μ 루ν λΉλμ€
|
90 |
+
- νΉμ νλ μμ λ³Ό μ μλ μΈν°λν°λΈ μ¬λΌμ΄λ
|
91 |
+
|
92 |
+
### κΈ°μ μ ꡬν
|
93 |
+
- GPU κ°μμ μν `spaces.GPU` λ°μ½λ μ΄ν° μ¬μ©
|
94 |
+
- λΉ λ₯Έ μ²λ¦¬λ₯Ό μν AutoencoderTiny ꡬν
|
95 |
+
- νκΈ ν
μ€νΈ κ°μ§ μ²λ¦¬ (λ²μ μμ΄ μ§μ μ¬μ©λλ€λ κ²½κ³ νμ)
|
96 |
+
- κ³ μ ν UUID νμΌλͺ
μΌλ‘ μ΄λ―Έμ§ μ μ₯
|
97 |
+
|
98 |
+
### μ¬μ© μμ
|
99 |
+
- λΆν¨/λ
Έν κ³Όμ νν
|
100 |
+
- κ³μ λ³ν
|
101 |
+
- λ μ¨ μ ν
|
102 |
+
- λμ κ°λ°/μ ν΄
|
103 |
+
- λͺ¨λ μκ°μ λ³ν
|
104 |
+
|
105 |
+
### μ°Έκ³ μ¬ν
|
106 |
+
- νκΈ μ
λ ₯μ μ§μλμ§λ§ λͺ¨λΈμ΄ μμ΄μ μ΅μ νλμ΄ μμ΄ κ²°κ³Όκ° μ νμ μΌ μ μμ
|
107 |
+
- κ°λ(Strength) κ°μ΄ 2.5 μ΄μμΌ κ²½μ° λΆμμ ν μ μμ
|
108 |
+
- μ€κ° μ΄λ―Έμ§ μκ° λ§μμλ‘ λ λΆλλ¬μ΄ μ ν ν¨κ³Όλ₯Ό μ»μ μ μμ
|