Commit c942c40 by ginipick · verified · 1 Parent(s): a3c8358

Update README.md

  1. README.md +112 -0
README.md CHANGED
@@ -15,3 +15,115 @@ models:
  - vrgamedevgirl84/Wan14BT2VFusioniX
  - Kijai/WanVideo_comfy
  ---
+ ## English Explanation
+
+ ### Overview
+ **VEO3 Free** is an AI video generation application that combines the Wan2.1-T2V-14B model with automatic audio generation. It creates videos from text descriptions and generates matching audio with MMAudio.
+
+ ### Key Features
+
+ 1. **Text-to-Video Generation**
+    - Uses the Wan2.1-T2V-14B diffusion model (14 billion parameters)
+    - Fast 4-step generation with NAG (Normalized Attention Guidance)
+    - Supports resolutions from 128x128 to 896x896
+    - Duration: 1-8 seconds at 16 FPS
+    - Cinema-quality output with professional camera movements
+
+ 2. **Automatic Audio Generation**
+    - MMAudio integration for synchronized sound effects
+    - Uses the same text prompt for both video and audio
+    - Configurable audio quality and guidance strength
+    - Optional: can be disabled when not needed
+
+ 3. **Advanced Controls**
+    - **NAG Scale**: Controls guidance strength (1.0-20.0)
+    - **Inference Steps**: Trades quality against speed (1-8 steps)
+    - **Seed Control**: For reproducible results
+    - **Negative Prompts**: Specify what to avoid in generation
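
The documented ranges above can be captured in a small settings object. The following is a minimal sketch, not the application's code: the class name `GenerationSettings`, its field names, and every default value are assumptions made for illustration. Only the ranges (128-896 pixels, 1-8 seconds at 16 FPS, 1-8 steps, NAG scale 1.0-20.0) come from the feature list.

```python
from dataclasses import dataclass

FPS = 16  # frame rate stated in the feature list


@dataclass
class GenerationSettings:
    """Hypothetical settings container; not the app's actual API."""

    prompt: str
    negative_prompt: str = ""
    width: int = 512                # placeholder default
    height: int = 512               # placeholder default
    duration_seconds: float = 4.0   # placeholder default
    steps: int = 4                  # the "fast 4-step" setting
    nag_scale: float = 10.0         # placeholder default
    seed: int = 0

    def validate(self) -> None:
        """Raise ValueError if a value is outside its documented range."""
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")
        ranges = [
            ("width", self.width, 128, 896),
            ("height", self.height, 128, 896),
            ("duration_seconds", self.duration_seconds, 1, 8),
            ("steps", self.steps, 1, 8),
            ("nag_scale", self.nag_scale, 1.0, 20.0),
        ]
        for name, value, low, high in ranges:
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside [{low}, {high}]")

    def nominal_frame_count(self) -> int:
        """Frames implied by duration x FPS (the real app may round differently)."""
        return round(self.duration_seconds * FPS)
```

Under these assumptions, a 4-second clip corresponds to a nominal 64 frames, and a request for 12 inference steps would be rejected before any GPU time is spent.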
+
+ ### How It Works
+ 1. **Input**: Enter a detailed scene description
+ 2. **Video Generation**: The model creates video frames based on your prompt
+ 3. **Audio Synthesis**: Matching sound effects are generated automatically
+ 4. **Output**: The video is combined with the synchronized audio
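
The four steps above can be sketched as a small orchestration function. This is an illustration only: `run_pipeline` and its parameters are hypothetical names, and the three stage callables stand in for the video model, the audio model, and the step that merges the two. The README does not describe how the app implements these stages.

```python
from typing import Callable


def run_pipeline(
    prompt: str,
    generate_video: Callable[[str], str],
    generate_audio: Callable[[str, str], str],
    mux: Callable[[str, str], str],
    enable_audio: bool = True,
) -> str:
    """Run the text -> video -> audio -> combined output flow.

    Each stage is injected as a callable that returns a file path, so the
    sketch stays independent of any specific model or library.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")

    # Step 2: create the silent video from the prompt.
    video_path = generate_video(prompt)

    # Audio is optional, so return the silent video when it is disabled.
    if not enable_audio:
        return video_path

    # Step 3: the same prompt drives audio synthesis for the finished video.
    audio_path = generate_audio(prompt, video_path)

    # Step 4: combine video and audio into one output file.
    return mux(video_path, audio_path)
```

Passing the stages in as callables keeps the control flow testable without a GPU, since stub functions can replace the real models.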
+
+ ### Example Use Cases
+ - Film previews and concept visualization
+ - Music video creation
+ - Advertising content
+ - Creative storytelling
+ - Game cinematics
+
+ ### Technical Details
+ - **GPU Acceleration**: Uses CUDA for fast processing
+ - **Model Architecture**: Transformer-based diffusion model
+ - **Audio Model**: Flow-matching-based audio synthesis
+ - **Processing Time**: ~30-70 seconds depending on settings
+
+ ### Tips for Best Results
+ - Use detailed, cinematic descriptions
+ - Include camera movements and visual style
+ - Specify lighting, colors, and atmosphere
+ - Add sound descriptions for better audio matching
+ - A higher NAG scale gives stronger prompt adherence
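
One way to apply these tips consistently is to assemble the prompt from named parts. The helper below is a hypothetical convenience, not part of the application. The function name, its parameters, and the sentence-joining format are all assumptions.

```python
def build_prompt(scene: str, camera: str = "", style: str = "",
                 lighting: str = "", sound: str = "") -> str:
    """Join the scene with optional camera, style, lighting, and sound notes.

    Empty parts are skipped, and each remaining part becomes one sentence.
    """
    if not scene.strip():
        raise ValueError("scene description must not be empty")
    parts = [scene, camera, style, lighting, sound]
    cleaned = [p.strip().rstrip(".") for p in parts if p.strip()]
    return ". ".join(cleaned) + "."


prompt = build_prompt(
    "A lighthouse on a cliff at dusk",
    camera="Slow aerial orbit",
    lighting="Warm golden light, deep blue shadows",
    sound="Waves crashing and distant gulls",
)
```

Because the same prompt drives both video and audio, keeping the sound description in the same string is what lets the audio step pick it up.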
+
+ ---
+
+ ## Korean Explanation
+
+ ### Overview
+ **VEO3 Free** is an advanced AI video generation system that combines the Wan2.1-T2V-14B model with automatic audio generation. It generates videos from text descriptions and uses MMAudio to automatically generate matching audio.
+
+ ### Key Features
+
+ 1. **Text-to-Video Conversion**
+    - Uses the Wan2.1-T2V-14B diffusion model (14 billion parameters)
+    - Fast 4-step generation with NAG (Normalized Attention Guidance)
+    - Supports resolutions from 128x128 to 896x896
+    - Duration: 1-8 seconds at 16 FPS
+    - Cinema-quality output with professional camera movements
+
+ 2. **Automatic Audio Generation**
+    - MMAudio integration for synchronized sound effects
+    - Uses the same text prompt for both video and audio
+    - Adjustable audio quality and guidance strength
+    - Optional: can be disabled when not needed
+
+ 3. **Advanced Controls**
+    - **NAG Scale**: Controls guidance strength (1.0-20.0)
+    - **Inference Steps**: Trades quality against speed (1-8 steps)
+    - **Seed Control**: Setting for reproducible results
+    - **Negative Prompts**: Specify elements to avoid in generation
+
+ ### How It Works
+ 1. **Input**: Enter a detailed scene description
+ 2. **Video Generation**: The AI generates video frames based on the prompt
+ 3. **Audio Synthesis**: Matching sound effects are generated automatically
+ 4. **Output**: A video with synchronized audio is produced
+
+ ### Use Cases
+ - Film previews and concept visualization
+ - Music video production
+ - Advertising content creation
+ - Creative storytelling
+ - Game cinematics
+
+ ### Technical Specifications
+ - **GPU Acceleration**: Uses CUDA for fast processing
+ - **Model Architecture**: Transformer-based diffusion model
+ - **Audio Model**: Flow-matching-based audio synthesis
+ - **Processing Time**: About 30-70 seconds depending on settings
+
+ ### Tips for Best Results
+ - Use detailed, cinematic descriptions
+ - Include camera movements and visual style
+ - Specify lighting, colors, and atmosphere
+ - Add sound descriptions for better audio matching
+ - A higher NAG scale gives generation that follows the prompt more closely
+
+ ### Special Features
+ - **Cinematic Prompt Examples**: Three examples that include professional filming techniques
+ - **Real-Time Progress Display**: Follow the generation process as it runs
+ - **One-Click Example Loading**: Clicking an example applies its settings automatically
+
+ This tool is designed to make it easy to create professional-level video content and is ideal for quickly visualizing creative ideas.