dataautogpt3 commited on
Commit
ecd32b5
1 Parent(s): 9b9a246

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +155 -3
README.md CHANGED
@@ -1,3 +1,155 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: text-to-image
3
+ widget:
4
+ - text: >-
5
+ black fluffy gorgeous dangerous cat animal creature, large orange eyes, big
6
+ fluffy ears, piercing gaze, full moon, dark ambiance, best quality,
7
+ extremely detailed
8
+ output:
9
+ url: assets/final_output_00875_.png
10
+ - text: >-
11
+ (impressionistic realism by csybgh), a 50 something male, working in
12
+ banking, very short dyed dark curly balding hair, Afro-Asiatic ancestry,
13
+ talks a lot but listens poorly, stuck in the past, wearing a suit, he has a
14
+ certain charm, bronze skintone, sitting in a bar at night, he is smoking and
15
+ feeling cool, drunk on plum wine, masterpiece, 8k, hyper detailed, smokey
16
+ ambiance, perfect hands AND fingers
17
+ output:
18
+ url: assets/final_output_00886_.png
19
+ - text: >-
20
+ high quality pixel art, a pixel art silhouette of an anime space-themed girl
21
+ in a space-punk steampunk style, lying in her bed by the window of a
22
+ spaceship, smoking, with a rustic feel. The image should embody epic
23
+ portraiture and double exposure, featuring an isolated landscape visible
24
+ through the window. The colors should primarily be dynamic and
25
+ action-packed, with a strong use of negative space. The entire artwork
26
+ should be in pixel art style, emphasizing the characters shape and set
27
+ against a white background. Silhouette
28
+ output:
29
+ url: assets/final_output_00871_.png
30
+ - text: >-
31
+ The image features an older man, a long white beard and mustache, He has a
32
+ stern expression, giving the impression of a wise and experienced
33
+ individual. The mans beard and mustache are prominent, adding to his
34
+ distinguished appearance. The close-up shot of the mans face emphasizes his
35
+ facial features and the intensity of his gaze.
36
+ output:
37
+ url: assets/final_output_00895_.png
38
+ - text: >-
39
+ Super Closeup Portrait, action shot, Profoundly dark whitish meadow, glass
40
+ flowers, Stains, space grunge style, Jeanne d'Arc wearing White Olive green
41
+ used styled Cotton frock, Wielding thin silver sword, Sci-fi vibe, dirty,
42
+ noisy, Vintage monk style, very detailed, hd
43
+ output:
44
+ url: assets/final_output_00902_.png
45
+ - text: >-
46
+ cinematic film still of Kodak Motion Picture Film: (Sharp Detailed Image) An
47
+ Oscar winning movie for Best Cinematography a woman in a kimono standing on
48
+ a subway train in Japan Kodak Motion Picture Film Style, shallow depth of
49
+ field, vignette, highly detailed, high budget, bokeh, cinemascope, moody,
50
+ epic, gorgeous, film grain, grainy
51
+ output:
52
+ url: assets/final_output_00906_.png
53
+ - text: >-
54
+ 1980s anime portrait of a character
55
+ output:
56
+ url: assets/final_output_00916_.png
57
+ - text: (("Proteus"):text_logo:1)
58
+ output:
59
+ url: assets/final_output_00923_.png
60
+ license: apache-2.0
61
+ ---
62
+ <Gallery />
63
+
64
+ # Prometheus
65
+
66
+ Prometheus is presumed to be the first full rank finetune of Playground v2.5, developed by the creator of the Proteus model. This text-to-image generation model has been specifically adapted to enhance accessibility for the open-source community.
67
+
68
+ # Key Features and Considerations
69
+
70
+ Presumed First Full Rank Finetune of Playground v2.5:
71
+
72
+ Complete parameter update of Playground v2.5 architecture
73
+ Unique approach to fine-tuning this particular base model
74
+
75
+
76
+ # Enhanced Accessibility:
77
+
78
+ Custom sampling methods have been removed through brute force techniques
79
+ Designed to be more compatible with standard open-source tools and workflows
80
+
81
+
82
+ # Output Characteristics:
83
+
84
+ Aims to provide a balance between consistency and diversity in outputs
85
+ May exhibit some stylistic tendencies inherited from the training process
86
+
87
+
88
+ # Training Approach:
89
+
90
+ Utilizes the extensive Proteus datasets
91
+ Focused on maintaining model capabilities while increasing compatibility
92
+
93
+
94
+ # Custom CLIP Integration:
95
+
96
+ Incorporation of a specially trained CLIP model
97
+ Requires a clip skip setting of 2 for optimal performance
98
+
99
+
100
+
101
+ # About Prometheus
102
+
103
+ Prometheus represents an effort to make advanced text-to-image generation more accessible to the open-source community. Built upon the Playground v2.5 architecture, it has undergone a full rank finetune using the Proteus datasets. A key aspect of its development was the removal of custom sampling methods through brute force techniques, allowing the model to work more seamlessly with standard open-source tools and pipelines.
104
+ This approach aims to balance the model's performance capabilities with wider compatibility and ease of use. Users can expect outputs that reflect the model's training on Proteus datasets
105
+
106
+ # Recommended Settings
107
+
108
+ Clip Skip: 2
109
+ CFG Scale: 7
110
+ Steps: 25 - 50
111
+ Sampler: DPM++ 2M SDE
112
+ Scheduler: Karras
113
+ Resolution: 1024x1024
114
+
115
+ # Use it with 🧨 diffusers
116
+ ```python
117
+ import torch
118
+ from diffusers import (
119
+ StableDiffusionXLPipeline,
120
+ KDPM2AncestralDiscreteScheduler,
121
+ AutoencoderKL
122
+ )
123
+
124
+ # Load VAE component
125
+ vae = AutoencoderKL.from_pretrained(
126
+ "madebyollin/sdxl-vae-fp16-fix",
127
+ torch_dtype=torch.float16
128
+ )
129
+
130
+ # Configure the pipeline
131
+ pipe = StableDiffusionXLPipeline.from_pretrained(
132
+ "dataautogpt3/PrometheusV1",
133
+ vae=vae,
134
+ torch_dtype=torch.float16
135
+ )
136
+ pipe.scheduler = KDPM2AncestralDiscreteScheduler.from_config(pipe.scheduler.config)
137
+ pipe.to('cuda')
138
+
139
+ # Define prompts and generate image
140
+ prompt = "a cat wearing sunglasses on the beach"
141
+ negative_prompt = ""
142
+
143
+ image = pipe(
144
+ prompt,
145
+ negative_prompt=negative_prompt,
146
+ width=1024,
147
+ height=1024,
148
+ guidance_scale=7,
149
+ num_inference_steps=50,
150
+ clip_skip=2
151
+ ).images[0]
152
+
153
+
154
+ image.save("generated_image.png")
155
+ ```