File size: 4,226 Bytes
8e1408a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
---
pipeline_tag: text-to-image
---

![Kalpana Grid](./kalpana_grid.jpg)

## Model Card for `enhanceaiteam/Kalpana`

`Kalpana` is a powerful text-to-image generation model developed by EnhanceAI. It uses a 20 billion parameter architecture to produce high-quality, detailed images based on textual descriptions. This model is ideal for artists, designers, and creative professionals seeking to generate visually compelling and accurate images.

### Model Details

- **Version**: 1.0
- **Base Model**: EnhanceAI's proprietary image generation architecture
- **Training Data**: Trained on a diverse dataset of high-resolution, copyright-free images, covering a broad range of visual styles and subjects.
- **Parameter Count**: 20 billion
- **Pipeline**: Stable Diffusion XL Pipeline

### Key Features

1. **High-Resolution Output**: Generates images at 1024x1024 resolution.
2. **Enhanced Prompt Understanding**: Improved comprehension of complex and detailed text prompts.
3. **Versatile Style Generation**: Capable of producing images in various artistic and realistic styles.
4. **High-Quality Images**: Delivers visually stunning and detailed images.
5. **Open Weights**: Available for scientific research and artistic projects.

### Usage

To use `Kalpana`, you can integrate it with the `diffusers` library. Below is an example of how to generate images:

```python
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained("enhanceaiteam/kalpana", torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload()

prompt = "A cat holding a sign that says hello world"
out = pipe(
    prompt=prompt,
    guidance_scale=0.,
    height=768,
    width=1360,
    num_inference_steps=4,
    max_sequence_length=256,
).images[0]
out.save("image.png")
```


### Prompts

When creating prompts, be as descriptive as possible to achieve the desired results. Detailed descriptions help the model generate more accurate and relevant images.

### Image Samples

Here are some example images generated by `Kalpana`:

- ![Example Image 1](https://enhanceai.s3.amazonaws.com/f735ed72-5c34-425e-8628-58527053ca81_3.png)
- ![Example Image 2](https://enhanceai.s3.amazonaws.com/066df381-0429-4445-a4a6-7e098718058e_1.png)
- ![Example Image 3](https://enhanceai.s3.amazonaws.com/bffeebfa-7bff-4377-8cd2-1705a6ef92a9_1.png)
- ![Example Image 4](https://enhanceai.s3.amazonaws.com/0a81bcd2-a640-45fa-8fba-4db282e3ac4a_1.png)
- ![Example Image 5](https://enhanceai.s3.amazonaws.com/915a1e2a-32ed-45a8-bccf-d4ad6abe81d9_1.png)
- ![Example Image 6](https://enhanceai.s3.amazonaws.com/e0723a97-a425-492a-8387-8e735084d718_1.png)

### Limitations

- This model is not intended or able to provide factual information.
- As a statistical model, this checkpoint might amplify existing societal biases.
- The model may fail to generate output that matches the prompts.
- Prompt following is heavily influenced by the prompting style.

### Out-of-Scope Use

The model and its derivatives may not be used:

- In any way that violates any applicable national, federal, state, local, or international law or regulation.
- For the purpose of exploiting, harming, or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
- To generate or disseminate verifiably false information and/or content with the purpose of harming others.
- To generate or disseminate personally identifiable information that can be used to harm an individual.
- To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
- To create non-consensual nudity or illegal pornographic content.
- For fully automated decision-making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
- Generating or facilitating large-scale disinformation campaigns.

### License

This model falls under the [FLUX.1 [dev] Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).

### Contact

For support and inquiries, please reach out to [enhanceai.art](https://enhanceai.art).

---

Happy creating!

- The EnhanceAI Team
- Pranav Ajay