i3n7g3
/

SimPO-LoRA-Diffusion

Model card Files Files and versions Community

SimPO-LoRA-Diffusion / README.md

i3n7g3's picture

Update README.md

d93f5ef verified 5 months ago

|

history blame contribute delete

3.53 kB

	---
	datasets:
	- yuvalkirstain/pickapic_v2
	language:
	- en
	base_model:
	- stabilityai/stable-diffusion-2-1
	pipeline_tag: text-to-image
	library_name: diffusers
	---


	---
	# Ano-Face-Fair: Race-Fair Face Anonymization in Text-to-Image Synthesis using Simple Preference Optimization in Diffusion Model

	For detailed information, code, and documentation, please visit our GitHub repository:
	[Ano-Face-Fair](https://github.com/i3n7g3/Ano-Face-Fair)

	## Ano-Face-Fair

	![Ano-Face-Fair demo images](./assets/Fig1.png)

	## Model

	![overall_structure](./assets/Fig2.png)

	Ano-Face-Fair presents a novel approach to text-to-face synthesis using a Diffusion Model that considers Race Fairness. Our method uses facial segmentation masks to edit specific facial regions, and employs a Stable Diffusion v2 Inpainting model trained on a curated Asian dataset. We introduce two key losses: ℒ𝐹𝐹𝐸 (Focused Feature Enhancement Loss) to enhance performance with limited data, and ℒ𝑫𝑰𝑭𝑭 (Difference Loss) to address catastrophic forgetting. Finally, we apply Simple Preference Optimization (SimPO) for refined and enhanced image generation.

	## Model Checkpoints

	- [Ano-Face-Fair (Inpainting model with ℒ𝐹𝐹𝐸 and ℒ𝑫𝑰𝑭𝑭)](https://huggingface.co/i3n7g3/Ano-Face-Fair)
	- [SimPO-LoRA (Diffusion model with Simple Preference Optimization)](https://huggingface.co/i3n7g3/SimPO-LoRA-Diffusion)

	### Using with Diffusers🧨

	You can use this model directly with the `diffusers` library:


	```python
	import torch
	from PIL import Image
	from diffusers import StableDiffusionInpaintPipeline

	device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
	sd_pipe = StableDiffusionInpaintPipeline.from_pretrained(
	"i3n7g3/Ano-Face-Fair",
	torch_dtype=torch.float16,
	safety_checker=None,
	).to(device)
	sd_pipe.load_lora_weights("i3n7g3/SimPO-LoRA-Diffusion", adapter_name="SimPO")
	sd_pipe.set_adapters(["SimPO"], adapter_weights=[0.5])

	def generate_image(image_path, mask_path, prompt, negative_prompt, pipe, seed):
	try:
	in_image = Image.open(image_path)
	in_mask = Image.open(mask_path)
	except IOError as e:
	print(f"Loading error: {e}")
	return None
	generator = torch.Generator(device).manual_seed(seed)
	result = pipe(image=in_image, mask_image=in_mask, prompt=prompt,
	negative_prompt=negative_prompt, generator=generator)
	return result.images[0]

	image = '/content/Ano-Face-Fair/data/2.png'
	mask = "/content/Ano-Face-Fair/data/2_mask.png"
	prompt = "he is an asian man."
	seed = 38189219984105
	negative_prompt = "low resolution, ugly, disfigured, ugly, bad, immature, cartoon, anime, 3d, painting, b&w, deformed eyes, low quailty, noise"

	try:
	generated_image = generate_image(image_path=image, mask_path=mask, prompt=prompt,
	negative_prompt=negative_prompt, pipe=sd_pipe, seed=seed)
	except TypeError as e:
	print(f"TypeError : {e}")

	generated_image
	```
	![result](./assets/Fig3.png)

	For more detailed usage instructions, including how to prepare segmentation masks and run inference, please refer to our [GitHub repository](https://github.com/i3n7g3/Ano-Face-Fair).

	## Training

	For information on how to train the model, including the use of ℒ𝐹𝐹𝐸 (Focused Feature Enhancement Loss) and ℒ𝑫𝑰𝑭𝑭 (Difference Loss), please see our GitHub repository's [training section](https://github.com/i3n7g3/Ano-Face-Fair#running_man-train).