Okkhor Diffusion

Okkhor Diffusion is a category of Denoising Diffusion Probabilistic Models(DDPM) designed to generate images of Bangla handwritten characters. This model card corresponds to Okkhor-Diffusion trained on Banglalekha-Isolated dataset.

Variants

Usage

from diffusers import DiffusionPipeline
import torch
device="cuda"
pipeline = DiffusionPipeline.from_pretrained(
              "ahmedfaiyaz/OkkhorDiffusion",
              custom_pipeline="ahmedfaiyaz/OkkhorDiffusion",
              embedding=torch.int16
            )
pipeline.to(device)
pipeline.embedding=torch.tensor([9],device=device) # 'ও': 9
pipeline(batch_size=1,num_inference_steps=100).images[0]

Character mappings

OkkhorDiffusion-Banglalekha
Character Serial
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
ড় 43
ঢ় 44
য় 45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
ক্ষ(ksa) 60
ব্দ(bda) 61
ঙ্গ 62
স্ক 63
স্ফ 64
স্থ 65
চ্ছ 66
ক্ত 67
স্ন 68
ষ্ণ 69
ম্প 70
হ্ম 71
প্ত 72
ম্ব 73
ন্ড 74
দ্ভ 75
ত্থ 76
ষ্ঠ 77
ল্প 78
ষ্প 79
ন্দ 80
ন্ধ 81
ম্ম 82
ন্ঠ 83

Citation

@ARTICLE{10445466,

  author={Fuad, Md Mubtasim and Faiyaz, A. and Arnob, Noor Mairukh Khan and Mridha, M.F. and Saha, Aloke Kumar and Aung, Zeyar},

  journal={IEEE Access}, 

  title={Okkhor-Diffusion: Class Guided Generation of Bangla Isolated Handwritten Characters using Denoising Diffusion Probabilistic Model (DDPM)}, 

  year={2024},

  volume={},

  number={},

  pages={1-1},

  abstract={Bangla has a unique script with a complex set of characters, making it a fascinating subject of study for linguists and cultural enthusiasts. Unique in some of its similar characters which are only distinguishable by subtle differences in their shapes and diacritics, there has been a notable increase in research on Bangla character recognition and classification using machine learning-based approaches. However, Handwritten Bangla Character Recognition (HBCR) training requires an adequate amount of data from a diversely distributed dataset. Making diverse datasets for HBCR training is a challenging and tedious task to carry out. Yet, there is limited research on the automatic generation of handwritten Bangla characters. Motivated by this open area of research, this paper proposes a novel approach ’Okkhor-Diffusion’ for class-guided generation of Bangla isolated handwritten characters using a novel Denoising Diffusion Probabilistic Model (DDPM). No prior research has used DDPM for this purpose, making the proposed approach novel. The DDPM is a generative model that uses a diffusion process to transform noise-corrupted data into diverse samples; despite being trained on a small training set. In our experiments, StyleGAN2-ADA had notably inferior performance compared to Okkhor-Diffusion in generating realistic isolated handwritten Bangla characters. Experimental results on the BanglaLekha-Isolated dataset demonstrate that the proposed Okkhor-Diffusion model generates realistic isolated handwritten Bangla characters, with a mean Multi-Scale Structural Similarity Index Measure (MS-SSIM) score of 0.178 compared to 0.177 for the real samples. The Fréchet Inception Distance (FID) score for the synthetic handwritten Bangla characters is 5.426. Finally, the newly proposed Bangla Character Aware Fréchet Inception Distance (BCAFID) score of the proposed Okkhor-Diffusion model is 10.388.},

  keywords={Deep learning;Handwritten character generation;Generative Model;Denoising Diffusion Probabilistic Model},

  doi={10.1109/ACCESS.2024.3370674},

  ISSN={2169-3536},

  month={},}
Downloads last month
10
Inference Examples
Inference API (serverless) has been turned off for this model.

Space using ahmedfaiyaz/OkkhorDiffusion 1