ianisdev committed
Commit 353a5c7 · verified · 1 Parent(s): fd3b67a

Upload folder using huggingface_hub

This view is limited to 50 files because it contains too many changes.
README.md ADDED
@@ -0,0 +1,61 @@
---
license: mit
tags:
- vqvae
- image-generation
- unsupervised-learning
- pytorch
- mnist
- generative-model
datasets:
- mnist
library_name: pytorch
model-index:
- name: VQ-VAE-MNIST
  results:
  - task:
      type: image-generation
      name: Image Generation
    dataset:
      name: MNIST
      type: image-classification
    metrics:
    - name: FID
      type: frechet-inception-distance
      value: 53.21
---

# VQ-VAE for MNIST

This is a **Vector Quantized Variational Autoencoder (VQ-VAE)** trained on the MNIST dataset using PyTorch. The model compresses and reconstructs grayscale handwritten digits and is used as part of an image augmentation and generative modeling pipeline.

## 🧠 Model Details

- **Model Type**: VQ-VAE
- **Dataset**: MNIST
- **Epochs**: 35
- **Latent Space**: Discrete (quantized codebook vectors)
- **Input Size**: 64×64 (resized and converted to RGB)
- **Reconstruction Loss**: MSE-based
- **Implementation**: Custom PyTorch; 3-layer convolutional encoder/decoder
- **FID Score**: **53.21**
- **Loss Curve**: [`loss_curve.png`](./loss_curve.png)

> This model learns compressed representations of digit images using vector quantization (see the sketch below). The reconstructions can be used for augmentation or for generative downstream tasks.

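As a reference for what the discrete bottleneck does, here is a minimal sketch of a standard VQ-VAE quantization layer (nearest-codebook lookup with a straight-through gradient, as in van den Oord et al., 2017). The class name, codebook size, and commitment weight below are illustrative assumptions; the actual layer in `models/vqvae/model.py` may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    """Nearest-neighbour codebook lookup with a straight-through gradient.

    Hypothetical sketch; not necessarily the exact layer used in this repo.
    """

    def __init__(self, num_embeddings=512, embedding_dim=64, beta=0.25):
        super().__init__()
        self.beta = beta  # commitment-loss weight (assumed value)
        self.codebook = nn.Embedding(num_embeddings, embedding_dim)
        self.codebook.weight.data.uniform_(-1.0 / num_embeddings, 1.0 / num_embeddings)

    def forward(self, z):
        # z: (B, D, H, W) encoder output; flatten to (B*H*W, D) vectors.
        B, D, H, W = z.shape
        flat = z.permute(0, 2, 3, 1).reshape(-1, D)
        # Squared L2 distance from every vector to every codebook entry.
        dists = (flat.pow(2).sum(1, keepdim=True)
                 - 2 * flat @ self.codebook.weight.t()
                 + self.codebook.weight.pow(2).sum(1))
        idx = dists.argmin(dim=1)  # nearest code per spatial position
        q = self.codebook(idx).view(B, H, W, D).permute(0, 3, 1, 2)
        # Codebook loss pulls codes toward encoder outputs; the commitment
        # term pulls encoder outputs toward their chosen codes.
        vq_loss = F.mse_loss(q, z.detach()) + self.beta * F.mse_loss(z, q.detach())
        q = z + (q - z).detach()  # straight-through estimator for the backward pass
        return q, vq_loss, idx.view(B, H, W)
```
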
## 📁 Files

- `generator.pt`: Trained VQ-VAE model weights.
- `loss_curve.png`: Plot of the training loss across all 35 epochs.
- `fid_score.json`: Stored Fréchet Inception Distance (FID) evaluation result.
- `fid_real/` and `fid_fake/`: 1,000 real and 1,000 generated images, respectively, used for FID computation (see the example below).

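The FID score in `fid_score.json` compares the two image folders above. One way to reproduce such a comparison (an assumed tool, not necessarily the repo's own evaluation script) is the `pytorch-fid` package, which computes FID directly between two image directories; the command-line equivalent is `python -m pytorch_fid fid_real fid_fake`.

```python
# pip install pytorch-fid
from pytorch_fid.fid_score import calculate_fid_given_paths

fid = calculate_fid_given_paths(
    ["fid_real", "fid_fake"],  # the two image folders shipped with this repo
    batch_size=50,
    device="cpu",
    dims=2048,  # default InceptionV3 pool3 feature dimension
)
print(f"FID: {fid:.2f}")
```
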
## 📦 How to Use

```python
import torch
from models.vqvae.model import VQVAE  # model class from this project's codebase

model = VQVAE()
model.load_state_dict(torch.load("generator.pt", map_location="cpu"))
model.eval()  # inference mode: disables dropout/batch-norm updates
```
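
With the model loaded as above, a reconstruction pass might look like the following. The preprocessing mirrors the input format stated in the model details (64×64, RGB); the forward-pass return signature is an assumption, since custom VQ-VAEs commonly return the reconstruction together with the quantization loss.

```python
import torch
from torchvision import datasets, transforms

# Match the stated preprocessing: resize to 64x64, replicate to 3 channels.
tfm = transforms.Compose([
    transforms.Resize((64, 64)),
    transforms.Grayscale(num_output_channels=3),
    transforms.ToTensor(),
])
mnist = datasets.MNIST("data", train=False, download=True, transform=tfm)
x = mnist[0][0].unsqueeze(0)  # (1, 3, 64, 64) batch of one digit

with torch.no_grad():
    out = model(x)

# Assumption: the forward pass returns the reconstruction first,
# e.g. `recon, vq_loss = model(x)`; adjust to the actual signature.
recon = out[0] if isinstance(out, (tuple, list)) else out
```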
fid_fake/00000.png ADDED
fid_fake/00001.png ADDED
fid_fake/00002.png ADDED
fid_fake/00003.png ADDED
fid_fake/00004.png ADDED
fid_fake/00005.png ADDED
fid_fake/00006.png ADDED
fid_fake/00007.png ADDED
fid_fake/00008.png ADDED
fid_fake/00009.png ADDED
fid_fake/00010.png ADDED
fid_fake/00011.png ADDED
fid_fake/00012.png ADDED
fid_fake/00013.png ADDED
fid_fake/00014.png ADDED
fid_fake/00015.png ADDED
fid_fake/00016.png ADDED
fid_fake/00017.png ADDED
fid_fake/00018.png ADDED
fid_fake/00019.png ADDED
fid_fake/00020.png ADDED
fid_fake/00021.png ADDED
fid_fake/00022.png ADDED
fid_fake/00023.png ADDED
fid_fake/00024.png ADDED
fid_fake/00025.png ADDED
fid_fake/00026.png ADDED
fid_fake/00027.png ADDED
fid_fake/00028.png ADDED
fid_fake/00029.png ADDED
fid_fake/00030.png ADDED
fid_fake/00031.png ADDED
fid_fake/00032.png ADDED
fid_fake/00033.png ADDED
fid_fake/00034.png ADDED
fid_fake/00035.png ADDED
fid_fake/00036.png ADDED
fid_fake/00037.png ADDED
fid_fake/00038.png ADDED
fid_fake/00039.png ADDED
fid_fake/00040.png ADDED
fid_fake/00041.png ADDED
fid_fake/00042.png ADDED
fid_fake/00043.png ADDED
fid_fake/00044.png ADDED
fid_fake/00045.png ADDED
fid_fake/00046.png ADDED
fid_fake/00047.png ADDED
fid_fake/00048.png ADDED