LLaDA-8B-Base

We introduce LLaDA, a diffusion language model trained entirely from scratch at an unprecedented 8B-parameter scale, rivaling LLaMA3 8B in performance.

Project Page

Code

Format: Safetensors
Model size: 8.02B params
Tensor type: BF16
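
As a rough illustration of what the model size and tensor type listed above imply, the following sketch estimates the memory needed to hold the weights alone. It assumes every parameter is stored at the listed precision and uses the rounded parameter count, so the result is approximate; the helper `weight_memory_bytes` and the `BYTES_PER_PARAM` table are hypothetical names introduced here, not part of the LLaDA code.

```python
# Rough weight-memory estimate from the model card metadata.
# Assumptions (not from the source): all parameters are stored at the listed
# precision, and activations, caches, and framework overhead are ignored.

BYTES_PER_PARAM = {"BF16": 2, "FP16": 2, "FP32": 4}


def weight_memory_bytes(num_params: int, tensor_type: str = "BF16") -> int:
    """Return the approximate number of bytes occupied by the weights alone."""
    return num_params * BYTES_PER_PARAM[tensor_type]


# 8.02B params is the rounded figure from the model card.
NUM_PARAMS = 8_020_000_000

size_bytes = weight_memory_bytes(NUM_PARAMS, "BF16")
print(f"{size_bytes / 1e9:.2f} GB (decimal)")
print(f"{size_bytes / 2**30:.2f} GiB (binary)")
```

In practice, running the model needs more memory than this figure, since the estimate covers only the stored weights.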
