Files changed (1) hide show
  1. README.md +26 -0
README.md ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - apple/DiffuCoder-7B-Base
4
+ tags:
5
+ - code
6
+ - text-diffusion-model
7
+ - diffusion large language model
8
+ license: unknown
9
+ ---
10
+ ### DiffuCoder-7B-Instruct
11
+
12
+ The DiffuCoder-7B-Instruct model builds on the DiffuCoder-7B-Base checkpoint with instruction-tuning to better follow code-related prompts.
13
+
14
+ - Training recipe: with a newly introduced pad token, we train this model with fixed length conditionally on [OpenCoder-SFT](https://huggingface.co/datasets/OpenCoder-LLM/opc-sft-stage2) data for 5 epochs.
15
+
16
+ - Benchmarks: Demonstrates stronger instruction-following capabilities than the Base model.
17
+
18
+
19
+ #### More details and usage examples:
20
+
21
+ - Paper: [DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation](https://arxiv.org/abs/2506.20639)
22
+
23
+ - GitHub: https://github.com/apple/ml-diffucoder
24
+
25
+ #### Acknowledgement
26
+ To power this HuggingFace model release, we reuse [Dream](https://huggingface.co/Dream-org/Dream-v0-Base-7B)'s modeling architecture and generation utils.