Sansa's picture
Create README.md
e15eeb5 verified
|
raw
history blame
999 Bytes
metadata
base_model:
  - apple/DiffuCoder-7B-Base
tags:
  - code
  - text-diffusion-model
  - diffusion large language model
license: unknown

DiffuCoder-7B-Instruct

The DiffuCoder-7B-Instruct model builds on the DiffuCoder-7B-Base checkpoint with instruction-tuning to better follow code-related prompts.

  • Training recipe: with a newly introduced pad token, we train this model with fixed length conditionally on OpenCoder-SFT data for 5 epochs.

  • Benchmarks: Demonstrates stronger instruction-following capabilities than the Base model.

More details and usage examples:

Acknowledgement

To power this HuggingFace model release, we reuse Dream's modeling architecture and generation utils.