rocm-rwkv / README.md
something-else's picture
Update README.md
73cf444 verified
|
raw
history blame
613 Bytes
metadata
tags:
  - rocm
  - amd-gpus
  - amd-ai
  - rocm-ai
  - rocm-rwkv
  - 3B-rwkv

3B rocm-rwkv pth record.

  • rwkv-final-chnk5.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-5 and with a loss of 2.456.
  • rwkv-final-chnk17.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-10 for the first epoch and an aditional training with chunk1-7 after the first epoch and with a loss of 2.281
  • rwkv-code39-16012024.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-10 for the first epoch and an aditional training with chunk1-8 after the first epoch; plus a little bit of code. This pth has a loss of 1.174.