Solshine's picture
Update README.md
6130880 verified
|
raw
history blame
764 Bytes
metadata
base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
language:
  - en
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - trl
  - sft
  - reflection

Uploaded model

  • Developed by: Solshine
  • License: apache-2.0
  • Finetuned from model : unsloth/meta-llama-3.1-8b-instruct-bnb-4bit

Inspired by and featuring the Reflection Tuning technique pioneered by Matt Shumer (possibly earlier innovated by the team at Anthropic.)

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.