llama349 / README.md
metadata
license: apache-2.0
language:
  - en
base_model:
  - meta-llama/Llama-3.1-8B-Instruct
pipeline_tag: text-generation
tags:
  - lora
  - adapter
  - Math
  - CoT

Model Details

  • Base Model: meta-llama/Llama-3.1-8B-Instruct
  • Training Method: supervised fine-tuning (SFT)

Datasets:

  • 1K math examples, continuing from llama345

Source Adapters

All source adapters share the following configuration:

  • Rank (r): 16
  • Alpha: 16
  • Target Modules:
    • q_proj (Query projection)
    • k_proj (Key projection)
    • v_proj (Value projection)
    • o_proj (Output projection)
    • up_proj (MLP up projection)
    • down_proj (MLP down projection)
    • gate_proj (MLP gate projection)
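
The configuration above can be sketched in Python to show what it implies in practice. This is a minimal sketch, not the author's training code. The rank, alpha, and target modules come from the list above; the layer dimensions (hidden size 4096, MLP size 14336, key/value projection width 1024, 32 layers) are assumed from the public Llama-3.1-8B configuration and are not stated in this README. It also assumes standard LoRA scaling (alpha / r) rather than a variant such as rsLoRA. The `build_peft_config` helper is hypothetical and leaves dropout and bias at the `peft` defaults, since this README does not specify them.

```python
# Adapter hyperparameters as stated in this README.
LORA_R = 16
LORA_ALPHA = 16
TARGET_MODULES = [
    "q_proj", "k_proj", "v_proj", "o_proj",
    "up_proj", "down_proj", "gate_proj",
]

# ASSUMPTION: dimensions taken from the public Llama-3.1-8B config,
# not from this README.
HIDDEN = 4096
INTERMEDIATE = 14336
KV_DIM = 1024        # 8 key/value heads x 128 head dimension
NUM_LAYERS = 32

# (in_features, out_features) of each targeted linear layer.
MODULE_SHAPES = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_scaling(r: int, alpha: int) -> float:
    """Standard LoRA scaling factor applied to the low-rank update."""
    return alpha / r


def lora_param_count(r: int, modules: list[str], num_layers: int) -> int:
    """Trainable parameters: each module adds A (r x in) and B (out x r)."""
    per_layer = sum(
        r * (MODULE_SHAPES[m][0] + MODULE_SHAPES[m][1]) for m in modules
    )
    return per_layer * num_layers


def build_peft_config():
    """Hypothetical helper: the same settings as a peft LoraConfig."""
    from peft import LoraConfig  # imported lazily; requires `pip install peft`

    return LoraConfig(
        r=LORA_R,
        lora_alpha=LORA_ALPHA,
        target_modules=TARGET_MODULES,
        task_type="CAUSAL_LM",
    )


if __name__ == "__main__":
    print("scaling:", lora_scaling(LORA_R, LORA_ALPHA))
    print("trainable params:", lora_param_count(LORA_R, TARGET_MODULES, NUM_LAYERS))
```

Under these assumptions the scaling factor is 1.0 and the adapter has 41,943,040 trainable parameters, roughly 0.5% of the base model's size.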