Spaces: Dovakiins / qwerrwe (Build error)

qwerrwe / src / axolotl / monkeypatch
  • 100 contributors
History: 60 commits
Latest commit: winglian, relora: magnitude pruning of the optimizer (#1245), 8c2e05a (unverified), over 1 year ago
  • data
    support for true batches with multipack (#1230) over 1 year ago
  • falcon
    Falcon embeddings (#1149) [skip docker] over 1 year ago
  • mixtral
    Mixtral fixes 20240124 (#1192) [skip ci] over 1 year ago
  • phi
    Phi2 multipack (#1173) over 1 year ago
  • qwen2
    Qwen2 (#1166) over 1 year ago
  • btlm_attn_hijack_flash.py
    2.32 kB
    flash_attention + sample packing for stablelm 3b (#671) over 1 year ago
  • fastchat_conversation_turns.py
    8.33 kB
    Added chatglm3 conversation type for training models like TinyLLama (#1036) over 1 year ago
  • llama_attn_hijack_flash.py
    32 kB
    Add shifted sparse attention (#973) [skip-ci] over 1 year ago
  • llama_attn_hijack_xformers.py
    5.69 kB
    various bugfixes (#856) over 1 year ago
  • llama_expand_mask.py
    672 Bytes
    support for true batches with multipack (#1230) over 1 year ago
  • llama_patch_multipack.py
    1.1 kB
    support for true batches with multipack (#1230) over 1 year ago
  • mistral_attn_hijack_flash.py
    22.5 kB
    Respect sliding_window=None (#1214) over 1 year ago
  • relora.py
    16.7 kB
    relora: magnitude pruning of the optimizer (#1245) over 1 year ago
  • stablelm_attn_hijack_flash.py
    15.4 kB
    flash_attention + sample packing for stablelm 3b (#671) over 1 year ago
  • utils.py
    8.01 kB
    support for true batches with multipack (#1230) over 1 year ago