deltas typeof/zephyr-7b-beta-lora Text Generation • Updated May 25, 2024 • 134 • 5 typeof/Hermes-2-Pro-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 • 3 typeof/Hermes-2-Theta-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 • 2 typeof/openhermes-2.5-mistral-lora Updated Nov 25, 2023 • 4 • 1
soliste Single layer models for experiments typeof/soliste-TinyLlama Text Generation • 0.2B • Updated May 24, 2024 • 13 typeof/soliste-Mistral-v0.1 Text Generation • 0.5B • Updated May 24, 2024 • 5 typeof/soliste-mistral-v0.3 Text Generation • 0.5B • Updated May 25, 2024 • 4
experiments typeof/mamba-130m-instruct Updated Dec 7, 2023 • 6 • 22 typeof/mistral-3.3B Text Generation • 3B • Updated Nov 13, 2023 • 17 • 11 typeof/Oracle-pythia-70m Text Generation • 0.1B • Updated Dec 2, 2023 • 5 typeof/mistral-60m Text Generation • 0.1B • Updated Nov 30, 2023 • 11 • 1
deltas typeof/zephyr-7b-beta-lora Text Generation • Updated May 25, 2024 • 134 • 5 typeof/Hermes-2-Pro-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 • 3 typeof/Hermes-2-Theta-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 • 2 typeof/openhermes-2.5-mistral-lora Updated Nov 25, 2023 • 4 • 1
experiments typeof/mamba-130m-instruct Updated Dec 7, 2023 • 6 • 22 typeof/mistral-3.3B Text Generation • 3B • Updated Nov 13, 2023 • 17 • 11 typeof/Oracle-pythia-70m Text Generation • 0.1B • Updated Dec 2, 2023 • 5 typeof/mistral-60m Text Generation • 0.1B • Updated Nov 30, 2023 • 11 • 1
soliste Single layer models for experiments typeof/soliste-TinyLlama Text Generation • 0.2B • Updated May 24, 2024 • 13 typeof/soliste-Mistral-v0.1 Text Generation • 0.5B • Updated May 24, 2024 • 5 typeof/soliste-mistral-v0.3 Text Generation • 0.5B • Updated May 25, 2024 • 4