toread Collection by kd-tensor Sep 9, 2023 - FLM-101B: An Open LLM and How to Train It with $100K Budget Paper • 2309.03852 • Published Sep 7, 2023 • 44
ai-comic Collection by cowboyuniverse Sep 9, 2023 - Running on CPU Upgrade 9.69k AI Comic Factory 👩 Create your own AI comic with a single prompt
first Collection by slishkom Sep 9, 2023 - KoboldAI/OPT-30B-Erebus Text Generation • Updated Jan 26, 2023 • 1.87k • 65
GPT2-Linear GPT2 Models using Linear layers instead of Conv layers for convenience. Collection by crumb Sep 9, 2023 1 crumbly/gpt2-linear-xl Text Generation • Updated Jul 18, 2023 • 36 • 1 crumbly/gpt2-linear-large Text Generation • Updated Jul 17, 2023 • 14 crumbly/gpt2-linear-medium Text Generation • Updated Jul 17, 2023 • 20 crumbly/gpt2-linear-small Text Generation • Updated Jul 17, 2023 • 20
LLM-code Collection by josephyys Sep 9, 2023 - Deci/DeciCoder-1b Text Generation • Updated Feb 15, 2024 • 3.51k • 246
1 Collection by 4N4SS Sep 9, 2023 - Running on CPU Upgrade 9.69k AI Comic Factory 👩 Create your own AI comic with a single prompt Sleeping 193 WavJourney 🔥
prompting Collection by huooingface Sep 9, 2023 - Large Language Models as Optimizers Paper • 2309.03409 • Published Sep 7, 2023 • 76
MoLora-v2 First prototype of the second iteration of MoLora, applying mixture-of-experts techniques to the Llama2 model. Collection by crumb Sep 9, 2023 2 crumb/test-00-switchllama-i3b-f10b-e4-init Text Generation • Updated Sep 13, 2023 • 26 crumb/test-00-qlora-wizmlpmix-c0 Updated Sep 4, 2023 • 7 crumb/test-00-qlora-wizmlpmix-c1 Updated Sep 4, 2023 • 8 crumb/test-00-qlora-wizmlpmix-c3 Updated Sep 4, 2023 • 6