GPT2-Linear GPT2 Models using Linear layers instead of Conv layers for convenience. Collection by crumb Sep 9, 2023 1 crumbly/gpt2-linear-xl Text Generation • Updated Jul 18, 2023 • 41 • 1 crumbly/gpt2-linear-large Text Generation • Updated Jul 17, 2023 • 19 crumbly/gpt2-linear-medium Text Generation • Updated Jul 17, 2023 • 24 crumbly/gpt2-linear-small Text Generation • Updated Jul 17, 2023 • 24
LLM-code Collection by josephyys Sep 9, 2023 - Deci/DeciCoder-1b Text Generation • Updated Feb 15, 2024 • 3.52k • 246
1 Collection by 4N4SS Sep 9, 2023 - AI Comic Factory 👩 Create your own AI comic with a single prompt • Running on CPU Upgrade • 9.67k WavJourney 🔥 • Sleeping • 193
prompting Collection by huooingface Sep 9, 2023 - Large Language Models as Optimizers Paper • 2309.03409 • Published Sep 7, 2023 • 76
MoLora-v2 First prototype of the second iteration of MoLora, applying mixture-of-experts techniques to the Llama2 model. Collection by crumb Sep 9, 2023 2 crumb/test-00-switchllama-i3b-f10b-e4-init Text Generation • Updated Sep 13, 2023 • 35 crumb/test-00-qlora-wizmlpmix-c0 Updated Sep 4, 2023 • 12 crumb/test-00-qlora-wizmlpmix-c1 Updated Sep 4, 2023 • 10 crumb/test-00-qlora-wizmlpmix-c3 Updated Sep 4, 2023 • 9
MoLora-v1 Model assets for the first Mixture-of-Lora technique applied to Llama. https://bit.ly/48bqshl Collection by crumb Sep 9, 2023 - crumb/llama2-7b-moe-text-exp0-4 Updated Jul 19, 2023 • 7 crumb/llama2-7b-moe-text-exp1-4 Updated Jul 19, 2023 • 17 • 2 crumb/llama2-7b-moe-text-exp2-4 Updated Jul 19, 2023 • 4 crumb/llama2-7b-moe-text-exp3-4 Updated Jul 19, 2023 • 5
Pulse Collection by knamg Sep 9, 2023 - OpenMEDLab/PULSE-7bv5 Text Generation • Updated Dec 14, 2023 • 147 • 28
Graphics Collection by jonahshader Sep 9, 2023 - 3D Gaussian Splatting for Real-Time Radiance Field Rendering Paper • 2308.04079 • Published Aug 8, 2023 • 175
ai-models Collection by nragan Sep 9, 2023 - tiiuae/falcon-180B Text Generation • Updated Sep 6, 2023 • 8.97k • 1.14k