Small LMs
- Paused🐋
- Build error💬
MonadGPT
- Runtime error😻
Mistral-7B
- Paused🌪️
Voice Chat With Mistral 7B
- Paused⚡
Qwen VL
- Runtime error🏃
ChatGLM 6B
- Build error🐶
Koboldcpp Tiefighter
- Paused📚
Tinyllama Chat
- Paused⚡
Stable LM 2 Zephyr 1.6b
- Runtime error🚀
MoE LLaVA
- Paused🐬
Chat with DeepSeek Coder 7B
- Runtime error🦙
Llama 2 13b Chat
- Runtime error🔥
LLaVA
- Runtime error📚
Video LLaVA
- Paused🏢
Llava
- Paused👁
LLaVA 1.6
- Paused🐠
Gradio Notebook Local Model
- Sleeping📚
Blind Chat
- Running🌊🐋
Web-LLM: Mistral 7B OpenOrca
7B text-generation model running directly from the browser
- Runtime error🍑
[NSFW] C0ffee's Erotic Story Generator 2
- Running📉
Whisper Chess
- Runtime error🦙
LLaMA Board
Fine-tuning large language model with Gradio UI
- Running📕
Ratchet + Phi Locally
Run Phi-3 in Browser
- Running🗣️🏎️
Ratchet + Whisper Locally
Run Whisper in Browser
- Running4🔮🔮
Noosphere Webui on Cpu
- Running13👌👌
epicPhotoGASM Webui on Cpu
- Running🐠
Experimental Phi3 Webgpu
NeverSleep/Llama-3-Lumimaid-8B-v0.1
Text Generation • Updated • 542 • 79gradientai/Llama-3-8B-Instruct-Gradient-4194k
Text Generation • Updated • 162 • 69tiiuae/falcon-11B
Text Generation • Updated • 21.2k • 213- Running on Zero14🌘w🌖
Text-Streaming
text streaming space using Gemma-7B
- Running🌐
GemmaOnDevice
- Running on Zero4.28k🔥
OpenGPT 4o
GPT 4o like bot.
- Paused🤲
PaliGemma Demo
- Running🚀
Phi-3 WebGPU
A private and powerful AI that runs locally in your browser
- Running🏃
Mistral-7B-v0.3 Fast Chat
Fast chatting with Mistral v0.3
- Running🌐
YOLOv10 Web
- Running🏆
WebGPU Nomic Embed
- Running🚀
WebGPU Chat Qwen2
- Runtime error⚡
GLiNER HandyLab
- Paused💻
Kosmos 2
- Running6💫
Text Gen Playground
Chat with any model on the Hub
- Running🚀
Gemini Nano (Chrome Built-in)
Run Gemini Nano locally in your browser with Transformers.js
- Running1🌋
LLaVA WebGPU
A private and powerful multimodal AI chatbot that runs local
- Running🕯️🔡
Candle T5 Generation Wasm
- Running on Zero58🌍
MInference
- Running🚀
SmolLM 360M Instruct WebGPU
A blazingly fast and powerful AI chatbot that runs locally.
- Running5🚀
SmolLM 135M Instruct WebGPU
A blazingly fast and powerful AI chatbot that runs locally.
- Running78🔥
Chameleon 30b
- Running4✨
Nymbot Lite
Vision Chatbot with ImgGen & Web Search - Runs on CPU
- Running on Zero3🦙
Llama-3.1-8B-Instruct
The best 8B model with 128K context
- Sleeping🌖
ollama-Chat
Chat with Ollama
- Running4🤔📊
Llama CSV Agent
Need to analyze data? Let a Llama-3.1 agent do it for you!
- Runtime error1😻
MagicPrompt Stable Diffusion
- Running🏃
WebLLM JSON Playground
- Running💬
Webllm Simple Chat
- Running on Zero78😻
Gemma 2 2B IT
Chatbot
- Runtime error1✨✨✨
Cohere Command R+ inference
c4ai-command-r-plus (hub inference, not API)
- Sleeping🐁
Phi-3-Mini-4k-Instruct
Phi-3-Mini on hub inference
- Sleeping1🐼
Yi-1.5-34B-Chat
Yi-1.5-34B on hub inference
- Running1✨
Mistral-7B-Instruct-v0.3
SOTA Small Model by Mistral AI
- Running on Zero64🐍
Falcon Mamba Playground
- Paused💬
MiniCPM-V-2 6
- Sleeping🤏
Instant SmolLM
Run SmolLM-360M-Instruct in realtime with MLC WebLLM
- Runtime error158💬
LongWriter
LLM for long context
- Paused15🐭
Phi-3.5-Mini-Instruct
New SOTA small model from Microsoft, and multilingual!
- Sleeping3🤗
Inference Playground
One-stop-shop for frequently used models
- Running219💻
HF's Missing Inference Widget
- Sleeping💻🧲
1-Shot LLM Playground
Single-shot inference for rapid model testing
- Running1⚡
Phi-3.5-Mini WebLLM
- Running on Zero206🔥
Phi 3.5 Vision
- Paused🤩
Qwen2-VL-2B
Multilingual, Multimodal, Mighty 2B
- Sleeping🚀
Kotaemon
- Sleeping🏃
Dataset Rewriter
- Paused6🐢
Reflection 70B llama.cpp
Reflection-70B by Matt Schumer
- Paused3⚡
Joy Caption Alpha One
- Paused🦙🦙🦙
Llama-3.2-3B-Instruct
New SOTA small model from Meta
- Paused4🦙
Llama-3.2-1B-Instruct
the new tiny king
- Paused5📊
HTML To Markdown
Convert HTML to Markdown with readerlm-1.5B
- Running362🚀
Llama-Vision-11B
- Running⚡
Qwen-2.5 WebLLM
- Running2🦙
Llama-3.2 WebLLM
- Running on Zero100👁
Molmo 7B D 0924
- Paused🌖
Emu3
- Running🦙
Llama 3.2 WebGPU
A powerful AI chatbot that runs locally in your browser
- Running2🏎️
WebLLM Playground
- Sleeping9🐠🤖👌🏻
Nemotron-Mini
NemoAligner Synthetic SFT with function calling
- Paused🚀
Zamba2 7B
- Sleeping👌🔍
MiniSearch
Minimalist web-searching app with browser-based AI assistant
- Sleeping🌍
Janus Space Clone Me First
- Running🐍
Qwen 2.5 Code Interpreter
- Running on T4228🌍
Aya Expanse
- Running🦙
Wllama
Run GGUF directly on your browser!
- Running12🤏
SmolLM2-1.7B-Instruct Serverless
New SOTA smol king by Hugging Face
- Sleeping💻
BitNet.cpp
- Running on Zero94🏃
JanusFlow 1.3B
Huggingface space for JanusFlow-1.3B
- Paused🏃
JanusFlow 1.3B
Text Gen | Vision | Image Gen | One 1.3b model
- Sleeping2📉
Ai Scraper
- Paused📊
SmolVLM
- Running🏛️
Janus 1.3B WebGPU
In-browser unified multimodal understanding and generation.
- Sleeping👁️
Omnivlm Dpo Demo
- Running🧑💻
Github Issue Generator
- Running on Zero190💻
ShowUI
- Running🗣️
Text-to-Speech WebGPU
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
- Running on Zero7🐍
Falcon3 Mamba 7b Instruct Playground
- Running on Zero28🦅
Falcon3 Demo
F3-DEMO