Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
Inference Endpoints
nm-vllm
AutoTrain Compatible
text-generation-inference
4-bit precision
custom_code
Misc with no match
Eval Results
Merge
text-embeddings-inference
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
30
Full-text search
Edit filters
Sort: Trending
Active filters:
nm-vllm
Clear all
neuralmagic/TinyLlama-1.1B-Chat-v1.0-pruned2.4
Text Generation
•
Updated
Mar 5
•
49
•
1
neuralmagic/MiniChat-2-3B-pruned2.4
Text Generation
•
Updated
Mar 5
•
15
neuralmagic/OpenHermes-2.5-Mistral-7B-pruned2.4
Text Generation
•
Updated
Mar 5
•
379
neuralmagic/OpenHermes-2.5-Mistral-7B-pruned50
Text Generation
•
Updated
Mar 5
•
358
•
1
neuralmagic/Nous-Hermes-2-SOLAR-10.7B-pruned2.4
Text Generation
•
Updated
Mar 5
•
13
neuralmagic/Nous-Hermes-2-Yi-34B-pruned2.4
Text Generation
•
Updated
Mar 5
•
39
neuralmagic/Nous-Hermes-2-Yi-34B-pruned50
Text Generation
•
Updated
Mar 5
•
16
neuralmagic/zephyr-7b-beta-marlin
Text Generation
•
Updated
Mar 6
•
4.88k
neuralmagic/llama2.c-stories110M-pruned2.4
Text Generation
•
Updated
Mar 5
•
26
neuralmagic/llama2.c-stories110M-pruned50
Text Generation
•
Updated
Mar 5
•
1.19k
neuralmagic/phi-2-pruned50
Text Generation
•
Updated
Mar 5
•
22
neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
•
Updated
Mar 6
•
3.24k
•
1
neuralmagic/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
•
Updated
Mar 6
•
937
•
2
neuralmagic/Nous-Hermes-2-Yi-34B-marlin
Text Generation
•
Updated
Mar 6
•
22
•
5
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
•
Updated
Mar 17
•
342
softmax/falcon-180B-chat-marlin
Text Generation
•
Updated
Mar 21
•
11
dtransposed/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
Updated
Apr 23
•
32
nm-testing/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
Updated
Apr 25
•
29
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF
Updated
Nov 6
•
99
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF
Updated
Nov 7
•
136
tensorblock/llama2.c-stories110M-pruned50-GGUF
Updated
14 days ago
•
182
mradermacher/phi-2-pruned50-GGUF
Updated
8 days ago
•
156
mradermacher/llama2.c-stories110M-pruned50-GGUF
Updated
6 days ago
•
146
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
Updated
7 days ago
•
132
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
Updated
7 days ago
•
159
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
Updated
7 days ago
•
208
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
Updated
6 days ago
•
123
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
Updated
6 days ago
•
137
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
Updated
5 days ago
•
200
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
Updated
1 day ago
•
144