neuralmagic
/

mpt-7b-gsm8k-pruned50-quant-ds

Text Generation

Model card Files Files and versions Community

mpt-7b-gsm8k-pruned50-quant-ds / README.md

mgoin's picture

Create README.md

935b5a4 over 1 year ago

|

462 Bytes

	---
	tags:
	- deepsparse
	---

	# Sparse MPT-7B-GSM8k - DeepSparse

	Sparse finetuned MPT 7b model on GSM8k, pruned to 50% and quantized for inference with DeepSparse

	```python
	from deepsparse import TextGeneration
	model = TextGeneration(model="hf:neuralmagic/mpt-7b-gsm8k-pruned50-quant")
	model("There are twice as many boys as girls at Dr. Wertz's school. If there are 60 girls and 5 students to every teacher, how many teachers are there?", max_new_tokens=50)
	```