Gaivoronsky
/

Mistral-7B-Saiga



	---
	license: cc-by-4.0
	datasets:
	- lksy/ru_instruct_gpt4
	- IlyaGusev/ru_turbo_saiga
	- IlyaGusev/ru_sharegpt_cleaned
	- IlyaGusev/oasst1_ru_main_branch
	language:
	- ru
	pipeline_tag: text-generation
	---
	This is a generative model based on [IlyaGusev/saiga_mistral_7b_lora](https://huggingface.co/IlyaGusev/saiga_mistral_7b_lora), converted to fp16 format.

	Install vLLM:
	```bash
	pip install vllm
	```

	Start the OpenAI-compatible server:
	```bash
	python -u -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --model Gaivoronsky/Mistral-7B-Saiga
	```
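
Before sending prompts, it can help to confirm that the server has finished loading the model. The sketch below is one way to do that with only the Python standard library: it queries the `/v1/models` route of the OpenAI-compatible API and returns the model ids it reports. The base URL assumes the server runs on the same machine on port 8000, and the helper names (`models_url`, `parse_model_ids`, `list_models`) are hypothetical, chosen for illustration.

```python
import json
import urllib.request

# Assumption: the server runs locally on port 8000.
BASE_URL = "http://localhost:8000/v1"


def models_url(base_url: str = BASE_URL) -> str:
    """Build the URL of the model-listing route."""
    return base_url.rstrip("/") + "/models"


def parse_model_ids(payload: dict) -> list:
    """Extract model ids from an OpenAI-style model list response."""
    return [entry["id"] for entry in payload.get("data", [])]


def list_models(base_url: str = BASE_URL, timeout: float = 5.0) -> list:
    """Ask the running server which models it serves (needs a live server)."""
    with urllib.request.urlopen(models_url(base_url), timeout=timeout) as resp:
        return parse_model_ids(json.load(resp))
```

Once the server is up, calling `list_models()` should return a list that includes `Gaivoronsky/Mistral-7B-Saiga`; a connection error usually means the server is still starting.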

	Client:
	```python
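	# Requires the legacy client: pip install "openai<1.0"
	import openai

	# Point the client at the local vLLM server; the key is a dummy value.
	openai.api_base = "http://localhost:8000/v1"
	openai.api_key = "none"

	response = openai.ChatCompletion.create(
	    model="Gaivoronsky/Mistral-7B-Saiga",
	    # Prompt: "How much does a giraffe weigh?"
	    messages=[{"role": "user", "content": "Сколько весит жираф?"}],
	    max_tokens=512,
	)
	print(response["choices"][0]["message"]["content"])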
	```
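
The client above relies on the `openai.ChatCompletion` interface, which was removed in version 1.0 of the `openai` package. As an alternative that does not depend on any client version, the same request can be sent as plain HTTP. This is a minimal sketch using only the standard library; it assumes the server from the previous step is listening on `localhost:8000`, and the helper names (`build_chat_request`, `extract_reply`, `ask`) are hypothetical.

```python
import json
import urllib.request

# Assumptions: local server on port 8000, model name as used when starting it.
BASE_URL = "http://localhost:8000/v1"
MODEL = "Gaivoronsky/Mistral-7B-Saiga"


def build_chat_request(prompt: str, max_tokens: int = 512,
                       base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a POST request for the chat completions route."""
    body = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        base_url.rstrip("/") + "/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_reply(response_json: dict) -> str:
    """Pull the assistant text out of a chat completion response."""
    return response_json["choices"][0]["message"]["content"]


def ask(prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the running server and return the reply text."""
    with urllib.request.urlopen(build_chat_request(prompt), timeout=timeout) as resp:
        return extract_reply(json.load(resp))
```

With the server running, `print(ask("Сколько весит жираф?"))` sends the same question as the client above.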