llmware
/

dragon-llama2-ov

Model card Files Files and versions Community

dragon-llama2-ov / README.md

doberst's picture

Update README.md

acdd2b2 verified 16 days ago

|

1.11 kB

	---
	license: llama2
	inference: false
	base_model: llmware/dragon-llama-7b-v0
	base_model_relation: quantized
	tags:
	- green
	- llmware-rag
	- p7
	- ov
	---

	# dragon-llama2-ov

	dragon-llama2-ov is a high-quality, fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in OpenVino int4 for AI PCs using Intel GPU, CPU and NPU.

	This model provides a good combination of accuracy and inference performance.

	### Model Description

	- Developed by: llmware
	- Model type: llama2
	- Parameters: 7 billion
	- Quantization: int4
	- Model Parent: [llmware/dragon-llama-7b-v0](https://www.huggingface.co/llmware/dragon-llama-7b-v0)
	- Language(s) (NLP): English
	- License: Llama2 Community License
	- Uses: Fact-based question-answering, RAG
	- RAG Benchmark Accuracy Score: 97.25


	## Model Card Contact
	[llmware on github](https://www.github.com/llmware-ai/llmware)
	[llmware on hf](https://www.huggingface.co/llmware)
	[llmware website](https://www.llmware.ai)