--- license: llama2 inference: false base_model: llmware/dragon-llama-7b-v0 base_model_relation: quantized tags: - green - llmware-rag - p7 - ov --- # dragon-llama2-ov **dragon-llama2-ov** is a high-quality, fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in OpenVino int4 for AI PCs using Intel GPU, CPU and NPU. This model provides a good combination of accuracy and inference performance. ### Model Description - **Developed by:** llmware - **Model type:** llama2 - **Parameters:** 7 billion - **Quantization:** int4 - **Model Parent:** [llmware/dragon-llama-7b-v0](https://www.huggingface.co/llmware/dragon-llama-7b-v0) - **Language(s) (NLP):** English - **License:** Llama2 Community License - **Uses:** Fact-based question-answering, RAG - **RAG Benchmark Accuracy Score:** 97.25 ## Model Card Contact [llmware on github](https://www.github.com/llmware-ai/llmware) [llmware on hf](https://www.huggingface.co/llmware) [llmware website](https://www.llmware.ai)