metadata
license: apache-2.0
inference: false
tags:
- green
- llmware-rag
- p3
- onnx
bling-phi-3-onnx
bling-phi-3-ov is an ONNX int4 quantized version of BLING Phi-3, providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
bling-phi-3 is a fact-based question-answering model, optimized for complex business documents.
Get started right away with OpenVino
Looking for AI PC solutions and demos, contact us at llmware
Model Description
- Developed by: llmware
- Model type: phi3
- Parameters: 3.8 billion
- Model Parent: llmware/bling-phi-3
- Language(s) (NLP): English
- License: Apache 2.0
- Uses: Fact-based question-answering
- RAG Benchmark Accuracy Score: 99.5
- Quantization: int4