Hallucinations & 8/16-bit build
#10
by
roblek
- opened
This 4-bit ONNX version of the model seems to hallucinate frequently:
- Are there any specific GeneratorParams.search_options you would recommend to reduce the hallucinations?
- What is the procedure for building an 8-bit or 16-bit ONNX version of the model?