An experiment with the goal of reducing hallucinations in [VQA](https://huggingface.co/tasks/visual-question-answering)

First in a series of projects centered on fine-tuning for image captioning.

<h1>Release Notes</h1>
* v0.1 - Initial release
* v0.2 (Current) - Updated the base model to the official Mistral-7b fp16 release; refinements to dataset and instruction formatting
<h2>Background & Methodology</h2>

The Mistral-7b-02 base model was fine-tuned using the [RealWorldQA dataset](https://huggingface.co/datasets/visheratin/realworldqa), originally provided by the X.Ai Team here: https://x.ai/blog/grok-1.5v
<h1>Vision Results</h1>

<img src="https://i.imgur.com/E9mS4Xb.jpeg" width="400"/>
* The experiment yielded a model that provides shorter, less verbose output for questions about pictures
* The likelihood of hallucinations in output has decreased; however, the model can still be easily influenced by the user into giving inaccurate answers
* Best suited for captioning use cases that require concise descriptions and low token counts
* This model lacks the conversational prose of Excalibur-7b-DPO and is much "drier" in tone
<b>Requires an additional mmproj file. You have two options for vision functionality (available inside this repo):</b>

1. [Quantized - Limited VRAM Option (197mb)](https://huggingface.co/InferenceIllusionist/Excalibur-7b-DPO-GGUF/resolve/main/mistral-7b-mmproj-v1.5-Q4_1.gguf?download=true)
2. [Unquantized - Premium Option / Best Quality (596mb)](https://huggingface.co/InferenceIllusionist/Excalibur-7b-DPO-GGUF/resolve/main/mmproj-model-f16.gguf?download=true)
Select the gguf file of your choice in [Koboldcpp](https://github.com/LostRuins/koboldcpp/releases/) as usual, then make sure to choose the mmproj file above in the LLaVA mmproj field of the model submenu:

<img src="https://i.imgur.com/x8vqH29.png" width="425"/>
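If you prefer launching Koboldcpp from the command line instead of the GUI, the mmproj file can be passed as a flag. This is a sketch only: the model file name below is a placeholder for whichever quant you downloaded, and flag spellings should be verified against `--help` for your Koboldcpp version.

```shell
# Placeholder model file name; the mmproj file is the unquantized one linked above
python koboldcpp.py --model your-model-quant.gguf --mmproj mmproj-model-f16.gguf
```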
## Prompt Format

Use Alpaca for best results.
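For reference, the standard Alpaca template looks like the following. This is the commonly used format; whether the fine-tune used this exact wording is an assumption, so adjust if your frontend's Alpaca preset differs.

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Response:
```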
## Other info

- **Developed by:** InferenceIllusionist
- **License:** apache-2.0
- **Finetuned from model:** mistral-community/Mistral-7B-v0.2