mehmetkeremturkcan/DeepSeek-LLaVA-Instruct

Image-Text-to-Text

mehmetkeremturkcan committed on Feb 2

Commit 21ef8ca · verified · 1 Parent(s): 9ddd5dd

Update README.md

Files changed (1)

README.md +14 -4

@@ -4,7 +4,7 @@ datasets:
 - 5CD-AI/LLaVA-CoT-o1-Instruct
 base_model:
 - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
-- google/vit-large-patch16-224
 library_name: transformers
 ---
@@ -20,6 +20,8 @@ library_name: transformers
 Vision language models with chain-of-thought reasoning are just starting to emerge. This is a proof-of-concept to train a vision model with thinking-enabled chat templates based on DeepSeek-R1 models.
 ## Setup
 ```bash
 pip install git+https://github.com/facebookresearch/schedule_free.git
@@ -29,10 +31,18 @@ cd seers/seers/
 git clone https://huggingface.co/mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
 ```
 ## Test
-Run
 ```bash
-python predict.py
 ```
 ## Training Details
-This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) on the [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct) dataset.
 It has been trained using [seers](https://github.com/mkturkcan/seers).

 - 5CD-AI/LLaVA-CoT-o1-Instruct
 base_model:
 - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
+- google/vit-large-patch32-384
 library_name: transformers
 ---
 Vision language models with chain-of-thought reasoning are just starting to emerge. This is a proof-of-concept to train a vision model with thinking-enabled chat templates based on DeepSeek-R1 models.
+Note that this model will not always use thinking tokens, due to the current lack of high-quality CoT data in non-science contexts.
 ## Setup
 ```bash
 pip install git+https://github.com/facebookresearch/schedule_free.git
 git clone https://huggingface.co/mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
 ```
 ## Test
+Run, in the seers/seers folder,
+```bash
+python predict_llava.py
+```
+## Train
+[seers](https://github.com/mkturkcan/seers) training code is public! Run
 ```bash
+python train_cot_mixed.py
 ```
 ## Training Details
+This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B) on the [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct) dataset.
 It has been trained using [seers](https://github.com/mkturkcan/seers).
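
Reviewer note on the new README line stating that the model will not always use thinking tokens: downstream code should probably treat the reasoning block as optional. The sketch below is one way to do that, not something taken from the seers repository. It assumes the reasoning is delimited by `<think>` and `</think>` tags, as in the DeepSeek-R1 chat templates, and that the opening tag may be missing from the decoded text if the template already emitted it in the prompt. The function name `split_reasoning` is hypothetical.

```python
from typing import Optional, Tuple


def split_reasoning(generated_text: str) -> Tuple[Optional[str], str]:
    """Split decoded model output into (reasoning, answer).

    Assumption: reasoning, when present, ends with a literal "</think>" tag.
    Returns (None, answer) when the model produced no thinking block.
    """
    head, closing_tag, tail = generated_text.partition("</think>")
    if not closing_tag:
        # No thinking block at all: the whole output is the answer.
        return None, generated_text.strip()
    # Drop the opening tag if it is present in the decoded text.
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


if __name__ == "__main__":
    with_cot = "<think>Count the animals in the image.</think>There are two cats."
    without_cot = "There are two cats."
    print(split_reasoning(with_cot))
    print(split_reasoning(without_cot))
```

Any text that precedes `<think>` is folded into the reasoning part in this sketch. Callers that need stricter parsing would have to adjust it.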