Update README.md
Browse files
README.md
CHANGED
@@ -4,7 +4,7 @@ datasets:
|
|
4 |
- 5CD-AI/LLaVA-CoT-o1-Instruct
|
5 |
base_model:
|
6 |
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
7 |
-
- google/vit-large-
|
8 |
library_name: transformers
|
9 |
---
|
10 |
|
@@ -20,6 +20,8 @@ library_name: transformers
|
|
20 |
|
21 |
Vision language models with chain-of-thought reasoning are just starting to emerge. This is a proof-of-concept to train a vision model with thinking-enabled chat templates based on DeepSeek-R1 models.
|
22 |
|
|
|
|
|
23 |
## Setup
|
24 |
```bash
|
25 |
pip install git+https://github.com/facebookresearch/schedule_free.git
|
@@ -29,10 +31,18 @@ cd seers/seers/
|
|
29 |
git clone https://huggingface.co/mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
|
30 |
```
|
31 |
## Test
|
32 |
-
Run
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
```bash
|
34 |
-
python
|
35 |
```
|
|
|
36 |
## Training Details
|
37 |
-
This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-
|
38 |
It has been trained using [seers](https://github.com/mkturkcan/seers).
|
|
|
4 |
- 5CD-AI/LLaVA-CoT-o1-Instruct
|
5 |
base_model:
|
6 |
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
7 |
+
- google/vit-large-patch32-384
|
8 |
library_name: transformers
|
9 |
---
|
10 |
|
|
|
20 |
|
21 |
Vision language models with chain-of-thought reasoning are just starting to emerge. This is a proof-of-concept to train a vision model with thinking-enabled chat templates based on DeepSeek-R1 models.
|
22 |
|
23 |
+
Note that this model will not always use thinking tokens, due to the current lack of high-quality CoT data in non-science contexts.
|
24 |
+
|
25 |
## Setup
|
26 |
```bash
|
27 |
pip install git+https://github.com/facebookresearch/schedule_free.git
|
|
|
31 |
git clone https://huggingface.co/mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
|
32 |
```
|
33 |
## Test
|
34 |
+
Run, in the seers/seers folder,
|
35 |
+
```bash
|
36 |
+
python predict_llava.py
|
37 |
+
```
|
38 |
+
|
39 |
+
## Train
|
40 |
+
|
41 |
+
[seers](https://github.com/mkturkcan/seers) training code is public! Run
|
42 |
```bash
|
43 |
+
python train_cot_mixed.py
|
44 |
```
|
45 |
+
|
46 |
## Training Details
|
47 |
+
This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B) on the [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct) dataset.
|
48 |
It has been trained using [seers](https://github.com/mkturkcan/seers).
|