mehmetkeremturkcan commited on
Commit
21ef8ca
·
verified ·
1 Parent(s): 9ddd5dd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -4
README.md CHANGED
@@ -4,7 +4,7 @@ datasets:
4
  - 5CD-AI/LLaVA-CoT-o1-Instruct
5
  base_model:
6
  - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
7
- - google/vit-large-patch16-224
8
  library_name: transformers
9
  ---
10
 
@@ -20,6 +20,8 @@ library_name: transformers
20
 
21
  Vision language models with chain-of-thought reasoning are just starting to emerge. This is a proof-of-concept to train a vision model with thinking-enabled chat templates based on DeepSeek-R1 models.
22
 
 
 
23
  ## Setup
24
  ```bash
25
  pip install git+https://github.com/facebookresearch/schedule_free.git
@@ -29,10 +31,18 @@ cd seers/seers/
29
  git clone https://huggingface.co/mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
30
  ```
31
  ## Test
32
- Run
 
 
 
 
 
 
 
33
  ```bash
34
- python predict.py
35
  ```
 
36
  ## Training Details
37
- This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) on the [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct) dataset.
38
  It has been trained using [seers](https://github.com/mkturkcan/seers).
 
4
  - 5CD-AI/LLaVA-CoT-o1-Instruct
5
  base_model:
6
  - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
7
+ - google/vit-large-patch32-384
8
  library_name: transformers
9
  ---
10
 
 
20
 
21
  Vision language models with chain-of-thought reasoning are just starting to emerge. This is a proof-of-concept to train a vision model with thinking-enabled chat templates based on DeepSeek-R1 models.
22
 
23
+ Note that this model will not always use thinking tokens, due to the current lack of high-quality CoT data in non-science contexts.
24
+
25
  ## Setup
26
  ```bash
27
  pip install git+https://github.com/facebookresearch/schedule_free.git
 
31
  git clone https://huggingface.co/mehmetkeremturkcan/DeepSeek-LLaVA-Instruct
32
  ```
33
  ## Test
34
+ Run, in the seers/seers folder,
35
+ ```bash
36
+ python predict_llava.py
37
+ ```
38
+
39
+ ## Train
40
+
41
+ [seers](https://github.com/mkturkcan/seers) training code is public! Run
42
  ```bash
43
+ python train_cot_mixed.py
44
  ```
45
+
46
  ## Training Details
47
+ This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B) on the [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct) dataset.
48
  It has been trained using [seers](https://github.com/mkturkcan/seers).