bluelike commited on
Commit
9c1a731
·
verified ·
1 Parent(s): 37ce9f6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -428,7 +428,7 @@ The model supports a wide range of resolution inputs. By default, it uses the na
428
  min_pixels = 256 * 28 * 28
429
  max_pixels = 1280 * 28 * 28
430
  processor = AutoProcessor.from_pretrained(
431
- "Qwen/Qwen2.5-VL-7B-Instruct", min_pixels=min_pixels, max_pixels=max_pixels
432
  )
433
  ```
434
 
@@ -478,6 +478,7 @@ To handle extensive inputs exceeding 32,768 tokens, we utilize [YaRN](https://ar
478
 
479
  For supported frameworks, you could add the following to `config.json` to enable YaRN:
480
 
 
481
  {
482
  ...,
483
  "type": "yarn",
@@ -489,6 +490,7 @@ For supported frameworks, you could add the following to `config.json` to enable
489
  "factor": 4,
490
  "original_max_position_embeddings": 32768
491
  }
 
492
 
493
  However, it should be noted that this method has a significant impact on the performance of temporal and spatial localization tasks, and is therefore not recommended for use.
494
 
 
428
  min_pixels = 256 * 28 * 28
429
  max_pixels = 1280 * 28 * 28
430
  processor = AutoProcessor.from_pretrained(
431
+ "Qwen/Qwen2.5-VL-3B-Instruct", min_pixels=min_pixels, max_pixels=max_pixels
432
  )
433
  ```
434
 
 
478
 
479
  For supported frameworks, you could add the following to `config.json` to enable YaRN:
480
 
481
+ ```
482
  {
483
  ...,
484
  "type": "yarn",
 
490
  "factor": 4,
491
  "original_max_position_embeddings": 32768
492
  }
493
+ ```
494
 
495
  However, it should be noted that this method has a significant impact on the performance of temporal and spatial localization tasks, and is therefore not recommended for use.
496