redmoe-ai-v1 commited on
Commit
0b22634
·
verified ·
1 Parent(s): 3d547ab

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -34,3 +34,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
 
 
 
 
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
+ figures/new_logo.png filter=lfs diff=lfs merge=lfs -text
38
+ figures/performance.png filter=lfs diff=lfs merge=lfs -text
39
+ figures/wechat.png filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -66,11 +66,11 @@ The highlights from `dots.llm1` include:
66
 
67
  ## 3. dots.llm1.inst.FP8-dynamic
68
 
69
- ### Docker (vllm)
70
-
71
  We release the quantized `dots.llm1.inst.FP8-dynamic` model, which retains approximately 98% of the original performance after quantization.
72
 
73
- run vllm inference using docker `rednote-hilab/vllm-openai-v0.9.1`. The docker images are available on [Docker Hub](https://hub.docker.com/repository/docker/rednotehilab/dots1/tags), based on the official images.
 
 
74
 
75
  ```bash
76
  python3 -m vllm.entrypoints.openai.api_server --model rednote-hilab/dots.llm1.inst.FP8-dynamic --tensor-parallel-size 4 --pipeline-parallel-size 1 --trust-remote-code --served-model-name dots1
 
66
 
67
  ## 3. dots.llm1.inst.FP8-dynamic
68
 
 
 
69
  We release the quantized `dots.llm1.inst.FP8-dynamic` model, which retains approximately 98% of the original performance after quantization.
70
 
71
+ ### Docker (vllm)
72
+
73
+ For convenience, we recommend running vLLM inference using our Docker image `rednote-hilab/vllm-openai-v0.9.1`, , which is available on [Docker Hub](https://hub.docker.com/repository/docker/rednotehilab/dots1/tags).
74
 
75
  ```bash
76
  python3 -m vllm.entrypoints.openai.api_server --model rednote-hilab/dots.llm1.inst.FP8-dynamic --tensor-parallel-size 4 --pipeline-parallel-size 1 --trust-remote-code --served-model-name dots1
figures/XHSlong750px.png ADDED
figures/new_logo.png ADDED

Git LFS Details

  • SHA256: 2e5808698bcd60df90869af469743248a4560d0ffb2232eceb74cd9c0a7df763
  • Pointer size: 131 Bytes
  • Size of remote file: 101 kB
figures/new_logo2.png ADDED
figures/performance.png ADDED

Git LFS Details

  • SHA256: ca42a057f65c1ea12c303e41938dbe38fc285769002272af767b76605cf8ea98
  • Pointer size: 131 Bytes
  • Size of remote file: 139 kB
figures/wechat.png ADDED

Git LFS Details

  • SHA256: e6f386b64bd313bd998bf0f25e9f1b32c0fbbfe7d972a60227c22fdc044da885
  • Pointer size: 131 Bytes
  • Size of remote file: 118 kB