Upload folder using huggingface_hub
Browse files- .gitattributes +3 -0
- README.md +3 -3
- figures/XHSlong750px.png +0 -0
- figures/new_logo.png +3 -0
- figures/new_logo2.png +0 -0
- figures/performance.png +3 -0
- figures/wechat.png +3 -0
.gitattributes
CHANGED
@@ -34,3 +34,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
37 |
+
figures/new_logo.png filter=lfs diff=lfs merge=lfs -text
|
38 |
+
figures/performance.png filter=lfs diff=lfs merge=lfs -text
|
39 |
+
figures/wechat.png filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -66,11 +66,11 @@ The highlights from `dots.llm1` include:
|
|
66 |
|
67 |
## 3. dots.llm1.inst.FP8-dynamic
|
68 |
|
69 |
-
### Docker (vllm)
|
70 |
-
|
71 |
We release the quantized `dots.llm1.inst.FP8-dynamic` model, which retains approximately 98% of the original performance after quantization.
|
72 |
|
73 |
-
|
|
|
|
|
74 |
|
75 |
```bash
|
76 |
python3 -m vllm.entrypoints.openai.api_server --model rednote-hilab/dots.llm1.inst.FP8-dynamic --tensor-parallel-size 4 --pipeline-parallel-size 1 --trust-remote-code --served-model-name dots1
|
|
|
66 |
|
67 |
## 3. dots.llm1.inst.FP8-dynamic
|
68 |
|
|
|
|
|
69 |
We release the quantized `dots.llm1.inst.FP8-dynamic` model, which retains approximately 98% of the original performance after quantization.
|
70 |
|
71 |
+
### Docker (vllm)
|
72 |
+
|
73 |
+
For convenience, we recommend running vLLM inference using our Docker image `rednote-hilab/vllm-openai-v0.9.1`, , which is available on [Docker Hub](https://hub.docker.com/repository/docker/rednotehilab/dots1/tags).
|
74 |
|
75 |
```bash
|
76 |
python3 -m vllm.entrypoints.openai.api_server --model rednote-hilab/dots.llm1.inst.FP8-dynamic --tensor-parallel-size 4 --pipeline-parallel-size 1 --trust-remote-code --served-model-name dots1
|
figures/XHSlong750px.png
ADDED
![]() |
figures/new_logo.png
ADDED
![]() |
Git LFS Details
|
figures/new_logo2.png
ADDED
![]() |
figures/performance.png
ADDED
![]() |
Git LFS Details
|
figures/wechat.png
ADDED
![]() |
Git LFS Details
|