[email protected]
commited on
Commit
•
dad27a5
1
Parent(s):
ebd553d
Update readme
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ Today (September 17th, 2024), we introduce [NVLM 1.0](https://arxiv.org/abs/2409
|
|
19 |
In this repo, we are open-sourcing NVLM-1.0-D-72B (decoder-only architecture), the decoder-only model weights and code for the community.
|
20 |
|
21 |
## Other Resources
|
22 |
-
[Inference Code (HF)](https://huggingface.co/nvidia/NVLM-D-72B/tree/main)   [Training Code (Coming soon)]()   [Website](https://
|
23 |
|
24 |
## Benchmark Results
|
25 |
We train our model with legacy [Megatron-LM](https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/legacy) and adapt the codebase to Huggingface for model hosting, reproducibility, and inference.
|
|
|
19 |
In this repo, we are open-sourcing NVLM-1.0-D-72B (decoder-only architecture), the decoder-only model weights and code for the community.
|
20 |
|
21 |
## Other Resources
|
22 |
+
[Inference Code (HF)](https://huggingface.co/nvidia/NVLM-D-72B/tree/main)   [Training Code (Coming soon)]()   [Website](https://research.nvidia.com/labs/adlr/NVLM-1/)   [Paper](https://arxiv.org/abs/2409.11402)
|
23 |
|
24 |
## Benchmark Results
|
25 |
We train our model with legacy [Megatron-LM](https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/legacy) and adapt the codebase to Huggingface for model hosting, reproducibility, and inference.
|