Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
nvidia
/
Llama-3_1-Nemotron-51B-Instruct
like
183
Text Generation
Transformers
Safetensors
PyTorch
English
nemotron-nas
nvidia
llama-3
conversational
custom_code
arxiv:
4 papers
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
19
Train
Use this model
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#19
by
tomer-nv
- opened
6 days ago
base:
refs/heads/main
←
from:
refs/pr/19
Discussion
Files changed
+45
-1
tomer-nv
NVIDIA org
6 days ago
No description provided.
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
775f6527
itlevy
changed pull request status to
merged
6 days ago
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment