Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
joshmiller656
/
Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
like
2
Text Generation
Transformers
Safetensors
nvidia/HelpSteer2
8 languages
llama
nemotron
awq
quantized
int4
conversational
text-generation-inference
Inference Endpoints
4-bit precision
arxiv:
2410.01257
arxiv:
2405.01481
arxiv:
2406.08673
License:
llama3.1
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
Commit History
Update README.md
b73d0f3
verified
joshmiller656
commited on
Nov 5, 2024
Update README.md
990545b
verified
joshmiller656
commited on
Nov 4, 2024
Update README.md
708a5ad
verified
joshmiller656
commited on
Oct 31, 2024
Update README.md
ef9ae12
verified
joshmiller656
commited on
Oct 30, 2024
Create README.md
ef0056e
verified
joshmiller656
commited on
Oct 30, 2024
Upload folder using huggingface_hub
372de0b
verified
joshmiller656
commited on
Oct 30, 2024
initial commit
4b1a3dd
verified
joshmiller656
commited on
Oct 29, 2024