akswelh/NEOX
arxiv: 29 papers
NEOX/megatron (branch: main)
1 contributor
History: 1 commit
akswelh: Upload 251 files (d90b3a8, verified, 10 months ago)
Name                      Size       Last commit        Updated
data/                     -          Upload 251 files   10 months ago
fused_kernels/            -          Upload 251 files   10 months ago
gradient_noise_scale/     -          Upload 251 files   10 months ago
model/                    -          Upload 251 files   10 months ago
mpu/                      -          Upload 251 files   10 months ago
neox_arguments/           -          Upload 251 files   10 months ago
tokenizer/                -          Upload 251 files   10 months ago
__init__.py               929 Bytes  Upload 251 files   10 months ago
checkpointing.py          17.6 kB    Upload 251 files   10 months ago
devutil.py                1.28 kB    Upload 251 files   10 months ago
initialize.py             8.58 kB    Upload 251 files   10 months ago
learning_rates.py         5.22 kB    Upload 251 files   10 months ago
logging.py                16.4 kB    Upload 251 files   10 months ago
mup_substitute.py         7.8 kB     Upload 251 files   10 months ago
optimizers.py             18.1 kB    Upload 251 files   10 months ago
text_generation_utils.py  42.3 kB    Upload 251 files   10 months ago
training.py               64.4 kB    Upload 251 files   10 months ago
utils.py                  17.6 kB    Upload 251 files   10 months ago