# @package train.optimizer
_target_: torch.optim.AdamW
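# Sketch, not part of the original file: configs with a `_target_` key are
# typically built with `hydra.utils.instantiate`, which calls the target with
# the remaining keys as keyword arguments. The parameters to optimize are not
# in this file, so the caller would supply them, for example (`cfg` and `model`
# are hypothetical names):
#   optimizer = hydra.utils.instantiate(cfg.train.optimizer, params=model.parameters())
# Hyperparameters could be set as extra keys. The values below are the
# torch.optim.AdamW defaults, left commented out for illustration only:
# lr: 0.001
# weight_decay: 0.01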