Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
dde02fc
qwerrwe
/
src
/
axolotl
/
utils
Ctrl+K
Ctrl+K
100 contributors
History:
525 commits
JohanWork
ADD: warning hub model (#1301)
601c08b
unverified
about 1 year ago
callbacks
Update SaveAxolotlConfigtoWandBCallback to use artifact instead of save (#1483)
about 1 year ago
config
ADD: warning hub model (#1301)
about 1 year ago
data
wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548)
about 1 year ago
gradient_checkpointing
Unsloth gradient checkpointing offload (#1528)
about 1 year ago
samplers
support for true batches with multipack (#1230)
over 1 year ago
__init__.py
Safe
136 Bytes
WIP: Support table logging for mlflow, too (#1506)
about 1 year ago
bench.py
Safe
2.44 kB
FDSP + QLoRA (#1378)
about 1 year ago
chat_templates.py
Safe
3.62 kB
add support for cohere chat template (#1478)
about 1 year ago
collators.py
Safe
9.78 kB
Pretrain multipack v2 (#1470)
about 1 year ago
dict.py
Safe
327 Bytes
Pydantic 2.x cfg (#1239)
about 1 year ago
distributed.py
Safe
7.14 kB
DBRX Model Support (#1462)
about 1 year ago
freeze.py
Safe
8.33 kB
ORPO (#1419)
about 1 year ago
lora_embeddings.py
Safe
375 Bytes
Phi2 multipack (#1173)
over 1 year ago
mlflow_.py
Safe
543 Bytes
ADD: push checkpoints to mlflow artifact registry (#1295) [skip ci]
about 1 year ago
model_shard_quant.py
Safe
8.95 kB
DBRX Model Support (#1462)
about 1 year ago
models.py
Safe
37.6 kB
make sure everything stays in the same dtype when using dpo + FSDP (#1559)
about 1 year ago
schedulers.py
Safe
7.66 kB
Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
over 1 year ago
tokenization.py
Safe
5.16 kB
Add debug option for RL dataset preprocessing (#1404)
about 1 year ago
trainer.py
Safe
16.4 kB
PoSE context length ext (#1567)
about 1 year ago
wandb_.py
Safe
609 Bytes
Feat(wandb): Refactor to be more flexible (#767)
over 1 year ago