Commit History
more pydantic fixes (#1338)
3f69571
Support user-defined prompt processing strategies for dpo (#1248)
1e3d530
add lion-pytorch optimizer (#1299) [skip ci]
1648279
Add StableLM 2 Example Scripts (#1327) [skip ci]
f30d062
hotfix to exclude_unset from pydantic config when converting back to a dict (#1334)
269c543
hotfix for missing outputs params (#1333)
e7eed20
hotfix for lora rank (#1332)
cf00231
hotfix for capabilities loading (#1331)
7de912e
chore: update readme to be more clear (#1326) [skip ci]
c6b01e0
Pydantic 2.x cfg (#1239)
cc3cebf
make mlflow optional (#1317)
5894f0e
Use yaml codeblock for config.yaml field (#1303) [skip ci]
5cf226e
committed by kallewoof
fix(readme): Clarify doc for tokenizer_config (#1323) [skip ci]
2ed52bd
deprecate: pytorch 2.0.1 image (#1315) [skip ci]
a359579
multipack for gemma (#1313)
2752d5f
Adding Google's gemma Model (#1312)
9e300ac
fix(readme): update inference md link (#1311) [skip ci]
3d2cd80
Add instructions for playing with qlora model to colab example (#1290)
6ab69ec
Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets (#1291)
3c00f40
committed by David Meikle
fix(examples): remove is_*_derived as it's parsed automatically (#1297)
a7a9a14
Validation always happens on first step (#1300)
e2786cc
Add seq2seq eval benchmark callback (#1274)
5a5d474
Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
8430db2
allow the optimizer prune ratio for ReLoRA to be configurable (#1287)
4b997c3
Add MPS support (#1264)
fac2d98
don't use load and push together (#1284)
ea00dd0
Update README.md (#1281)
b2a4cb4
run the docker image builds and push on gh action gpu runners (#1218)
aaf54dc
add support for https remote yamls (#1277)
9bca7db
allow remote data paths (#1278)
91cf4ee
copy edits (#1276)
1daecd1
Add link to axolotl cloud image on latitude (#1275)
4a654b3
simplify handling for newer multipack patches so they can be added in a single place (#1270)
5698943
contributor avatars (#1269)
411293b
Fix bug preventing model_kwargs being injected (#1262)
73f1bda
committed by Zac Brannelly
lock pytorch (#1247) [skip ci]
1c7ed26
committed by JohanWork
Add more save strategies for DPO training. (#1255)
13eea21
committed by Philip May