Commit History
ed8ef65  add back packing efficiency estimate so epochs and multi-GPU work properly (#1697)
c996881  add support for rpo_alpha (#1681)
05b0bd0  add back drop_last for sampler (#1676)
f7332ac  use mixins for ORPO and KTO configs so they work with axolotl customizations (#1674)
a27d5e1  enable loraplus setting for DPO trainer (#1646)
6299eb5  allow report_to for multiple providers (#1647)
22ae21a  Add KTO support (#1640)
ba45531  fixes to save on fractional save_steps (#1643)
702a669  add save_only_model option (#1634)
1e1921b  FIX: max_length and max_prompt_length were not being sent to ORPOTrainer (#1584)
29cf15a  improve save callbacks (#1592)
b9bb169  FIX: TRL trainer preprocessing step was running in one process (#1583) (by Ali Mosavian)
5294653  PoSE context length ext (#1567)
68601ec  make sure everything stays in the same dtype when using DPO + FSDP (#1559)
7d1d22f  ORPO Trainer replacement (#1551)
132eb74  DBRX Model Support (#1462)
057fa44  WIP: Support table logging for mlflow, too (#1506)
934fc85  drop empty token from beginning if tokenizer has no bos_token (in the case of qwen) (#1490)
34ba634  Fix ORPO multi-GPU (#1433)
2a1589f  strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428)
7d55607  HF / FEAT: Optimize HF tags (#1425) [skip ci]
dd449c5  support galore once upstreamed into transformers (#1409)
b1e3e1b  fix(config): passing gradient_checkpoint_kwargs (#1412)
2ea70eb  ORPO (#1419)
9b6ee83  FSDP + QLoRA (#1378)
decb66e  lora+ support (#1352)
1648279  add lion-pytorch optimizer (#1299) [skip ci]
5894f0e  make mlflow optional (#1317)
3c00f40  Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets (#1291) (by David Meikle)
5a5d474  Add seq2seq eval benchmark callback (#1274)
8430db2  Scheduler implementation of "Continual Pre-Training of Large Language Models: How to (re)warm your model?" (#1273)
4b997c3  allow the optimizer prune ratio for ReLoRA to be configurable (#1287)
5698943  simplify handling for newer multipack patches so they can be added in a single place (#1270)
13eea21  Add more save strategies for DPO training (#1255) (by Philip May)
8c2e05a  relora: magnitude pruning of the optimizer (#1245)
00568c1  support for true batches with multipack (#1230)
5787e1a  Fix and document test_datasets (#1228)
18f8119  FEAT: add tagging support to axolotl for DPOTrainer (#1209)
33e1170  precompute dpo logprobs setting and fixes (#1199) [skip ci]
5bce45f  more dpo fixes for dataset loading and docs (#1185) [skip ci]
59a31fe  DPO fixes v2 (#1174)
814aee6  Phi2 multipack (#1173)
b8e5603  Add mlflow callback for pushing config to mlflow artifacts (#1125) (by JohanWork)