Commit History
c996881  add support for rpo_alpha (#1681)
1f151c0  re-enable DPO for tests in modal ci (#1374)
16d46b7  re-enable phi for tests in modal ci (#1373)
fe650dd  make sure the CI fails when pytest script fails (#1669)
cc11c6b  Generalizing the chat_template prompt strategy (#1660) [skip ci]
7c2bf30  Fix llama3 chat_template (extra <|eot_id|> on last turn) (#1635)
22ae21a  Add KTO support (#1640)
50421c8  feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553)
601c08b  ADD: warning hub model (#1301)
98c25e1  Add ORPO example and e2e test (#1572)
7477a53  wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548)
7d1d22f  ORPO Trainer replacement (#1551)
c10563c  fix broken linting (#1541)
bf4cd67  feat: validate sample packing requires flash_attention (#1465)
e634118  Support loading datasets saved via save_to_disk (#1432)
601b77b  make sure to capture non-null defaults from config validation (#1415)
ff939d8  fix(dataset): normalize tokenizer config and change hash from tokenizer class to tokenizer path (#1298)
2a1589f  strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428)
40a88e8  Feat: Add sharegpt multirole (#1137)
2ea70eb  ORPO (#1419)
05bcc9e  Train parameters exclusively in specific ranges (#1390)
b7d8a7d  Add Glaive conversation format support (#1365)
4d09b42  plain input/output prompt strategy w/o chat templates (#1346)
0001862  run tests again on Modal (#1289) [skip ci]
6b3b271  fix for protected model_ namespace w pydantic (#1345)
0f985e1  more fixes 20240228 (#1342) [skip ci]
cc3cebf  Pydantic 2.x cfg (#1239)
5894f0e  make mlflow optional (#1317)
8430db2  Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
c7cf381  Pretrain transforms (#1261)
8c2e05a  relora: magnitude pruning of the optimizer (#1245)
00568c1  support for true batches with multipack (#1230)
25e037f  Support for additional_special_tokens (#1221) [skip ci]
4cb7900  Peft lotfq (#1222)
af29d81  ADD: warning if hub_model_id ist set but not any save strategy (#1202)
98b4762  Feat/chatml add system message (#1117)
814aee6  Phi2 multipack (#1173)
5439707  Feat(test): Add tests for alpaca chatml prompt tokenizer (#1088)
e799e08  Falcon embeddings (#1149) [skip docker]
2ce5c0d  Deprecate max packed sequence len (#1141)
6910e6a  Multipack simplify for Mixtral (#1142)
8487b97  Add `layers_to_transform` for `lora_config` (#1118), authored by xzuyn
0865613  Enable or disable bf16 support based on availability (#1116), authored by Simon Hällqvist