SebastianBodza
/

mpt-30B-qlora-multi_GPU

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

SebastianBodza commited on Jun 30, 2023

Commit

d36cc58

·

1 Parent(s): a7b39ef

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -1,4 +1,9 @@
-# MPT-7B LoRA Patch
 This is the Python model code for MPT-7B patched so that it can be used with a LoRA. Note that while I tested that it works and I get reasonable results out, it is very possible that the model isn't being trained correctly. The model code specifically says that left padding is not supported, but I forcibly did so and got decent results.

+# MPT-7B LoRA Patch - multi GPU
+Multi-GPU bugfix for MPT-30B
 This is the Python model code for MPT-7B patched so that it can be used with a LoRA. Note that while I tested that it works and I get reasonable results out, it is very possible that the model isn't being trained correctly. The model code specifically says that left padding is not supported, but I forcibly did so and got decent results.