SebastianBodza
/

mpt-30B-qlora-multi_GPU

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

SebastianBodza commited on Jul 3, 2023

Commit

a748a2b

·

1 Parent(s): d36cc58

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -3,6 +3,7 @@
 Multi-GPU bugfix for MPT-30B
 This is the Python model code for MPT-7B patched so that it can be used with a LoRA. Note that while I tested that it works and I get reasonable results out, it is very possible that the model isn't being trained correctly. The model code specifically says that left padding is not supported, but I forcibly did so and got decent results.

 Multi-GPU bugfix for MPT-30B
+Patch based on: https://github.com/iwalton3/mpt-lora-patch
 This is the Python model code for MPT-7B patched so that it can be used with a LoRA. Note that while I tested that it works and I get reasonable results out, it is very possible that the model isn't being trained correctly. The model code specifically says that left padding is not supported, but I forcibly did so and got decent results.