Handle model parallelism
#4
by
sgugger
- opened
With this added line (similar to many models in Transformers), this model will work with device_map="auto"
during training.
With this added line (similar to many models in Transformers), this model will work with device_map="auto"
during training.