Transformers
PyTorch
Graphcore
English
groupbert
Generated from Trainer

Lowering matmul_proportion and moving optimizer state offchip to avoid OOM on test 'groupbert_swag'

graphcore-rahult changed pull request status to merged
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment