Uploaded by nreimers (commit 8981c8d)

distilbert-base-uncased, trained for 1000K (one million) steps with a batch size of 64 on C4, MSMARCO, Wikipedia, S2ORC, and News.
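
To give a rough sense of scale, the step count and batch size can be multiplied to estimate how many training examples the model saw. This is only a back-of-the-envelope sketch: it assumes one batch of 64 per step, with no gradient accumulation or multi-device scaling, which the description does not confirm. The variable names are illustrative.

```python
# Figures taken from the model description.
steps = 1_000_000   # "1000K steps"
batch_size = 64     # "batch size 64"

# Assumption (not stated in the source): one batch per step,
# no gradient accumulation and no per-device batch multiplication.
total_examples = steps * batch_size

print(f"{total_examples:,} training examples")  # 64,000,000 training examples
```

If the batch size was per device, or if gradients were accumulated across batches, the true number would be a multiple of this estimate.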