distilbert-mlm-250k / Readme.md
nreimers's picture
upload
8296347
|
raw
history blame
105 Bytes

distilbert-base-uncased trained for 250K steps with batch size 64 on C4, MSMARCO, Wikipedia, S2ORC, News