Lil-Bevo is UT Austin's submission to the BabyLM challenge, specifically the *st
## TLDR:

- Unigram tokenizer trained on the 10M BabyLM tokens plus the MAESTRO dataset, for a vocab size of 16k.
- `deberta-small-v3` trained on a mixture of MAESTRO and the 10M tokens for 5 epochs.
- The model continues training for 50 epochs on the 10M tokens with a sequence length of 128.
- The model is then trained for 2 epochs with targeted linguistic masking, with a sequence length of 512.
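As a rough illustration of what a Unigram tokenizer does once trained, the sketch below runs Viterbi decoding over a toy vocabulary: it picks the segmentation with the highest product of piece probabilities. The pieces and probabilities here are invented for the example; the actual 16k vocabulary and its probabilities are learned from the training data.

```python
import math

def unigram_tokenize(word, vocab):
    """Segment `word` into the most probable sequence of vocab pieces
    under a unigram LM (the decoding step of a Unigram tokenizer)."""
    n = len(word)
    # best[i] = (best log-prob of word[:i], start index of the last piece)
    best = [(-math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for end in range(1, n + 1):
        for start in range(end):
            piece = word[start:end]
            if piece in vocab:
                score = best[start][0] + math.log(vocab[piece])
                if score > best[end][0]:
                    best[end] = (score, start)
    # follow backpointers to recover the winning segmentation
    pieces, end = [], n
    while end > 0:
        start = best[end][1]
        pieces.append(word[start:end])
        end = start
    return pieces[::-1]

# Toy vocab: multi-character pieces are more probable than single characters,
# so "ungram" segments as ["un", "gram"] rather than character by character.
vocab = {"un": 0.1, "gram": 0.1,
         "u": 0.01, "n": 0.01, "g": 0.01, "r": 0.01, "a": 0.01, "m": 0.01}
```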
This README will be updated with more details soon.
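Targeted linguistic masking is not documented here yet. As a minimal sketch of the general idea, the hypothetical helper below restricts MLM masking to pre-selected token positions (e.g. positions flagged by some linguistic criterion) instead of sampling them uniformly; the `MASK_ID` and all names are made up for illustration.

```python
import random

MASK_ID = 4  # hypothetical [MASK] token id

def targeted_mask(token_ids, target_positions, mask_prob=0.8, seed=0):
    """Mask only linguistically targeted positions, leaving the rest
    of the sequence intact. Returns (masked input, MLM labels)."""
    rng = random.Random(seed)
    masked = list(token_ids)
    labels = [-100] * len(token_ids)  # -100 = position ignored by the MLM loss
    for pos in target_positions:
        if rng.random() < mask_prob:
            labels[pos] = masked[pos]  # the model must predict the original token
            masked[pos] = MASK_ID
    return masked, labels
```

With `mask_prob=1.0`, `targeted_mask([10, 11, 12, 13, 14], [1, 3])` masks exactly positions 1 and 3 and leaves every other token (and its loss label) untouched.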