Purge duplicate "decoder.weight", rely on tied weights instead c0e4443 Tom Aarsen commited on about 1 month ago
Update the arch: ModernBertModel to ModernBertForMaskedLM 290243f Tom Aarsen commited on Dec 11, 2024