Added GLUMLP, changed config accordingly, added code to convert state_dict 0211324 Markus28 commited on Mar 22, 2024
Revert "feat: added back option to disable flash attention" b7ee9c4 Markus28 commited on Feb 21, 2024