Commit History
feat: scan layers + gradient checkpointing (#161)
07a6f9a
fix: smelu
7f2f8ed
fix: sinkformer
2c583b3
fix: support smelu
a2dcee4
feat: allow relative position (#156)
769d20a
feat: sinkhorn in lse mode (#155)
00d4661
fix: sinkformer gradient
eed4896
feat(model): allow bias (#152)
361a994
feat: add sinkformer + custom final ln + pre-ln (#151)
f139b0b
feat: force final ln in encoder
32f4ba5
feat: allow more configurations
5bd4c20
fix: DeepNet doesn't scale weights of embedding/output layers (#150)
503d6b4
Shuming Ma
committed on
feat: remove unnecessary LN
02824a7
feat: add cogview
472c4cc
feat: implement transformer variants (#144)
542378c
feat(data): super conditioning (#141)
7939874
feat: support pod (#139)
803ccbf
feat: handle gradient checkpointing
5173ec7
feat: restore weights on CPU
5f954fc
fix: position embedding for generate method
ebac379
fix: typo
68cc185
fix: load from checkpoint
44b7c3e
feat(modeling): simplify abstract_init
fa72aa7
feat: handle model parallel
1bb3269
feat(train): distributed_shampoo with pjit
cc34d07
feat(model): clean way to load on cpu
12f323d
feat: use_artifact if run existing
a5ed112
Tokenizer, config, model can be loaded from wandb.
7e48337
Pedro Cuenca
committed on
Change import order again.
2b2be9b
Pedro Cuenca
committed on
Fix import order to make isort happy.
64d99b2
Pedro Cuenca
committed on
Accept changes suggested by linter.
9f522b8
Pedro Cuenca
committed on
Never consider local dirs as remote wandb references.
08dd098
Pedro Cuenca
committed on
Store resolved path after loading model.
55a631d
Pedro Cuenca
committed on
Override from_pretrained to support wandb artifacts.
1023afa
Pedro Cuenca
committed on