AISE-TUDelft/Starcoder2-15b-32f-gguf
Updated
AISE-TUDelft/ViTFT_16-32t_gpt2_ioi_20250108
0.1B
•
Updated
•
2
AISE-TUDelft/ViTFT_16-32t_gpt2_20240101
0.1B
•
Updated
•
2
AISE-TUDelft/100M_babylm_ascii__SPM-BPE_6144__8000S__32g__256b__0.00125lr12L_1024H_2048I_16h__debertav2
Fill-Mask
•
0.1B
•
Updated
•
6
AISE-TUDelft/10M_fwedu_0.001_ascii__SPM-BPE_6144__8000S__32g__256b__0.00125lr12L_1024H_2048I_16h__debertav2
Fill-Mask
•
0.1B
•
Updated
•
6
AISE-TUDelft/10M_babylm_ascii__SPM-BPE_6144__8000S__32g__256b__0.00125lr12L_1024H_2048I_16h__debertav2
Fill-Mask
•
0.1B
•
Updated
•
8
AISE-TUDelft/StarCoder2Java-15b_ep3
Text Generation
•
16B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-15b_ep2
Text Generation
•
16B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-15b_ep1
Text Generation
•
16B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-7b_ep3
Text Generation
•
7B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-7b_ep2
Text Generation
•
7B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-7b_ep1
Text Generation
•
7B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-3b_ep3
Text Generation
•
3B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-3b_ep2
Text Generation
•
3B
•
Updated
•
4
AISE-TUDelft/StarCoder2Java-3b_ep1
Text Generation
•
3B
•
Updated
•
4
AISE-TUDelft/Custom-Activations-GPT-KAN
Text Generation
•
Updated
•
11
AISE-TUDelft/Custom-Activations-GPT-SiLU
Text Generation
•
Updated
•
6
AISE-TUDelft/Custom-Activations-BERT-Swish
Fill-Mask
•
Updated
•
3
AISE-TUDelft/Custom-Activations-BERT-SiLU
Fill-Mask
•
Updated
•
3
AISE-TUDelft/Custom-Activations-GPT-PReLU
Text Generation
•
Updated
•
5
AISE-TUDelft/Custom-Activations-GPT-ReLU
Text Generation
•
Updated
•
4
AISE-TUDelft/Custom-Activations-BERT-PReLU
Fill-Mask
•
Updated
•
4
AISE-TUDelft/Custom-Activations-BERT-ReLU
Fill-Mask
•
Updated
•
3
AISE-TUDelft/Custom-Activations-GPT-Adaptive-GELU
Text Generation
•
Updated
•
4
AISE-TUDelft/Custom-Activations-GPT-GELU
Text Generation
•
Updated
•
4
AISE-TUDelft/Custom-Activations-BERT-Adaptive-GELU
Fill-Mask
•
Updated
•
2
AISE-TUDelft/Custom-Activations-BERT-GELU
Fill-Mask
•
Updated
•
3
AISE-TUDelft/Custom-Activations-GPT-Swish
Text Generation
•
Updated
•
9
AISE-TUDelft/JonBERTa-attn-ft-coco-5L
Text Classification
•
0.1B
•
Updated
•
4
AISE-TUDelft/JonBERTa-attn-ft-coco-4L
Text Classification
•
0.1B
•
Updated
•
4