Qwen models with custom class for bidirectional attention
Joao Coelho
jmvcoelho
AI & ML interests
None yet
Organizations
models
14
jmvcoelho/Qwen2.5-0.5B-bidirectional-attn-mntp
0.5B
•
Updated
•
3
jmvcoelho/Qwen2.5-0.5B-bidirectional-attn
0.5B
•
Updated
•
3
jmvcoelho/ad-classifier-v0.2
Text Classification
•
0.2B
•
Updated
•
3
jmvcoelho/ad-classifier-v0.1
Text Classification
•
0.2B
•
Updated
•
4
jmvcoelho/ad-classifier-v0.0
Text Classification
•
0.2B
•
Updated
•
3
jmvcoelho/GPTNeoX-160m
0.2B
•
Updated
•
7
•
1
jmvcoelho/pythia-160m-1024-marco-docs-bow-contrastive-pretrain
Updated
•
4
jmvcoelho/t5-base-marco-lm-pretrain-2048
Updated
•
2
jmvcoelho/t5-base-marco-crop-pretrain-2048
Updated
•
3
jmvcoelho/t5-base-marco-2048
Updated
•
1