Muhammad Osama
mosama
AI & ML interests
None yet
Recent Activity
updated
a model
13 minutes ago
mosama/Qwen2.5-0.5B-Kaggle-Float16-Pretrained-arb-eng-urd
updated
a model
14 minutes ago
mosama/Qwen2.5-0.5B-Kaggle-Float16-Pretrained-arb-eng-urd
updated
a model
about 1 hour ago
mosama/Qwen2.5-0.5B-Kaggle-Float16-Pretrained-arb-eng-urd
Organizations
mosama's activity
tensor size mismatch
2
#9 opened 4 months ago
by
Daemontatox
Train Mistral 7B 0.2
9
#2 opened about 1 year ago
by
mosama
Error: `rope_scaling`must be a dictionary with two fields
6
#1 opened about 1 year ago
by
LeMoussel
Model loading datatype bfloat16 or simple float16?
#2 opened about 1 year ago
by
mosama
With use_cache=False, the reponse is taking very long
#41 opened about 1 year ago
by
mosama
No chat template in tokenizer
2
#2 opened about 1 year ago
by
mosama
Output Score
4
#7 opened about 1 year ago
by
mosama