Shuyue Jia (Bruce)
shuyuej
AI & ML interests
A Ph.D. Student at @vkola-lab, Boston University. Passionate about Large Language Models (LLMs), Multimodal Foundation Models, Generative AI, and Medical AI.
Recent Activity
updated
a model
about 19 hours ago
shuyuej/Ministral-8B-Instruct-2410-2048-with-grad_norm
updated
a model
about 19 hours ago
shuyuej/Llama-3.1-8B-Instruct-2048-with-grad_norm
updated
a model
about 19 hours ago
shuyuej/Differential-Diagnoser-GPTQ-Model
Organizations
shuyuej's activity
quant versions?
1
#12 opened 19 days ago
by
apol
RecursionError: maximum recursion depth exceeded
1
#1 opened over 1 year ago
by
WajihUllahBaig
missing model.safetensors.index.json
3
#1 opened 5 months ago
by
kresimirfijacko
Can you create gptq 8 bits quants?
1
#1 opened 5 months ago
by
rjmehta
Can you provide one model using `group_size=1024` to make the model smaller?
#15 opened 5 months ago
by
shuyuej
Update quantize_config.json
1
#12 opened 5 months ago
by
shuyuej
Update config.json
1
#11 opened 5 months ago
by
shuyuej
Source codes to quantize the LLaMA 3.1 405B model
3
#10 opened 5 months ago
by
shuyuej
Request for Mistral Large Instruct GPTQ INT4
4
#2 opened 5 months ago
by
sparsh35
Missing config.json
5
#6 opened 5 months ago
by
wxl2001
Where can we download `quant.py`?
1
#1 opened 5 months ago
by
shuyuej
Learning Rate during pretraining
1
#58 opened 5 months ago
by
shuyuej
About the tokenizer - Why use LLaMA tokenizer?
#5 opened 5 months ago
by
shuyuej
Model max_seq_length
7
#6 opened 5 months ago
by
shuyuej
Model max_seq_length
1
#4 opened 5 months ago
by
shuyuej
Where can we find `eval_medical_llm.py` and `main.py`
1
#15 opened 7 months ago
by
shuyuej
Fine-Tune a gemma model for question answering
17
#62 opened 10 months ago
by
Iamexperimenting
Weird Performance Issue with Gemma-7b compared to Gemma-2b with Qlora
6
#91 opened 8 months ago
by
UserDAN