Li Tan PRO

tanliboy

AI & ML interests

None yet

Organizations

tanliboy's activity

New activity in Qwen/Qwen2.5-7B-Instruct 16 days ago
New activity in Qwen/Qwen2.5-Math-RM-72B 24 days ago

Preference Alignment

4
#6 opened 26 days ago by tanliboy
New activity in meta-llama/Llama-3.1-8B 27 days ago

Text Classification with LLMs

7
#30 opened 2 months ago by dss107
New activity in NousResearch/Hermes-3-Llama-3.1-8B 27 days ago

IFEVAL drop

#16 opened 27 days ago by tanliboy
New activity in Alibaba-NLP/gte-Qwen2-7B-instruct 28 days ago

bfloat16 vs. float32

#34 opened 28 days ago by tanliboy
New activity in Alibaba-NLP/gte-Qwen2-1.5B-instruct 29 days ago

Qwen 2.5 1.5B retrain?

4
#12 opened 30 days ago by tomaarsen
New activity in meta-llama/Llama-3.1-8B-Instruct 30 days ago
New activity in Qwen/Qwen2-VL-7B-Instruct about 1 month ago
New activity in Qwen/Qwen2-VL-7B-Instruct about 1 month ago

Have you deleted your GitHub page?

7
#10 opened about 1 month ago by xwzy6
New activity in google/gemma-2-9b-it about 1 month ago

Sliding window vs. Global Attention

5
#41 opened about 2 months ago by tanliboy
New activity in google/gemma-2-2b about 2 months ago
New activity in google/gemma-2b about 2 months ago

GemmaSdpaAttention vs GemmaAttention

2
#71 opened about 2 months ago by canqin001
New activity in Qwen/Qwen2-VL-7B-Instruct about 2 months ago
New activity in google/gemma-2-9b-it about 2 months ago
New activity in google/recurrentgemma-9b-it about 2 months ago

Evaluation Result

#15 opened about 2 months ago by tanliboy
New activity in meta-llama/Llama-3.1-70B-Instruct about 2 months ago

Pruning

7
#24 opened 2 months ago by dhivakarsa
New activity in google/gemma-7b about 2 months ago
New activity in meta-llama/Llama-3.1-8B-Instruct about 2 months ago

two BOS token id is right?

4
#97 opened 2 months ago by hpsun
New activity in meta-llama/Llama-3.1-8B 2 months ago
New activity in google/gemma-7b-it 2 months ago
New activity in Qwen/Qwen2-7B-Instruct 2 months ago
New activity in Qwen/Qwen2-Audio-7B-Instruct 2 months ago

TTS support?

3
#4 opened 2 months ago by yukiarimo
New activity in google/gemma-2-27b 2 months ago
New activity in google/gemma-2-9b 3 months ago

Fine-tuning Hyperparameters

6
#27 opened 3 months ago by tanliboy
New activity in microsoft/Phi-3-small-8k-instruct 4 months ago

Crash in Fine-tuning

4
#14 opened 5 months ago by tanliboy