raidhon

AI & ML interests

Fine-tuning, Dataset creation, Time Series

Recent Activity

Organizations

None yet

raidhon's activity

replied to m-ric's post 4 months ago
replied to hrishbhdalal's post 7 months ago

Yeah, I was thinking the same thing. A large vocabulary does improve the performance of smaller LLMs, and judging by GPT-4o, the same is true for larger LLMs. Give it a try. I'm only doing this for small models of up to 3B parameters.

New activity in raidhon/coven_tiny_1.1b_32k_orpo_alpha 8 months ago

Finetuning the Model

2
#1 opened 8 months ago by GreazySpoon