4 1 104

Kh PRO

raidhon

AI & ML interests

Fine-tuning, Dataset creation, Time Series

Recent Activity

liked a dataset 17 days ago

open-thoughts/OpenThoughts-114k

liked a model 17 days ago

Zyphra/Zonos-v0.1-hybrid

liked a model 17 days ago

agentica-org/DeepScaleR-1.5B-Preview

View all activity

Organizations

None yet

raidhon's activity

liked a dataset 17 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 8 days ago • 228k • 114k • 617

liked 2 models 17 days ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated 13 days ago • 51.8k • 1.01k

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • Updated 5 days ago • 36.3k • • 486

commented on Open-source DeepResearch – Freeing our search agents 23 days ago

Very cool thanks! I think OpenAI already hate Open Source :)))))
Products that are trying so hard to monetize are created in one day.

upvoted an article 23 days ago

Article

Open-source DeepResearch – Freeing our search agents

24 days ago

• 1.11k

liked 2 models about 1 month ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated about 12 hours ago • 1.25M • 3.46k

bytedance-research/UI-TARS-72B-DPO

Image-Text-to-Text • Updated Jan 25 • 22.7k • 90

reacted to onekq's post with 🔥 about 1 month ago

Post

4747

🐋DeepSeek 🐋 is the real OpenAI 😯

6 replies

liked a dataset about 2 months ago

NovaSky-AI/Sky-T1_data_17k

Viewer • Updated Jan 14 • 16.4k • 1.94k • 176

New activity in Qwen/QwQ-32B-Preview 2 months ago

Can't reproduce the evaluation result of GPQA dataset

#47 opened 2 months ago by

Rinn000

liked a model 5 months ago

rhymes-ai/Aria

Image-Text-to-Text • Updated Jan 27 • 24.7k • 617

liked a dataset 5 months ago

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 4.01k • 83

replied to m-ric's post 6 months ago

Yes, it's been tested, and it's false. It's even worse than the regular LLAMA 3.1 70b. It's even funny to compare it to Claude.
https://www.reddit.com/r/LocalLLaMA/s/BH5A2ngyui

liked 2 models 10 months ago

imone/Llama-3-8B-fixed-special-embedding

Text Generation • Updated Apr 25, 2024 • 79 • 17

Xenova/gpt-4o

Updated May 13, 2024 • 58

replied to hrishbhdalal's post 10 months ago

Yeah, I was thinking the same thing. A large vocabulary does improve the performance of smaller LLMs and judging by the GPT-4o the same is true for larger LLM. Give it a try. I'm just doing this for small size models up to 3B parameters.

liked a model 10 months ago

mustafaaljadery/gemma-2B-10M

Updated May 9, 2024 • 82 • 228

updated a model 10 months ago

raidhon/coven_7b_128k_orpo_alpha-v1.1

Updated May 8, 2024

liked a model 10 months ago

aeonium/Aeonium-v0-Base-1B

Text Generation • Updated Jul 8, 2024 • 309 • 22

updated a model 10 months ago

raidhon/coven_tiny_1.1b_32k_orpo_alpha_gguf

Updated May 5, 2024 • 11