2 5 25

Mert Ege

mertege

mertege

AI & ML interests

None yet

Recent Activity

liked a Space 18 days ago

nanotron/ultrascale-playbook

liked a model 20 days ago

ALLaM-AI/ALLaM-7B-Instruct-preview

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Organizations

mertege's activity

liked a Space 18 days ago

2.15k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 20 days ago

ALLaM-AI/ALLaM-7B-Instruct-preview

Text Generation • Updated 21 days ago • 7.2k • 85

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 341

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 14 days ago • 1.49M • • 1.24k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 14 days ago • 3.64M • • 11.1k

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349

New activity in kashif/gkd_openassistant-guanaco 5 months ago

Chat template on GKD Trainer

#1 opened 5 months ago by

mertege

liked a dataset 5 months ago

abdoelsayed/Open-ArabicaQA

Preview • Updated Mar 27, 2024 • 244 • 4

liked a dataset 6 months ago

BAAI/Infinity-Instruct

Viewer • Updated 13 days ago • 20.4M • 5.34k • 598

liked a model 6 months ago

maywell/Qwen2-7B-Multilingual-RP

Text Generation • Updated Jun 25, 2024 • 2.43k • 55

liked a dataset 6 months ago

macadeliccc/opus_samantha

Viewer • Updated Jun 21, 2024 • 3.19k • 112 • 21

liked 3 models 6 months ago

liked a Space 7 months ago

140

Open Arabic LLM Leaderboard

🏆

Track, rank and evaluate open Arabic LLMs and chatbots

upvoted an article 7 months ago

Article

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Jan 19, 2021

• 4

liked a model 8 months ago

haoranxu/ALMA-13B-Pretrain

Text Generation • Updated Oct 5, 2024 • 1.63k • 9

liked a dataset 9 months ago

mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 302k • 208

upvoted a paper 9 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

liked a Space 9 months ago

Magpie

🐦

Generate and rate instruction-response pairs