Turbo Pascal

TurboPascal

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 52

liked a Space about 1 month ago

2.99k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model about 2 months ago

Qwen/Qwen3-Reranker-0.6B

Text Ranking • 0.6B • Updated Jun 9 • 229k • 194

liked a model 2 months ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20 • 3.33M • 454

upvoted a collection 2 months ago

GTE models

Collection

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 30

updated a dataset 3 months ago

Mmoment/Mirage_Multimodal_Benchmark

Updated May 15 • 32

liked a dataset 4 months ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 17.6k • 1.39k

upvoted 2 papers 4 months ago

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Paper • 2503.23733 • Published Mar 31 • 11

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 46

liked a model 5 months ago

nvidia/NV-Embed-v2

Feature Extraction • 8B • Updated 19 days ago • 27.1k • 455

upvoted 2 articles 5 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others • Feb 4 • 1.28k

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024 • 45

updated a model 5 months ago

TurboPascal/ChineseModernBert

0.5B • Updated Feb 28 • 422 • 18

New activity in TurboPascal/ChineseModernBert 5 months ago

Could you please publish the training code on GitHub?

#1 opened 6 months ago by Pony

upvoted a paper 5 months ago

DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective

Paper • 2309.07396 • Published Sep 14, 2023 • 1

liked a Space 5 months ago

197

MT Bench

📊

Compare model answers to questions

liked a model 6 months ago

TurboPascal/ChineseModernBert

0.5B • Updated Feb 28 • 422 • 18

published a model 6 months ago

TurboPascal/ChineseModernBert

0.5B • Updated Feb 28 • 422 • 18

liked a dataset 8 months ago

phanerozoic/Coq-HoTT

Viewer • Updated Dec 13, 2024 • 7.85k • 27 • 2

liked a model 9 months ago

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 782k • 436
