Dokyoon's picture

73 283

Dokyoon

leeloolee

·

Eruly

AI & ML interests

ai

Recent Activity

reacted to prithivMLmods's post with 👍 1 day ago

On the verge of releasing Poseidon-Reasoning-5M, a dataset built to excel in general thought processes, mathematics, and science across a diverse mixture of domains, I’m also dropping the Gargantua-R1-Compact dataset, a collection of over six million high-quality reasoning QA pair traces. 🤗🚀 ✦ Gargantua-R1-Compact : https://huggingface.co/datasets/prithivMLmods/Gargantua-R1-Compact ``` from datasets import load_dataset dataset = load_dataset("prithivMLmods/Gargantua-R1-Compact", split="train") ``` Additionally, I’m adding the mini version of Gargantua — the Gargantua-R1-Wee : https://huggingface.co/datasets/prithivMLmods/Gargantua-R1-Wee ``` from datasets import load_dataset dataset = load_dataset("prithivMLmods/Gargantua-R1-Wee", split="train") ``` The composition spans 73.93% core mathematical reasoning involving problems, proofs, and computational challenges, 12.11% across diverse scientific domains such as physics, chemistry, biology, and interdisciplinary topics, 11.35% in competitive coding covering algorithms and data structures, 1.37% in academic science focusing on research-level methodology, 0.95% in creative and analytical reasoning through logic puzzles and problem-solving tasks, 0.25% in specialized technical areas like MLOps, LLMs, diffusion models, and CUDA, and 0.06% involving data from graphs and charts converted into structured JSON formats. Designed with both rich contextual depth and formal structural clarity, Gargantua-R1-Compact is an optimal resource for advancing research in symbolic reasoning, interpretability, and high-precision question answering in mathematical domains. ✦ Collection : https://huggingface.co/collections/prithivMLmods/gargantua-r1-mod-6896bfd7834e82b89ad2b38b To know more about it, visit the dataset card of the respective dataset. !!

liked a model 3 days ago

mwhanna/qwen3-14b-transcoders

liked a model 3 days ago

Intelligent-Internet/II-Search-4B

View all activity

Organizations

Collections 2

spaces 2

Llama 2 13b Chat

Daekeun Ml Llama 2 Ko DPO 13B

models 45

leeloolee/gkd-model

leeloolee/intention

Sentence Similarity • 0.3B • Updated Sep 7, 2024 • 11 • 3

leeloolee/online_dpo_gemma

Updated Aug 6, 2024

leeloolee/models-moved

Updated Aug 6, 2024

leeloolee/0806

Updated Aug 6, 2024

leeloolee/online_dpo_0805

3B • Updated Aug 5, 2024 • 2

leeloolee/online_dpo_02_18_48

Updated Aug 3, 2024

leeloolee/online_dpo_02_15_23

Updated Aug 3, 2024

leeloolee/online_dpo_02_12_38

Updated Aug 3, 2024

leeloolee/online_dpo_02_08_24

Updated Aug 3, 2024

datasets 1

leeloolee/mdpo

Updated Oct 23, 2024 • 7 • 1