Dokyoon
leeloolee
AI & ML interests
ai
Recent Activity
reacted
to
prithivMLmods's
post
with 👍
about 13 hours ago
On the verge of releasing Poseidon-Reasoning-5M, a dataset built to excel in general thought processes, mathematics, and science across a diverse mixture of domains, I’m also dropping the Gargantua-R1-Compact dataset, a collection of over six million high-quality reasoning QA pair traces. 🤗🚀
✦ Gargantua-R1-Compact : https://huggingface.co/datasets/prithivMLmods/Gargantua-R1-Compact
```
from datasets import load_dataset
dataset = load_dataset("prithivMLmods/Gargantua-R1-Compact", split="train")
```
Additionally, I’m adding the mini version of Gargantua — the Gargantua-R1-Wee : https://huggingface.co/datasets/prithivMLmods/Gargantua-R1-Wee
```
from datasets import load_dataset
dataset = load_dataset("prithivMLmods/Gargantua-R1-Wee", split="train")
```
The composition spans 73.93% core mathematical reasoning involving problems, proofs, and computational challenges, 12.11% across diverse scientific domains such as physics, chemistry, biology, and interdisciplinary topics, 11.35% in competitive coding covering algorithms and data structures, 1.37% in academic science focusing on research-level methodology, 0.95% in creative and analytical reasoning through logic puzzles and problem-solving tasks, 0.25% in specialized technical areas like MLOps, LLMs, diffusion models, and CUDA, and 0.06% involving data from graphs and charts converted into structured JSON formats. Designed with both rich contextual depth and formal structural clarity, Gargantua-R1-Compact is an optimal resource for advancing research in symbolic reasoning, interpretability, and high-precision question answering in mathematical domains.
✦ Collection : https://huggingface.co/collections/prithivMLmods/gargantua-r1-mod-6896bfd7834e82b89ad2b38b
To know more about it, visit the dataset card of the respective dataset. !!
liked
a model
2 days ago
mwhanna/qwen3-14b-transcoders
liked
a model
2 days ago
Intelligent-Internet/II-Search-4B