XeTute's picture
Update README.md
cf08c15 verified
metadata
license: apache-2.0
datasets:
  - open-thoughts/OpenThoughts-114k
  - prithivMLmods/Deepthink-Reasoning-Ins
base_model:
  - XeTute/SaplingDream_V0.5-0.5B
tags:
  - reasoning
  - conversational
  - thinking
  - tiny
  - small
library_name: transformers

Sapling Dream V1

Introducing SaplingDream, a compact GPT model with 0.5 billion parameters, based on the Qwen/Qwen2.5-0.5B-Instruct architecture. This model has been fine-tuned on a RTX4060 8GB for a bit over two days on ~0.3B tokens...

Datasets & Resources

Evaluation Loss Chart

Evaluation Loss Chart

Our Apps & Socials

Chat Assistant | Support Us | GitHub

Long live the Islamic Republic of Pakistan; Glory to the Islamic Republic of Pakistan 🇵🇰

Pakistan Flag