llama3.2-1b-Open-R1-GRPO-test0 / train_results.json
Commit e9c8320 (verified) by hyunseoki: Model save
{
  "total_flos": 0.0,
  "train_loss": 0.0,
  "train_runtime": 1.6097,
  "train_samples": 7473,
  "train_samples_per_second": 4642.518,
  "train_steps_per_second": 165.871
}
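
The figures above can be cross-checked with a little arithmetic. The sketch below is a minimal sanity check, not part of the original repository: it assumes the two throughput fields are totals divided by `train_runtime`, and the helper name `summarize` and the `no_training_signal` flag are hypothetical. A zero `train_loss` and zero `total_flos` alongside a runtime under two seconds may indicate that this invocation performed no actual optimization steps (for example, a run resumed from an already finished checkpoint), but the file alone cannot confirm that.

```python
import json

# Metrics copied verbatim from train_results.json.
RESULTS_JSON = """
{
  "total_flos": 0.0,
  "train_loss": 0.0,
  "train_runtime": 1.6097,
  "train_samples": 7473,
  "train_samples_per_second": 4642.518,
  "train_steps_per_second": 165.871
}
"""


def summarize(results):
    """Derive implied totals from the throughput fields (hypothetical helper).

    Assumption: each *_per_second value is a total count divided by
    train_runtime, so multiplying back recovers the total.
    """
    runtime = results["train_runtime"]
    implied_samples = results["train_samples_per_second"] * runtime
    implied_steps = results["train_steps_per_second"] * runtime
    return {
        "implied_samples": round(implied_samples),
        "implied_steps": round(implied_steps),
        # Ratio of samples to steps, i.e. an implied effective batch size.
        "samples_per_step": implied_samples / implied_steps,
        # Heuristic flag only: zero loss and zero FLOs together are suspicious.
        "no_training_signal": (
            results["train_loss"] == 0.0 and results["total_flos"] == 0.0
        ),
    }


results = json.loads(RESULTS_JSON)
summary = summarize(results)
print(summary)
```

Under that assumption, the implied sample count matches `train_samples` (7473), the implied step count is about 267, and the ratio of the two is roughly 28 samples per step.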