arxiv:2410.02725
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Recent Activity
Updated a dataset about 20 hours ago: Asap7772/Math-steptok-steps-mcvalue-test-part4-of-5
Updated a dataset about 20 hours ago: Asap7772/Math-steptok-steps-mcvalue-test-part2-of-5
Updated a dataset about 20 hours ago: Asap7772/Math-steptok-steps-mcvalue-test-part1-of-5
Models (8)
Asap7772/mathcamp_sft_llama3-1-8b • Text Generation • 6
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed15486-exp0_epoch0_checkpoint1
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed31426-exp0_epoch0_checkpoint2
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed31426-exp0_epoch0_checkpoint1 • Text Generation • 11
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed26382-exp0_epoch0_checkpoint2
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed26382-exp0_epoch0_checkpoint1 • Text Generation • 13
Asap7772/elix-llama32-3b-ipo
Asap7772/sft-prm800k-llama31-8b-steptok • Text Generation • 2.36k
Datasets (467)
Asap7772/Math-steptok-steps-mcvalue-test-part4-of-5 • 521k • 7
Asap7772/Math-steptok-steps-mcvalue-test-part2-of-5 • 521k • 8
Asap7772/Math-steptok-steps-mcvalue-test-part1-of-5 • 521k • 7
Asap7772/Math-steptok-steps-mcvalue-test-part5-of-5 • 521k • 7
Asap7772/Math-steptok-steps-mcvalue-test-part3-of-5 • 521k • 8
Asap7772/math_llamagen_feedback_sft • 77.3k • 8
Asap7772/Math-steptok-steps-mcvalue-part4-of-5 • 521k • 10
Asap7772/Math-steptok-steps-mcvalue-part1-of-5 • 521k • 8
Asap7772/Math-steptok-steps-mcvalue-part2-of-5 • 521k • 11
Asap7772/Math-steptok-steps-mcvalue-part3-of-5 • 521k • 9