Bahdanau
Dzmitry
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
How to Train Your LLM Web Agent: A Statistical Diagnosis
published
an
article
4 months ago
PipelineRL
new activity
7 months ago
open-r1/README:[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO