VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues Paper • 2502.12084 • Published 10 days ago • 29
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data Paper • 2502.14397 • Published 7 days ago • 34
Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model Paper • 2502.13449 • Published 9 days ago • 42
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 7 days ago • 88
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 7 days ago • 150
Qwen2.5-3B-GRPO Collection Trained with unsloth on just 250 steps (resource constraints) on GSM8K to add reasoning abilities to Qwen2.5-3B (smaller model because resources) • 3 items • Updated 4 days ago
Qwen2.5-3B-GRPO Collection Trained with unsloth on just 250 steps (resource constraints) on GSM8K to add reasoning abilities to Qwen2.5-3B (smaller model because resources) • 3 items • Updated 4 days ago