- CodeFusion: A Pre-trained Diffusion Model for Code Generation
  Paper • 2310.17680 • Published • 73
- What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
  Paper • 2312.15685 • Published • 16
- LLaMA Beyond English: An Empirical Study on Language Capability Transfer
  Paper • 2401.01055 • Published • 56
- Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
  Paper • 2401.00448 • Published • 31
Sergei Averkiev
averoo
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
upvoted a paper 14 days ago
Deep Researcher with Test-Time Diffusion
updated a model 21 days ago
averoo/flux-lora-fal-kukynyx