Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Organizations

None yet

dtanow's activity

upvoted a paper 8 days ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published 10 days ago • 23

upvoted a paper 17 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 22 days ago • 179

upvoted a paper 22 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 26 days ago • 142

upvoted a paper 29 days ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 47

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

upvoted 2 papers 3 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 57

liked a model 3 months ago

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • 622

upvoted a paper 4 months ago

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

liked a model 5 months ago

facebook/incoder-6B

Text Generation • Updated Jan 24, 2023 • 647 • 79

liked a Space 5 months ago

12.7k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

liked a model 5 months ago

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Text Generation • Updated 30 days ago • 16k • 31

liked a dataset 5 months ago

coseal/codal-bench

Viewer • Updated Mar 18, 2024 • 500 • 107 • 6

liked a Space 5 months ago

185

BigCodeBench Leaderboard

🥇

Explore and analyze code evaluation data

liked 2 models 5 months ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27, 2024 • 426k • 1.02k

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 298k • 1.49k

liked a dataset 5 months ago

nvidia/OpenMathInstruct-2

Viewer • Updated Nov 25, 2024 • 22M • 6.51k • 160

New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct 6 months ago

fp8 / int8 inference - use bitsandbytes or awq

#8 opened 6 months ago by dtanow

liked a model 6 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 4, 2024 • 1.32M • 1.37k

liked a dataset 6 months ago

THUDM/humaneval-x

Viewer • Updated Oct 25, 2022 • 820 • 825 • 83