57 62 177

Asankhaya Sharma

codelion

http://asankhaya.github.io/

AI & ML interests

AI/ML, Dev Tools and Application Security

Recent Activity

updated a Space 2 days ago

codelion/svg2png

liked a Space 2 days ago

codelion/svg2png

new activity 4 days ago

codelion/optillm:Implement other approach that optillm supported?

View all activity

Organizations

codelion's activity

upvoted a collection 4 days ago

Flagship models

Collection

A list of all the latest flagship Arcee models, including from the Virtuoso and Nova series • 8 items • Updated Dec 27, 2024 • 6

upvoted a paper 9 days ago

LettuceDetect: A Hallucination Detection Framework for RAG Applications

Paper • 2502.17125 • Published 18 days ago • 8

upvoted a paper 16 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 17 days ago • 68

upvoted a paper 18 days ago

The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer

Paper • 2502.15631 • Published 21 days ago • 8

upvoted a paper 21 days ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 22 days ago • 60

upvoted a collection 23 days ago

The Ultimate Collection of Code Classifiers

Collection

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated 22 days ago • 11

upvoted 2 papers 24 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 26 days ago • 142

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published 25 days ago • 43

upvoted an article 25 days ago

Article

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 54

upvoted 11 papers 26 days ago

Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges

Paper • 2502.08680 • Published about 1 month ago • 11

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published 29 days ago • 16

CoT-Valve: Length-Compressible Chain-of-Thought Tuning

Paper • 2502.09601 • Published 29 days ago • 14

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published 29 days ago • 22

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published 29 days ago • 31

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 29 days ago • 143