7 13 4

ROHITH VENKATA REDDY

knight7561

AI & ML interests

Deep learning, Autonomous Driving

Recent Activity

commented on an article 25 days ago

SmolLM3: smol, multilingual, long-context reasoner

upvoted an article 25 days ago

SmolLM3: smol, multilingual, long-context reasoner

commented on an article 2 months ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

View all activity

Organizations

Collections 2

spaces 3

Runtime error

Demo Mcp

📚

demo of MCP

Runtime error

Groot

💬

Groot - I can do anything

Sleeping

First Agent Template

⚡

Fetch ArXiV papers and get local timezone time

models 5

datasets 0

None public yet

ROHITH VENKATA REDDY

AI & ML interests

Recent Activity

Organizations

Collections 2

Aligning Instruction Tuning with Pre-training

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Internal Consistency and Self-Feedback in Large Language Models: A Survey

Large Language Models are Zero-Shot Reasoners

Let's Verify Step by Step

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Aligning Instruction Tuning with Pre-training

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Internal Consistency and Self-Feedback in Large Language Models: A Survey

Large Language Models are Zero-Shot Reasoners

Let's Verify Step by Step

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

spaces 3

Demo Mcp

Groot

First Agent Template

models 5

knight7561/SmolLM2_python_coder-FT-ORPO

knight7561/SmolLM2-FT-DPO-python-code

knight7561/SmolLM2_python_coder

knight7561/SmolLM2-eli5_precomputed_top_slice

knight7561/SmolLM2-FT-MyDataset

datasets 0

ROHITH VENKATA REDDY

AI & ML interests

Recent Activity

Organizations

Collections 2

spaces 3 Sort: Recently updated

Demo Mcp

Groot

First Agent Template

models 5 Sort: Recently updated

datasets 0

spaces 3

models 5