taesiri PRO

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

updated a dataset about 1 hour ago

taesiri/telus_data_review

updated a dataset about 10 hours ago

taesiri/BugsBunny-ManualEval-IntermediateSet

published a dataset about 10 hours ago

taesiri/BugsBunny-ManualEval-IntermediateSet

taesiri's activity

upvoted a paper about 19 hours ago

PixelWorld: Towards Perceiving Everything as Pixels

Paper • 2501.19339 • Published 4 days ago • 12

upvoted a paper about 21 hours ago

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published 5 days ago • 7

upvoted an article about 21 hours ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 303

upvoted 3 papers 1 day ago

upvoted a paper 4 days ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published 8 days ago • 17

upvoted an article 4 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 8 days ago • 607

upvoted 4 papers 5 days ago

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Paper • 2501.15654 • Published 9 days ago • 9

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

Paper • 2501.17433 • Published 6 days ago • 7

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 8 days ago • 30

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 6 days ago • 45

upvoted 2 papers 6 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 7 days ago • 93

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published 7 days ago • 20

upvoted 3 papers 7 days ago

Are Vision Language Models Texture or Shape Biased and Can We Steer Them?

Paper • 2403.09193 • Published Mar 14, 2024 • 9

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 10 days ago • 50

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 10 days ago • 48

upvoted 3 papers 8 days ago

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 12 days ago • 19

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published 15 days ago • 25

Humanity's Last Exam

Paper • 2501.14249 • Published 11 days ago • 51