4 11 10

Zhengyang Tang

tangzhy

AI & ML interests

None yet

Recent Activity

authored a paper 10 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

upvoted a paper 10 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

commented on a paper 10 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

View all activity

Organizations

tangzhy's activity

authored a paper 10 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published 13 days ago • 29

upvoted a paper 10 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published 13 days ago • 29

commented a paper 10 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published 13 days ago • 29 •

authored a paper 24 days ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 27 days ago • 70

upvoted a paper 24 days ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 27 days ago • 70

commented a paper 24 days ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 27 days ago • 70 •

upvoted a paper about 1 month ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

New activity in tangzhy/ORLM 4 months ago

Apply for community grant: Academic project (gpu)

#1 opened 7 months ago by

tangzhy

upvoted 2 papers 4 months ago

Roadmap towards Superhuman Speech Understanding using Large Language Models

Paper • 2410.13268 • Published Oct 17, 2024 • 33

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 28

authored a paper 4 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 28

upvoted a paper 5 months ago

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4, 2024 • 55

liked 3 datasets 6 months ago

liked a model 6 months ago

CardinalOperations/ORLM-LLaMA-3-8B

Text Generation • Updated May 29, 2024 • 276 • 4

liked 2 datasets 6 months ago

CardinalOperations/OR-Instruct-Data-3K

Viewer • Updated May 29, 2024 • 3k • 43 • 4

CardinalOperations/NL4OPT

Viewer • Updated May 29, 2024 • 245 • 48 • 2

updated a Space 6 months ago

ORLM

😻

Chatbot

liked a Space 6 months ago

ORLM

😻

Chatbot