p2el

Enterprise

community

AI & ML interests

None defined yet.

p2el's activity

weichiang

authored a paper 8 months ago

RouteLLM: Learning to Route LLMs with Preference Data

Paper • 2406.18665 • Published Jun 26, 2024 • 6

evanfrick

authored a paper 9 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 7

Timmli

authored a paper 9 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 7

weichiang

authored a paper 9 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 7

Timmli

authored a paper about 1 year ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 39

angelopoulos

authored a paper about 1 year ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 39

weichiang

authored a paper about 1 year ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 39

weichiang

authored a paper over 1 year ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 33

Timmli

authored a paper over 1 year ago

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Paper • 2309.11998 • Published Sep 21, 2023 • 25

weichiang

authored a paper over 1 year ago

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Paper • 2309.11998 • Published Sep 21, 2023 • 25