Models deployed on HuggingFace or RunPods.
Patronus AI
company
Verified
AI & ML interests
LLM Evaluation
Recent Activity
View all activity
A benchmark for tip-of-the-tongue search and reasoning.
-
PatronusAI/lynx-70b-instruct-covidqa-generations
Viewer • Updated • 1k • 6 -
PatronusAI/lynx-70b-instruct-drop-generations
Viewer • Updated • 1k • 6 -
PatronusAI/lynx-70b-instruct-financebench-generations
Viewer • Updated • 1k • 9 -
PatronusAI/lynx-70b-instruct-halueval-generations
Viewer • Updated • 10k • 8
Models deployed on HuggingFace or RunPods.
A benchmark for tip-of-the-tongue search and reasoning.
-
PatronusAI/lynx-70b-instruct-covidqa-generations
Viewer • Updated • 1k • 6 -
PatronusAI/lynx-70b-instruct-drop-generations
Viewer • Updated • 1k • 6 -
PatronusAI/lynx-70b-instruct-financebench-generations
Viewer • Updated • 1k • 9 -
PatronusAI/lynx-70b-instruct-halueval-generations
Viewer • Updated • 10k • 8