Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
ybchen928
/
oncall-guide-ai
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
71b7de3
oncall-guide-ai
/
evaluation
Ctrl+K
Ctrl+K
6 contributors
History:
17 commits
VanKee
Refactor evaluation modules and add hospital chart generation
71b7de3
19 days ago
modules
Refactor evaluation modules and add hospital chart generation
19 days ago
old
Before Run the 1st Evalation: Add Precision & MRR Chart Generator and a sample test query
21 days ago
results
Refactor evaluation modules and add hospital chart generation
19 days ago
README_HOSPITAL_CUSTOMIZATION.md
Safe
10.2 kB
feat(evaluation): add comprehensive hospital customization evaluation system
21 days ago
direct_llm_evaluator.py
16.5 kB
Enhance Direct LLM Evaluator and Judge Evaluator:
21 days ago
generate_combined_comparison_chart.py
Safe
8.56 kB
feat(evaluation): add visualization generators for generating png files
21 days ago
generate_comparison_report.py
Safe
18.8 kB
feat(evaluation): add comprehensive hospital customization evaluation system
21 days ago
generate_execution_time_table.py
Safe
7.6 kB
feat(evaluation): add visualization generators for generating png files
21 days ago
generate_hospital_charts.py
Safe
7.84 kB
Refactor evaluation modules and add hospital chart generation
19 days ago
generate_individual_analysis_charts.py
Safe
17.4 kB
Refactor evaluation modules and add hospital chart generation
19 days ago
generate_individual_rag_vs_direct_charts.py
Safe
12.9 kB
feat(evaluation): add visualization generators for generating png files
21 days ago
hospital_customization_evaluator.py
Safe
26.5 kB
feat(evaluation): add comprehensive hospital customization evaluation system
21 days ago
latency_evaluator.py
41.4 kB
Update threshold values in latency evaluator and coverage chart generator; enhance precision and MRR analysis with corrected thresholds and new chart generator for detailed metrics visualization.
21 days ago
metric1_latency_chart_generator.py
Safe
13.6 kB
Before Run the 1st Evalation: Add Precision & MRR Chart Generator and a sample test query
21 days ago
metric2_extraction_chart_generator.py
Safe
8.63 kB
Before Run the 1st Evalation: Add Precision & MRR Chart Generator and a sample test query
21 days ago
metric3_relevance_chart_generator.py
Safe
9.93 kB
Update threshold values in latency evaluator and coverage chart generator; enhance precision and MRR analysis with corrected thresholds and new chart generator for detailed metrics visualization.
21 days ago
metric4_coverage_chart_generator.py
Safe
9.32 kB
Update threshold values in latency evaluator and coverage chart generator; enhance precision and MRR analysis with corrected thresholds and new chart generator for detailed metrics visualization.
21 days ago
metric5_6_judge_evaluator_manual.md
Safe
9.86 kB
Add multi-system evaluation support for clinical actionability and evidence quality metrics
21 days ago
metric5_6_llm_judge_chart_generator.py
19.5 kB
Add multi-system evaluation support for clinical actionability and evidence quality metrics
21 days ago
metric5_6_llm_judge_evaluator.py
Safe
30.3 kB
Enhance Direct LLM Evaluator and Judge Evaluator:
21 days ago
metric7_8_precision_MRR.py
16.8 kB
Add adaptive relevance thresholds for query complexity in PrecisionMRRAnalyzer; fix typo in condition mapping for postpartum hemorrhage
21 days ago
metric7_8_precision_mrr_chart_generator.py
Safe
23.8 kB
Update threshold values in latency evaluator and coverage chart generator; enhance precision and MRR analysis with corrected thresholds and new chart generator for detailed metrics visualization.
21 days ago
pre_user_query_evaluate.txt
318 Bytes
Add latency and relevance evaluators for medical query analysis (evaluatoin)
21 days ago
run_hospital_evaluation.py
Safe
3.58 kB
feat(evaluation): add comprehensive hospital customization evaluation system
21 days ago
run_rag_vs_direct_comparison.py
Safe
17.4 kB
Refactor evaluation modules and add hospital chart generation
19 days ago
single_test_query.txt
122 Bytes
Before Run the 1st Evalation: Add Precision & MRR Chart Generator and a sample test query
21 days ago
user_query.txt
2.06 kB
fix(mild bug): enhance user query prompts (more robust dealing process with .txt or .json) and add postpartum hemorrhage condition mapping
21 days ago
validate_expected_results.py
Safe
9.24 kB
Refactor evaluation modules and add hospital chart generation
19 days ago