Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
metunlp
/
model-eval-be
like
0
Running
on
T4
App
Files
Files
Community
2
Fetching metadata from the HF Docker repository...
main
model-eval-be
/
src
/
deepeval
7 contributors
History:
61 commits
Ahmet Kaan Sever
Sending inputs to device for non- multi gpu hardware
52b6367
about 8 hours ago
__init__.py
Safe
0 Bytes
Create deep eval suite
23 days ago
base_task.py
Safe
10.9 kB
Sending inputs to device for non- multi gpu hardware
about 8 hours ago
bias.py
Safe
4.91 kB
Changed dataset size to default and fixed imports
about 8 hours ago
bias_task.py
Safe
2.45 kB
Added new seperate logs for llm judges. Commented adapter loading for testing
about 9 hours ago
commonsense_reasoning_task.py
Safe
3.48 kB
Removed unnecessary debug prints and timestamps now return seconds.
7 days ago
complex_reasoning.py
Safe
2.31 kB
Removed unnecessary debug prints and timestamps now return seconds.
7 days ago
deepeval_task_manager.py
Safe
7.35 kB
Update src/deepeval/deepeval_task_manager.py
about 14 hours ago
faithfulness_task.py
Safe
2.35 kB
Added new seperate logs for llm judges. Commented adapter loading for testing
about 9 hours ago
instruction_following_task.py
Safe
2.38 kB
Sending inputs to device for non- multi gpu hardware
about 8 hours ago
math.py
Safe
5.72 kB
Changed dataset size to default and fixed imports
about 8 hours ago
metaphors_and_idioms.py
Safe
3.98 kB
Post merge fix
about 12 hours ago
mmlu.py
Safe
3.8 kB
Changed dataset size to default and fixed imports
about 8 hours ago
ner.py
Safe
8.13 kB
Changed dataset size to default and fixed imports
about 8 hours ago
nli.py
Safe
3.42 kB
Removed unnecessary debug prints and timestamps now return seconds.
7 days ago
pos.py
Safe
9.72 kB
Changed dataset size to default and fixed imports
about 8 hours ago
reading_comp_mc.py
Safe
3.23 kB
Removed unnecessary debug prints and timestamps now return seconds.
7 days ago
reading_comprehension_task.py
Safe
3.2 kB
Added new seperate logs for llm judges. Commented adapter loading for testing
about 9 hours ago
sentiment_analysis_task.py
Safe
1.61 kB
Removed unnecessary debug prints and timestamps now return seconds.
7 days ago
sts.py
Safe
5.99 kB
Changed dataset size to default and fixed imports
about 8 hours ago
summarization_task.py
Safe
2.44 kB
Changed dataset size to default and fixed imports
about 8 hours ago
topic_detection.py
Safe
3.39 kB
Changed dataset size to default and fixed imports
about 8 hours ago
toxicity_task.py
Safe
2.02 kB
Sending inputs to device for non- multi gpu hardware
about 8 hours ago
truthfulness_task.py
Safe
2.77 kB
Changed dataset size to default and fixed imports
about 8 hours ago
turkish_general_knowledge_task.py
Safe
3.08 kB
Removed unnecessary debug prints and timestamps now return seconds.
7 days ago
turkish_vocabulary.py
Safe
4.5 kB
Changed dataset size to default and fixed imports
about 8 hours ago
utils.py
Safe
513 Bytes
Add acc_std_err
21 days ago