Konstantin Chernyshev
k4black
AI & ML interests
None yet
Recent Activity
updated
a collection
about 11 hours ago
U-MATH and μ-MATH - University-level math evaluation
authored
a paper
about 1 month ago
U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills
in LLMs
upvoted
a
paper
about 1 month ago
U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills
in LLMs
Organizations
k4black's activity
Upload codebleu.py
1
#2 opened 12 months ago
by
fasterinnerlooper
Problem calling this using Huggingface Evaluate
1
#1 opened 12 months ago
by
fasterinnerlooper