mgyigit commited on
Commit
66b8efa
·
verified ·
1 Parent(s): da4dc20

Update src/about.py

Browse files
Files changed (1) hide show
  1. src/about.py +3 -3
src/about.py CHANGED
@@ -39,9 +39,9 @@ LLM_BENCHMARKS_TEXT = f"""
39
  - This benchmark evaluates how well protein representation models can infer functional similarities between proteins. Ground truth functional similarities are derived from Gene Ontology (GO) annotations.
40
  - Different distance metrics (Cosine, Manhattan, Euclidean) are used to compute protein vector similarities, which are then correlated with the functional similarities.
41
  - The benchmark uses three different datasets:
42
- • Sparse: A sparse uniform dataset with broader protein coverage
43
- • 200: A set of well-annotated 200 proteins
44
- • 500: A set of well-annotated 500 proteins
45
 
46
  - Metrics (sim_ prefix):
47
  • sim_sparse_MF_correlation/sim_200_MF_correlation/sim_500_MF_correlation: Correlation between protein embeddings and Molecular Function (MF) similarity scores
 
39
  - This benchmark evaluates how well protein representation models can infer functional similarities between proteins. Ground truth functional similarities are derived from Gene Ontology (GO) annotations.
40
  - Different distance metrics (Cosine, Manhattan, Euclidean) are used to compute protein vector similarities, which are then correlated with the functional similarities.
41
  - The benchmark uses three different datasets:
42
+ • Sparse Uniform: A sparse uniform dataset with broader protein coverage
43
+ Well Annotated 200: A set of well-annotated 200 proteins
44
+ Well Annotated 500: A set of well-annotated 500 proteins
45
 
46
  - Metrics (sim_ prefix):
47
  • sim_sparse_MF_correlation/sim_200_MF_correlation/sim_500_MF_correlation: Correlation between protein embeddings and Molecular Function (MF) similarity scores