Spaces:
Running
Running
Update src/about.py
Browse files- src/about.py +3 -3
src/about.py
CHANGED
@@ -39,9 +39,9 @@ LLM_BENCHMARKS_TEXT = f"""
|
|
39 |
- This benchmark evaluates how well protein representation models can infer functional similarities between proteins. Ground truth functional similarities are derived from Gene Ontology (GO) annotations.
|
40 |
- Different distance metrics (Cosine, Manhattan, Euclidean) are used to compute protein vector similarities, which are then correlated with the functional similarities.
|
41 |
- The benchmark uses three different datasets:
|
42 |
-
• Sparse: A sparse uniform dataset with broader protein coverage
|
43 |
-
• 200: A set of well-annotated 200 proteins
|
44 |
-
• 500: A set of well-annotated 500 proteins
|
45 |
|
46 |
- Metrics (sim_ prefix):
|
47 |
• sim_sparse_MF_correlation/sim_200_MF_correlation/sim_500_MF_correlation: Correlation between protein embeddings and Molecular Function (MF) similarity scores
|
|
|
39 |
- This benchmark evaluates how well protein representation models can infer functional similarities between proteins. Ground truth functional similarities are derived from Gene Ontology (GO) annotations.
|
40 |
- Different distance metrics (Cosine, Manhattan, Euclidean) are used to compute protein vector similarities, which are then correlated with the functional similarities.
|
41 |
- The benchmark uses three different datasets:
|
42 |
+
• Sparse Uniform: A sparse uniform dataset with broader protein coverage
|
43 |
+
• Well Annotated 200: A set of well-annotated 200 proteins
|
44 |
+
• Well Annotated 500: A set of well-annotated 500 proteins
|
45 |
|
46 |
- Metrics (sim_ prefix):
|
47 |
• sim_sparse_MF_correlation/sim_200_MF_correlation/sim_500_MF_correlation: Correlation between protein embeddings and Molecular Function (MF) similarity scores
|