Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
Commit History
fix ModelType is not defined
67c432c
change 'proprietary' models to 'external' models and added news models
91c6e89
update ranges
5651102
verified
filter quantized models from the collection
ab44cd6
verified
Update src/leaderboard/read_evals.py
09cd30b
verified
Update src/tools/plots.py
0e84464
verified
Update src/envs.py
246795a
verified
Update src/envs.py
d6353b5
verified
update collection format
9c5c692
Update src/display/about.py
183ec61
verified
Merge branch 'main' of https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard
dffeeb0
Make model text exibit precision if there's more than one eval or precision is not float16 or bfloat16
59399bc
Update src/submission/submit.py
ddc78ea
verified
fix eval_name for non main revision models
4717ca8
fix typo and multiple models in README
b4fc70b
Merge branch 'main' of https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard
f49e1e5
Permit different revision
a3b0a0f
Update src/display/about.py
77a5f61
verified
better explation for use_remote_code=True
a761927
add citations
c14ac9f
Add proprietary model results v1
1dbfacb
manual create request
6db2f85
Fix num_parameters in some models
0c95be4
Change language dropdown order
1ec02f0
Update model list in README
2331e6f
Add new column: Main Language
6da7311
change name
88675db
Rename 85B+ format to 100B+
73d86a6
verified
add dynamic documentation for RAW_RESULTS_REPO
43c2b1a
Add raw results links if exists, and fix minor issues
aa7060a
new size intervals and apply same intervals for the collection
7625ef6
fix typo
5ab1da9
add env variables: REQUIRE_MODEL_CARD and REQUIRE_MODEL_LICENSE
de3b367
Merge branch 'main' of https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard into merge_original
811ded7
submit ident json
f3a1876
Add env variable SHOW_INCOMPLETE_EVALS and order evaluation queue by priority
8aaf0e7
should fix the problem with flagsrm create_request_file.py
a4c11b8
Clémentine
commited on
einops as requirement
0cc3edb
Allow old model metrics
6269bd0
Add NPM field
f976f1c
Add new tasks and make leadboard work without new tasks evals
5639a81
add datasources links on the about page
03f7287
fix bool env variabled
b81a33b
fix eval time plot
0cb9327
add changelog tab
b74e881
Better about's
67cd6fc
Tweak description of TruthfulQA in About (#599)
b6f02e1
verified
update plot to only look at correct models
dbb8b5d
Clémentine
commited on