Commit History

Update gradio version
f82f31d
verified

OSainz committed on

Postprocessing
998b5f7

OSainz committed on

GPT-3.5 HumanEval_R CodeForces2305 contamination based on https://arxiv.org/abs/2402.15938 (#28)
047d292
verified

OSainz suryanshs16103 committed on

Likely FLORES contamination for Claude 3 Opus (#29)
52c7b2a
verified

Iker davidstap committed on

Add reports from Benchmarking paper "Benchmark Leakage in Large Language Models" (#27)
25633c4
verified

OSainz SinclairWang committed on

Add Reports Based on "Llemma: An Open Language Model For Mathematics" (#23)
9fba4d8
verified

OSainz wlchen committed on

Add Aquila model series which have gsm8k test set contamination (#21)
8f6a7cc
verified

OSainz bpHigh committed on

Update README.md
e190954
verified

OSainz committed on

GPT-3.5 Spider contamination based on https://arxiv.org/pdf/2402.08100 (#18)
dc4c3f8
verified

OSainz bpHigh committed on

update interface
95be02e

OSainz committed on

Merge branch 'pr/17'
77404ae

OSainz committed on

Updates
d4d0c64

OSainz committed on

File fixes and cleaning (#17)
99a8650
verified

OSainz committed on

Add info about the changes in the markdown.
4a1e5cc

OSainz committed on

Add changes
23add19

OSainz committed on

Superglue/RealNews Contamination based on "Noise-Robust De-Duplication at Scale" (#15)
888fb82
verified

OSainz emilys committed on

Mistral 7B Arc Easy Contamination based on "Proving Test Set Contamination in Black Box Language Models" (#14)
4f71313
verified

OSainz AmeyaPrabhu committed on

Added Contamination Evidence from GPT4 Tech Report using String matching on GPT-4 (#11)
f82db5d
verified

OSainz AmeyaPrabhu committed on

GPT-3.5Turbo HumanEval Contamination based on "Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models" (#16)
6b722ae
verified

OSainz jupyter31 committed on

Added Contamination Evidence on MMLU of ChatGPT/GPT4 from "Investigating data contamination in modern benchmarks for large language models" (#10)
f5daf9b
verified

OSainz AmeyaPrabhu committed on

Add ignorecase to search options
473e687

OSainz committed on

Added Contamination Info on Old Models: GPT3, FLAN, GLaM, PaLM, PaLM 2 (#13)
c4acbf6
verified

OSainz AmeyaPrabhu committed on

Fix arxiv links
7127ae8

OSainz committed on

Update README.md
9852685
verified

Iker committed on

Add model-based results for MedNLI, RadNLI for GPT-3.5 and GPT-4 (#8)
d57b460
verified

Iker j-chim committed on

Add data from "An Open-Source Data Contamination Report for Large Language Models" (#5)
619ed3b
verified

Iker vishaal27 committed on

Fix format issues
9b28f49

OSainz committed on

Add data from "Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus" (#6)
935e79b
verified

Iker vishaal27 committed on

update urls
f77074b

Iker committed on

Increase tab font size
6738f41

Iker committed on

Further refine the guidelines
49c00c2

Iker committed on

Update markdown.py
49c092a
verified

OSainz committed on

Get token from environment
76cf558

OSainz committed on

Add reports from Time Travel In LLMs paper (#3)
5a41656
verified

OSainz committed on

Use HF api to check repo existance
dee592a

OSainz committed on

Fix super_glue replace
ab79de8

OSainz committed on

Add PR links to previous commits
f35c65c

OSainz committed on

Add data from WIMBD paper (#2)
eadd64a
verified

OSainz committed on

Small changes
fd6f269

OSainz committed on

theme changes
540407e

OSainz committed on

Style + gitignore
5945c23

OSainz committed on

Text fixes
11def42

Iker committed on

Initital commit
eba8a37

Iker committed on

initial commit
1751f3a
verified

Iker committed on