ethz-privsec (ETHZ Privacy and Security Lab)

dedeswim

authored 4 papers 7 months ago

Evading Black-box Classifiers Without Breaking Eggs

Paper • 2306.02895 • Published Jun 5, 2023

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Paper • 2404.01318 • Published Mar 28, 2024

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Paper • 2406.07954 • Published Jun 12, 2024 • 2

AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents

Paper • 2406.13352 • Published Jun 19, 2024

ftramer

authored a paper 12 months ago

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

dpaleka

authored a paper 12 months ago

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

carlini

authored a paper 12 months ago

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

dpaleka

authored a paper over 1 year ago

ARB: Advanced Reasoning Benchmark for Large Language Models

Paper • 2307.13692 • Published Jul 25, 2023 • 17

carlini

authored a paper over 1 year ago

Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems

Paper • 2210.03297 • Published Oct 7, 2022

ftramer

authored a paper over 1 year ago

Are aligned neural networks adversarially aligned?

Paper • 2306.15447 • Published Jun 26, 2023 • 5

carlini

authored a paper over 1 year ago

Are aligned neural networks adversarially aligned?

Paper • 2306.15447 • Published Jun 26, 2023 • 5

ETHZ Privacy and Security Lab

AI & ML interests

ethz-privsec's activity

Evading Black-box Classifiers Without Breaking Eggs

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents

Stealing Part of a Production Language Model

Stealing Part of a Production Language Model

Stealing Part of a Production Language Model

ARB: Advanced Reasoning Benchmark for Large Language Models

Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems

Are aligned neural networks adversarially aligned?

Are aligned neural networks adversarially aligned?

AI & ML interests

Team members 5

ethz-privsec's activity