Abstract
Machine Unlearning (MU) is critical for enhancing privacy and security in deep learning models, particularly in large multimodal language models (MLLMs), by removing specific private or hazardous information. While MU has made significant progress in textual and visual modalities, multimodal unlearning (MMU) remains largely underexplored, partially due to the absence of a suitable open-source benchmark. To address this, we introduce CLEAR, a new benchmark designed to evaluate MMU methods. CLEAR contains 200 fictitious individuals and 3,700 images linked to corresponding question-answer pairs, enabling a thorough evaluation across modalities. We assess 10 MU methods, adapting them for MMU, and highlight new challenges specific to multimodal forgetting. We also demonstrate that simple ℓ1 regularization on LoRA weights significantly mitigates catastrophic forgetting, preserving model performance on retained data. The dataset is available at https://huggingface.co/datasets/therem/CLEAR
Community
We introduce the first open-source benchmark for unlearning methods in a multimodal setup. We generate 200 fictitious individuals with associated biographical and visual data, such as facial images. After fine-tuning a model on this dataset, we selectively forget subsets of individuals (2, 10, or 20 persons). For the full pipeline, visit our GitHub repository.
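For a quick look at the data, here is a minimal loading sketch; the config name ("full") is an assumption, so check the dataset card on the Hub for the actual configurations and splits (e.g. separate forget/retain subsets for text and images):

```python
from datasets import load_dataset  # pip install datasets

# Pull the benchmark from the Hugging Face Hub. The config name "full" is an
# assumption -- see https://huggingface.co/datasets/therem/CLEAR for the
# actual configuration and split names.
clear = load_dataset("therem/CLEAR", "full")
print(clear)  # inspect the available splits and their columns
```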
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- A Closer Look at Machine Unlearning for Large Language Models (2024)
- Does your LLM truly unlearn? An embarrassingly simple approach to recover unlearned knowledge (2024)
- LLM Unlearning via Loss Adjustment with Only Forget Data (2024)
- Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning (2024)
- Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting (2024)
Nice
Really interesting paper! Here's my summary:
**CLEAR: first multimodal benchmark to make models forget what we want them to forget**
With privacy concerns rising, we sometimes need our models to "forget" specific information - like a person's data - while keeping everything else intact. Researchers just released CLEAR, the first benchmark to test how well this works with both text and images.
Bad news: Current methods either fail to truly forget or end up forgetting way too much. It's like trying to remove a single ingredient from a baked cake!

But there's hope: Adding simple mathematical constraints (L1 regularization) during the forgetting process significantly improves results.

Key insights:
- The benchmark tests forgetting on 200 fictional personas
  - 3,770 visual Q&A pairs
  - 4,000 textual Q&A pairs
  - Additional real-world tests
- Most current forgetting methods don't work well with both text and images
  - They either remember what they should forget
  - Or they forget too much unrelated information
- Simple mathematical constraints work surprisingly well
  - L1 regularization prevents excessive forgetting (a minimal sketch of this penalty follows below)
  - Works especially well with the LLMU method
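A minimal sketch of how such an L1 penalty on LoRA adapter weights could be added to an unlearning objective. It assumes a PEFT-style LoRA wrapper whose adapter parameters contain "lora" in their names; the loss names (`forget_loss`, `retain_loss`) and the coefficient `lam` are illustrative, not the paper's exact recipe:

```python
import torch

def l1_lora_penalty(model: torch.nn.Module, lam: float = 1e-4) -> torch.Tensor:
    """Return lam * sum(|w|) over trainable LoRA adapter weights.

    Assumes adapter parameters can be identified by "lora" in their
    names, as in PEFT-style LoRA wrappers.
    """
    device = next(model.parameters()).device
    penalty = torch.zeros((), device=device)
    for name, param in model.named_parameters():
        if "lora" in name.lower() and param.requires_grad:
            penalty = penalty + param.abs().sum()
    return lam * penalty

# Illustrative use inside an unlearning step (loss names are hypothetical):
#   loss = forget_loss + retain_loss + l1_lora_penalty(model, lam=1e-4)
#   loss.backward()
```

Intuitively, the penalty keeps the adapter update sparse, so fewer weights drift away from the fine-tuned model while the forget objective is optimized.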