LLMs Meet Editing

community

https://llm-editing.github.io/

llm-editing


AI & ML interests

None defined yet.

Recent Activity

canyuchen authored a paper about 1 month ago

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

canyuchen authored a paper about 1 month ago

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

BaixHuang authored a paper 2 months ago

Can Editing LLMs Inject Harm?


llm-editing's activity

canyuchen

authored 2 papers about 1 month ago

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25 • 36

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10 • 17

BaixHuang

authored 2 papers 2 months ago

Can Editing LLMs Inject Harm?

Paper • 2407.20224 • Published Jul 29 • 3

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Paper • 2408.08946 • Published Aug 16 • 11

canyuchen

updated a dataset 2 months ago

llm-editing/HalluEditBench

Updated Oct 25 • 66 • 2

BaixHuang

authored a paper 2 months ago

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21 • 54

canyuchen

authored a paper 2 months ago

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21 • 54

BaixHuang

updated a dataset 2 months ago

llm-editing/HalluEditBench

Updated Oct 25 • 66 • 2

canyuchen

authored 2 papers 4 months ago

Can Editing LLMs Inject Harm?

Paper • 2407.20224 • Published Jul 29 • 3

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Paper • 2408.08946 • Published Aug 16 • 11

BaixHuang

updated a dataset 5 months ago

llm-editing/editing-attack

Preview • Updated Jul 30 • 28

canyuchen

authored a paper 6 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5 • 52

canyuchen

authored a paper 8 months ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18 • 10

canyuchen

authored a paper 11 months ago

Can Large Language Model Agents Simulate Human Trust Behaviors?

Paper • 2402.04559 • Published Feb 7

canyuchen

authored 2 papers about 1 year ago

Combating Misinformation in the Age of LLMs: Opportunities and Challenges

Paper • 2311.05656 • Published Nov 9, 2023

Can LLM-Generated Misinformation Be Detected?

Paper • 2309.13788 • Published Sep 25, 2023


Team members 2
