ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction? Paper • 2411.06469 • Published Nov 10, 2024 • 17
Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation Paper • 2409.20385 • Published Sep 30, 2024
WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation Paper • 2410.12722 • Published Oct 16, 2024 • 5
Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks Paper • 2406.12066 • Published Jun 17, 2024 • 8
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias Paper • 2405.05506 • Published May 9, 2024 • 1
Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly Paper • 2310.12300 • Published Oct 18, 2023 • 1
MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain Paper • 2303.08179 • Published Mar 14, 2023 • 2
Generalist embedding models are better at short-context clinical semantic search than specialized embedding models Paper • 2401.01943 • Published Jan 3, 2024 • 6
LongHealth: A Question Answering Benchmark with Long Clinical Documents Paper • 2401.14490 • Published Jan 25, 2024 • 3
xMEN: A Modular Toolkit for Cross-Lingual Medical Entity Normalization Paper • 2310.11275 • Published Oct 17, 2023 • 1
The impact of using an AI chatbot to respond to patient messages Paper • 2310.17703 • Published Oct 26, 2023 • 5
Large Language Models to Identify Social Determinants of Health in Electronic Health Records Paper • 2308.06354 • Published Aug 11, 2023 • 3