med-flamingo

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

mdmoor authored a paper about 2 months ago

SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

mdmoor authored a paper about 2 months ago

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

mdmoor authored a paper 2 months ago

Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs

View all activity

mdmoor

authored 2 papers about 2 months ago

SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

Paper • 2506.21355 • Published Jun 26 • 9

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28 • 12

mdmoor

authored 5 papers 2 months ago

Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs

Paper • 2405.18740 • Published May 29, 2024

Almanac Copilot: Towards Autonomous Electronic Health Record Navigation

Paper • 2405.07896 • Published Apr 30, 2024

AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments

Paper • 2405.07960 • Published May 13, 2024 • 1

MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Paper • 2506.06091 • Published Jun 6 • 9

Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

Paper • 2506.11474 • Published Jun 13 • 18

mdmoor

authored a paper 5 months ago

AgentRxiv: Towards Collaborative Autonomous Research

Paper • 2503.18102 • Published Mar 23 • 24

michiyasunaga

authored a paper 5 months ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

Paper • 2503.11751 • Published Mar 14 • 16

shirwu

authored 3 papers 6 months ago

michiyasunaga

authored 8 papers 6 months ago

Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task

Paper • 1809.08887 • Published Sep 24, 2018 • 2

Large Language Models as Analogical Reasoners

Paper • 2310.01714 • Published Oct 3, 2023 • 16

SParC: Cross-Domain Semantic Parsing in Context

Paper • 1906.02285 • Published Jun 5, 2019

ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks

Paper • 1909.01716 • Published Sep 4, 2019

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models

Paper • 2305.17311 • Published May 27, 2023 • 1

WILDS: A Benchmark of in-the-Wild Distribution Shifts

Paper • 2012.07421 • Published Dec 14, 2020 • 1

On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1

LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Paper • 2109.06822 • Published Sep 14, 2021

AI & ML interests

Recent Activity

Team members 4

med-flamingo's activity