IsoBench

community

https://isobench.github.io

isobench's activity

deqing

authored a paper 10 days ago

FoNE: Precise Single-Token Number Embeddings via Fourier Features

Paper • 2502.09741 • Published 15 days ago • 11

oliu-io

authored 2 papers about 2 months ago

METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring

Paper • 2501.02045 • Published Jan 3 • 21

Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published Nov 8, 2024 • 7

willieneis

authored 2 papers about 2 months ago

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published Dec 11, 2024 • 53

METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring

Paper • 2501.02045 • Published Jan 3 • 21

oliu-io

authored a paper 3 months ago

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published Dec 11, 2024 • 53

deqing

authored a paper 3 months ago

VisualLens: Personalization through Visual History

Paper • 2411.16034 • Published Nov 25, 2024 • 18

deqing

authored a paper 5 months ago

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Paper • 2410.04734 • Published Oct 7, 2024 • 17

Ruohao

authored a paper 7 months ago

Open-Vocabulary Audio-Visual Semantic Segmentation

Paper • 2407.21721 • Published Jul 31, 2024 • 8

deqing

updated a dataset 7 months ago

isobench/IsoBench

Viewer • Updated Aug 2, 2024 • 1.89k • 510 • 8

oliu-io

authored 4 papers 7 months ago

On Retrieval Augmentation and the Limitations of Language Model Training

Paper • 2311.09615 • Published Nov 16, 2023 • 1

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models

Paper • 2402.02392 • Published Feb 4, 2024 • 5

Interpretable Diffusion via Information Decomposition

Paper • 2310.07972 • Published Oct 12, 2023

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Paper • 2404.01266 • Published Apr 1, 2024 • 2

willieneis

authored a paper 8 months ago

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Paper • 2406.19314 • Published Jun 27, 2024 • 23

deqing

authored a paper 9 months ago

Pre-trained Large Language Models Use Fourier Features to Compute Addition

Paper • 2406.03445 • Published Jun 5, 2024

yushihu

authored a paper 11 months ago

BLINK: Multimodal Large Language Models Can See but Not Perceive

Paper • 2404.12390 • Published Apr 18, 2024 • 26

willieneis

authored 2 papers 11 months ago

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Paper • 2404.01266 • Published Apr 1, 2024 • 2

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models

Paper • 2402.02392 • Published Feb 4, 2024 • 5

yushihu

authored a paper 11 months ago

Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models

Paper • 2312.03052 • Published Dec 5, 2023

Team members 6
