Alex Zhang
a1zhang
Recent Activity
authored a paper about 2 months ago
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
authored a paper about 2 months ago
KernelBench: Can LLMs Write Efficient GPU Kernels?
authored a paper about 2 months ago
VideoGameBench: Can Vision-Language Models complete popular video games?