Binhang Yuan
biyuan
AI & ML interests
ML System
Recent Activity
authored
a paper
24 days ago
FlexGen: High-Throughput Generative Inference of Large Language Models
with a Single GPU
authored
a paper
24 days ago
Auto-Differentiation of Relational Computations for Very Large Scale
Machine Learning
authored
a paper
24 days ago
Holistic Evaluation of Language Models