-
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 75 -
REST: Retrieval-Based Speculative Decoding
Paper • 2311.08252 • Published -
Active Retrieval Augmented Generation
Paper • 2305.06983 • Published • 3 -
Retrieval-Augmented Generation for Large Language Models: A Survey
Paper • 2312.10997 • Published • 10
Zhongzhi Yu
kevin1020
AI & ML interests
Efficient LLM Inference and Tuning
Recent Activity
updated
a collection
21 days ago
Efficient VLM via Image Token Compression
Organizations
Collections
18
models
None public yet
datasets
None public yet