Article What We Learned About LLM/VLMs in Healthcare AI Evaluation: By shanchen • Nov 8, 2024 • 10
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Paper • 2406.19999 • Published Jun 28, 2024 • 3
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Paper • 2406.13663 • Published Jun 19, 2024 • 7
Resonance RoPE: Improving Context Length Generalization of Large Language Models Paper • 2403.00071 • Published Feb 29, 2024 • 22