Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data Paper • 2404.03862 • Published Apr 5, 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees Paper • 2404.08417 • Published Apr 12, 2024
Dated Data: Tracing Knowledge Cutoffs in Large Language Models Paper • 2403.12958 • Published Mar 19, 2024
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation Paper • 2411.14384 • Published Nov 21, 2024
LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression Paper • 2406.20092 • Published Jun 28, 2024
Every Language Counts: Learn and Unlearn in Multilingual LLMs Paper • 2406.13748 • Published Jun 19, 2024
Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell Paper • 2406.14673 • Published Jun 20, 2024
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF Paper • 2406.07971 • Published Jun 12, 2024
Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models Paper • 2408.06663 • Published Aug 13, 2024
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models Paper • 2407.05502 • Published Jul 7, 2024
Nugget 2D: Dynamic Contextual Compression for Scaling Decoder-only Language Models Paper • 2310.02409 • Published Oct 3, 2023
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents Paper • 2402.17896 • Published Feb 27, 2024
CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation Paper • 2406.17186 • Published Jun 24, 2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Paper • 2407.09413 • Published Jul 12, 2024