Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Paper • 2405.15071 • Published May 23, 2024 • 37
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs Paper • 2311.05657 • Published Nov 9, 2023 • 27
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • 2403.08763 • Published Mar 13, 2024 • 49
Is Cosine-Similarity of Embeddings Really About Similarity? Paper • 2403.05440 • Published Mar 8, 2024 • 3
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification Paper • 2403.04696 • Published Mar 7, 2024 • 4
Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge Paper • 2403.01432 • Published Mar 3, 2024 • 2
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering Paper • 2401.08500 • Published Jan 16, 2024 • 5
Large language models surpass human experts in predicting neuroscience results Paper • 2403.03230 • Published Mar 4, 2024 • 4
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4, 2024 • 5
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 114
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models Paper • 2402.10986 • Published Feb 16, 2024 • 77
Foundation AI Papers Collection • Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 29