-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47 -
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper • 2402.14848 • Published • 18 -
A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Paper • 2407.02551 • Published • 7
Bo Yuan
BBexist
·
AI & ML interests
LLMs, MCMC
Recent Activity
new activity
10 days ago
Maxwell-Jia/AIME_2024:Errors in data
updated
a dataset
10 days ago
BBexist/AIME25
published
a dataset
10 days ago
BBexist/AIME25