ICML2023_papers

Running

App Files Files Community

hysts HF Staff commited on Mar 8, 2024

Commit

9a74aaf

1 Parent(s): 1bb410b

commit files to HF hub

Browse files

Files changed (1) hide show

papers.csv +2 -2

papers.csv CHANGED Viewed

@@ -849,7 +849,7 @@ Hyperparameters in Reinforcement Learning and How To Tune Them,"Theresa Eimer, M
 On Bridging the Gap between Mean Field and Finite Width in Deep Random Multilayer Perceptron with Batch Normalization,"Amir Joudaki, Hadi Daneshmand, Francis Bach",,,,,,,,,
 Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers,"Sitan Chen, Giannis Daras, Alexandros Dimakis",http://arxiv.org/abs/2303.03384,,https://huggingface.co/papers/2303.03384,,,,2303.03384,3,0
 Exact Inference in High-order Structured Prediction,"Chuyang Ke, Jean Honorio",http://arxiv.org/abs/2302.03236,,https://huggingface.co/papers/2302.03236,,,,2302.03236,2,0
-When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,0
 Doubly Optimal No-Regret Learning in Monotone Games,"Yang Cai, Weiqiang Zheng",http://arxiv.org/abs/2301.13120,,https://huggingface.co/papers/2301.13120,,,,2301.13120,2,0
 Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows,"Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Solja\v{c}i\'{c}",,,,,,,,,
 Feature learning in deep classifiers through Intermediate Neural Collapse,"Akshay Rangamani, Marius Lindegaard, Tomer Galanti, Tomaso Poggio",,,,,,,,,
@@ -1473,7 +1473,7 @@ DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm,"Yunhao Tang, Tadas
 On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures,"Xian Yu, Lei Ying",http://arxiv.org/abs/2301.10932,,https://huggingface.co/papers/2301.10932,,,,2301.10932,2,0
 Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling,"Yuta Saito, Qingyang Ren, Thorsten Joachims",http://arxiv.org/abs/2305.08062,,https://huggingface.co/papers/2305.08062,,,,2305.08062,3,0
 Statistical Inference on Multi-armed Bandits with Delayed Feedback,"Lei Shi, Jingshen Wang, Tianhao Wu",,,,,,,,,
-Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,0
 Additive Causal Bandits with Unknown Graph,"Alan Malek, Virginia Aglietti, Silvia Chiappa",http://arxiv.org/abs/2306.07858,,https://huggingface.co/papers/2306.07858,,,,2306.07858,3,1
 Minimizing Trajectory Curvature of ODE-based Generative Models,"Sangyun Lee, Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2301.12003,https://github.com/sangyun884/fast-ode,https://huggingface.co/papers/2301.12003,,,,2301.12003,3,0
 All in a Row: Compressed Convolution Networks for Graphs,"Junshu Sun, Shuhui Wang, XINZHE HAN, Zhe Xue, Qingming Huang",,,,,,,,,

 On Bridging the Gap between Mean Field and Finite Width in Deep Random Multilayer Perceptron with Batch Normalization,"Amir Joudaki, Hadi Daneshmand, Francis Bach",,,,,,,,,
 Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers,"Sitan Chen, Giannis Daras, Alexandros Dimakis",http://arxiv.org/abs/2303.03384,,https://huggingface.co/papers/2303.03384,,,,2303.03384,3,0
 Exact Inference in High-order Structured Prediction,"Chuyang Ke, Jean Honorio",http://arxiv.org/abs/2302.03236,,https://huggingface.co/papers/2302.03236,,,,2302.03236,2,0
+When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,1
 Doubly Optimal No-Regret Learning in Monotone Games,"Yang Cai, Weiqiang Zheng",http://arxiv.org/abs/2301.13120,,https://huggingface.co/papers/2301.13120,,,,2301.13120,2,0
 Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows,"Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Solja\v{c}i\'{c}",,,,,,,,,
 Feature learning in deep classifiers through Intermediate Neural Collapse,"Akshay Rangamani, Marius Lindegaard, Tomer Galanti, Tomaso Poggio",,,,,,,,,
 On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures,"Xian Yu, Lei Ying",http://arxiv.org/abs/2301.10932,,https://huggingface.co/papers/2301.10932,,,,2301.10932,2,0
 Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling,"Yuta Saito, Qingyang Ren, Thorsten Joachims",http://arxiv.org/abs/2305.08062,,https://huggingface.co/papers/2305.08062,,,,2305.08062,3,0
 Statistical Inference on Multi-armed Bandits with Delayed Feedback,"Lei Shi, Jingshen Wang, Tianhao Wu",,,,,,,,,
+Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,1
 Additive Causal Bandits with Unknown Graph,"Alan Malek, Virginia Aglietti, Silvia Chiappa",http://arxiv.org/abs/2306.07858,,https://huggingface.co/papers/2306.07858,,,,2306.07858,3,1
 Minimizing Trajectory Curvature of ODE-based Generative Models,"Sangyun Lee, Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2301.12003,https://github.com/sangyun884/fast-ode,https://huggingface.co/papers/2301.12003,,,,2301.12003,3,0
 All in a Row: Compressed Convolution Networks for Graphs,"Junshu Sun, Shuhui Wang, XINZHE HAN, Zhe Xue, Qingming Huang",,,,,,,,,