hysts HF Staff committed on
Commit 9a74aaf · 1 Parent(s): 1bb410b

commit files to HF hub

Files changed (1)
  1. papers.csv +2 -2
papers.csv CHANGED
@@ -849,7 +849,7 @@ Hyperparameters in Reinforcement Learning and How To Tune Them,"Theresa Eimer, M
  On Bridging the Gap between Mean Field and Finite Width in Deep Random Multilayer Perceptron with Batch Normalization,"Amir Joudaki, Hadi Daneshmand, Francis Bach",,,,,,,,,
  Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers,"Sitan Chen, Giannis Daras, Alexandros Dimakis",http://arxiv.org/abs/2303.03384,,https://huggingface.co/papers/2303.03384,,,,2303.03384,3,0
  Exact Inference in High-order Structured Prediction,"Chuyang Ke, Jean Honorio",http://arxiv.org/abs/2302.03236,,https://huggingface.co/papers/2302.03236,,,,2302.03236,2,0
- When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,0
+ When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,1
  Doubly Optimal No-Regret Learning in Monotone Games,"Yang Cai, Weiqiang Zheng",http://arxiv.org/abs/2301.13120,,https://huggingface.co/papers/2301.13120,,,,2301.13120,2,0
  Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows,"Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Solja\v{c}i\'{c}",,,,,,,,,
  Feature learning in deep classifiers through Intermediate Neural Collapse,"Akshay Rangamani, Marius Lindegaard, Tomer Galanti, Tomaso Poggio",,,,,,,,,
@@ -1473,7 +1473,7 @@ DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm,"Yunhao Tang, Tadas
  On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures,"Xian Yu, Lei Ying",http://arxiv.org/abs/2301.10932,,https://huggingface.co/papers/2301.10932,,,,2301.10932,2,0
  Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling,"Yuta Saito, Qingyang Ren, Thorsten Joachims",http://arxiv.org/abs/2305.08062,,https://huggingface.co/papers/2305.08062,,,,2305.08062,3,0
  Statistical Inference on Multi-armed Bandits with Delayed Feedback,"Lei Shi, Jingshen Wang, Tianhao Wu",,,,,,,,,
- Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,0
+ Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,1
  Additive Causal Bandits with Unknown Graph,"Alan Malek, Virginia Aglietti, Silvia Chiappa",http://arxiv.org/abs/2306.07858,,https://huggingface.co/papers/2306.07858,,,,2306.07858,3,1
  Minimizing Trajectory Curvature of ODE-based Generative Models,"Sangyun Lee, Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2301.12003,https://github.com/sangyun884/fast-ode,https://huggingface.co/papers/2301.12003,,,,2301.12003,3,0
  All in a Row: Compressed Convolution Networks for Graphs,"Junshu Sun, Shuhui Wang, XINZHE HAN, Zhe Xue, Qingming Huang",,,,,,,,,