Spaces:
Sleeping
Sleeping
commit files to HF hub
Browse files- papers.csv +2 -2
papers.csv
CHANGED
@@ -849,7 +849,7 @@ Hyperparameters in Reinforcement Learning and How To Tune Them,"Theresa Eimer, M
|
|
849 |
On Bridging the Gap between Mean Field and Finite Width in Deep Random Multilayer Perceptron with Batch Normalization,"Amir Joudaki, Hadi Daneshmand, Francis Bach",,,,,,,,,
|
850 |
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers,"Sitan Chen, Giannis Daras, Alexandros Dimakis",http://arxiv.org/abs/2303.03384,,https://huggingface.co/papers/2303.03384,,,,2303.03384,3,0
|
851 |
Exact Inference in High-order Structured Prediction,"Chuyang Ke, Jean Honorio",http://arxiv.org/abs/2302.03236,,https://huggingface.co/papers/2302.03236,,,,2302.03236,2,0
|
852 |
-
When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,
|
853 |
Doubly Optimal No-Regret Learning in Monotone Games,"Yang Cai, Weiqiang Zheng",http://arxiv.org/abs/2301.13120,,https://huggingface.co/papers/2301.13120,,,,2301.13120,2,0
|
854 |
Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows,"Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Solja\v{c}i\'{c}",,,,,,,,,
|
855 |
Feature learning in deep classifiers through Intermediate Neural Collapse,"Akshay Rangamani, Marius Lindegaard, Tomer Galanti, Tomaso Poggio",,,,,,,,,
|
@@ -1473,7 +1473,7 @@ DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm,"Yunhao Tang, Tadas
|
|
1473 |
On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures,"Xian Yu, Lei Ying",http://arxiv.org/abs/2301.10932,,https://huggingface.co/papers/2301.10932,,,,2301.10932,2,0
|
1474 |
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling,"Yuta Saito, Qingyang Ren, Thorsten Joachims",http://arxiv.org/abs/2305.08062,,https://huggingface.co/papers/2305.08062,,,,2305.08062,3,0
|
1475 |
Statistical Inference on Multi-armed Bandits with Delayed Feedback,"Lei Shi, Jingshen Wang, Tianhao Wu",,,,,,,,,
|
1476 |
-
Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,
|
1477 |
Additive Causal Bandits with Unknown Graph,"Alan Malek, Virginia Aglietti, Silvia Chiappa",http://arxiv.org/abs/2306.07858,,https://huggingface.co/papers/2306.07858,,,,2306.07858,3,1
|
1478 |
Minimizing Trajectory Curvature of ODE-based Generative Models,"Sangyun Lee, Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2301.12003,https://github.com/sangyun884/fast-ode,https://huggingface.co/papers/2301.12003,,,,2301.12003,3,0
|
1479 |
All in a Row: Compressed Convolution Networks for Graphs,"Junshu Sun, Shuhui Wang, XINZHE HAN, Zhe Xue, Qingming Huang",,,,,,,,,
|
|
|
849 |
On Bridging the Gap between Mean Field and Finite Width in Deep Random Multilayer Perceptron with Batch Normalization,"Amir Joudaki, Hadi Daneshmand, Francis Bach",,,,,,,,,
|
850 |
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers,"Sitan Chen, Giannis Daras, Alexandros Dimakis",http://arxiv.org/abs/2303.03384,,https://huggingface.co/papers/2303.03384,,,,2303.03384,3,0
|
851 |
Exact Inference in High-order Structured Prediction,"Chuyang Ke, Jean Honorio",http://arxiv.org/abs/2302.03236,,https://huggingface.co/papers/2302.03236,,,,2302.03236,2,0
|
852 |
+
When is Realizability Sufficient for Off-Policy Reinforcement Learning?,Andrea Zanette,http://arxiv.org/abs/2211.05311,,https://huggingface.co/papers/2211.05311,,,,2211.05311,1,1
|
853 |
Doubly Optimal No-Regret Learning in Monotone Games,"Yang Cai, Weiqiang Zheng",http://arxiv.org/abs/2301.13120,,https://huggingface.co/papers/2301.13120,,,,2301.13120,2,0
|
854 |
Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows,"Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Solja\v{c}i\'{c}",,,,,,,,,
|
855 |
Feature learning in deep classifiers through Intermediate Neural Collapse,"Akshay Rangamani, Marius Lindegaard, Tomer Galanti, Tomaso Poggio",,,,,,,,,
|
|
|
1473 |
On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures,"Xian Yu, Lei Ying",http://arxiv.org/abs/2301.10932,,https://huggingface.co/papers/2301.10932,,,,2301.10932,2,0
|
1474 |
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling,"Yuta Saito, Qingyang Ren, Thorsten Joachims",http://arxiv.org/abs/2305.08062,,https://huggingface.co/papers/2305.08062,,,,2305.08062,3,0
|
1475 |
Statistical Inference on Multi-armed Bandits with Delayed Feedback,"Lei Shi, Jingshen Wang, Tianhao Wu",,,,,,,,,
|
1476 |
+
Multi-User Reinforcement Learning with Low Rank Rewards,"Naman Agarwal, Prateek Jain, Dheeraj Nagaraj, Suhas Kowshik, Praneeth Netrapalli",http://arxiv.org/abs/2210.05355,,https://huggingface.co/papers/2210.05355,,,,2210.05355,5,1
|
1477 |
Additive Causal Bandits with Unknown Graph,"Alan Malek, Virginia Aglietti, Silvia Chiappa",http://arxiv.org/abs/2306.07858,,https://huggingface.co/papers/2306.07858,,,,2306.07858,3,1
|
1478 |
Minimizing Trajectory Curvature of ODE-based Generative Models,"Sangyun Lee, Beomsu Kim, Jongchul Ye",http://arxiv.org/abs/2301.12003,https://github.com/sangyun884/fast-ode,https://huggingface.co/papers/2301.12003,,,,2301.12003,3,0
|
1479 |
All in a Row: Compressed Convolution Networks for Graphs,"Junshu Sun, Shuhui Wang, XINZHE HAN, Zhe Xue, Qingming Huang",,,,,,,,,
|