ICML2023_papers

Running

App Files Files Community

hysts HF Staff commited on Jul 28, 2023

Commit

e92c33d

1 Parent(s): 88971bc

Upload papers.csv with huggingface_hub

Browse files

Files changed (1) hide show

papers.csv +34 -34

papers.csv CHANGED Viewed

@@ -172,7 +172,7 @@ Adaptive Identification of Populations with Treatment Benefit in Clinical Trials
 Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication,"Ajay Jaiswal, Shiwei Liu, Tianlong Chen,  Ding, Zhangyang “Atlas” Wang",,,,,,,,,
 A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification,"Jiachen Sun, Jiongxiao Wang, Weili Nie, Zhiding Yu, Zhuoqing Morley Mao, Chaowei Xiao",,,,,,,,,
 COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects,"Anlan Yu, Ning Lyu, Jieming Yin, Zhiyuan Yan, Wujie Wen",,,,,,,,,
-A Closer Look at Self-Supervised Lightweight Vision Transformers,"Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu",http://arxiv.org/abs/2205.14443,https://github.com/wangsr126/mae-lite,https://huggingface.co/papers/2205.14443,,,,2205.14443,5,0
 Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space,"Anas Barakat, Ilyas Fatkhullin, Niao He",http://arxiv.org/abs/2306.01854,,https://huggingface.co/papers/2306.01854,,,,2306.01854,3,0
 Leveraging Offline Data in Online Reinforcement Learning,"Andrew Wagenmaker, Aldo Pacchiano",http://arxiv.org/abs/2211.04974,,https://huggingface.co/papers/2211.04974,,,,2211.04974,2,0
 Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression,"Mo Zhou, Rong Ge",http://arxiv.org/abs/2302.00257,,https://huggingface.co/papers/2302.00257,,,,2302.00257,2,0
@@ -492,7 +492,7 @@ A Picture of the Space of Typical Learnable Tasks,"Rahul Ramesh, Jialin Mao, Ita
 Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time,"Toon Vanderschueren, Alicia Curth, Wouter Verbeke, Mihaela van der Schaar",http://arxiv.org/abs/2306.04255,,https://huggingface.co/papers/2306.04255,,,,2306.04255,4,0
 AudioLDM: Text-to-Audio Generation with Latent Diffusion Models,"Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley",http://arxiv.org/abs/2301.12503,,https://huggingface.co/papers/2301.12503,,,,2301.12503,8,1
 Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature,"Khang Nguyen, Nong Hieu, Vinh NGUYEN, Nhat Ho, Stanley Osher, TAN NGUYEN",http://arxiv.org/abs/2211.15779,,https://huggingface.co/papers/2211.15779,,,,2211.15779,6,1
-Lifelong Language Pretraining with Distribution-Specialized Experts,"Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cui",http://arxiv.org/abs/2305.12281,,https://huggingface.co/papers/2305.12281,,,,2305.12281,7,0
 Delay-agnostic Asynchronous Coordinate Update Algorithm,"Xuyang Wu, Changxin Liu, Sindri Magnússon, Mikael Johansson",http://arxiv.org/abs/2305.08535,,https://huggingface.co/papers/2305.08535,,,,2305.08535,4,1
 Prototype-oriented unsupervised anomaly detection for multivariate time series,"yuxin li, Wenchao Chen, Bo Chen, Dongsheng Wang, Long Tian, Mingyuan Zhou",,,,,,,,,
 ClimaX: A foundation model for weather and climate,"Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover",http://arxiv.org/abs/2301.10343,,https://huggingface.co/papers/2301.10343,,,,2301.10343,5,1
@@ -597,7 +597,7 @@ Flash: Concept Drift Adaptation in Federated Learning,"Kunjal Panchal, Sunav Cho
 Conformal Prediction Sets for Graph Neural Networks,"Soroush H. Zargarbashi, Simone Antonelli, Aleksandar Bojchevski",,,,,,,,,
 Probabilistic Attention-to-Influence Neural Models for Event Sequences,"Xiao Shou, DEBARUN BHATTACHARJYA, Tian Gao, Dharmashankar Subramanian, Oktie Hassanzadeh, Kristin Bennett",,,,,,,,,
 Nearly-tight Bounds for Deep Kernel Learning,"Yi-Fan Zhang, Min-Ling Zhang",,,,,,,,,
-Generalized Disparate Impact for Configurable Fairness Solutions in ML,"Luca Giuliani, Eleonora Misino, Michele Lombardi",http://arxiv.org/abs/2305.18504,,https://huggingface.co/papers/2305.18504,,,,2305.18504,3,0
 Thompson Sampling with Less Exploration is Fast and Optimal,"Tianyuan Jin, XIANGLIN YANG, Xiaokui Xiao, Pan Xu",,,,,,,,,
 Do Machine Learning Models Learn Statistical Rules Inferred from Data?,"Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong",http://arxiv.org/abs/2303.01433,https://github.com/DebugML/sqrl,https://huggingface.co/papers/2303.01433,,,,2303.01433,4,1
 Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations,"Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao",,,,,,,,,
@@ -608,7 +608,7 @@ In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Bo
 GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,1
 Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
 Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
-SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,0
 Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,,
 Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1
 A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,0
@@ -730,7 +730,7 @@ Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeye
 Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
 Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime,"Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar",http://arxiv.org/abs/2302.14154,,https://huggingface.co/papers/2302.14154,,,,2302.14154,4,0
 Double-Weighting for Covariate Shift Adaptation,"José I. Segovia-Martín, Santiago Mazuelas, Anqi Liu",http://arxiv.org/abs/2305.08637,,https://huggingface.co/papers/2305.08637,,,,2305.08637,3,0
-Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints,"Donghao Li, Ruiquan Huang, Cong Shen, Jing Yang",http://arxiv.org/abs/2306.06265,,https://huggingface.co/papers/2306.06265,,,,2306.06265,4,0
 PASTA: Pessimistic Assortment Optimization,"Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan Fang, Vahid Tarokh",http://arxiv.org/abs/2302.03821,,https://huggingface.co/papers/2302.03821,,,,2302.03821,6,0
 Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D,"Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Wei-Ying Ma, Yanyan Lan",,,,,,,,,
 Off-Policy Average Reward Actor-Critic with Deterministic Policy Search,"Naman Saxena, Subhojyoti Khastagir, Shishir Nadubettu Yadukumar, Shalabh Bhatnagar",http://arxiv.org/abs/2305.12239,,https://huggingface.co/papers/2305.12239,,,,2305.12239,4,0
@@ -833,7 +833,7 @@ High Probability Convergence of Stochastic Gradient Methods ,"Zijian Liu, Ta Duy
 Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model,"Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang",http://arxiv.org/abs/2303.08613,,https://huggingface.co/papers/2303.08613,,,,2303.08613,4,1
 SLAMB: Accelerated Large Batch Training with Sparse Communication,"Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, TingWen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis",,,,,,,,,
 Efficient Quantum Algorithms for Quantum Optimal Control,"Xiantao Li, Chunhao Wang",http://arxiv.org/abs/2304.02613,,https://huggingface.co/papers/2304.02613,,,,2304.02613,2,0
-Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation,"Aditya Mate, Bryan Wilder, Aparna Taneja, Milind Tambe",http://arxiv.org/abs/2302.02570,,https://huggingface.co/papers/2302.02570,,,,2302.02570,4,0
 Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization,"Jian Cao, Myeongjong Kang, Felix Jimenez, Huiyan Sang, Florian Schaefer, Matthias Katzfuss",http://arxiv.org/abs/2301.13303,,https://huggingface.co/papers/2301.13303,,,,2301.13303,6,0
 Efficient exploration via epistemic-risk-seeking policy gradients,Brendan O'Donoghue,,,,,,,,,
 Probing the Deep Neural Manifold of Reinforcement Learning to Expose Volatility,"Ezgi Korkmaz, Jonah Brown-Cohen",,,,,,,,,
@@ -871,7 +871,7 @@ Characterizing Multicalibration via Property Elicitation,"Georgy Noarov, Aaron R
 Cut your Losses with Squentropy,"Like Hui, Misha Belkin, Stephen Wright",http://arxiv.org/abs/2302.03952,,https://huggingface.co/papers/2302.03952,,,,2302.03952,3,0
 Multi-Agent Learning from Learners,"MINE M CALISKAN, Francesco Chini, Setareh Maghsudi",,,,,,,,,
 Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning,"Matthias Gerstgrasser, David Parkes",,,,,,,,,
-Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees,"Faisal Hamman, Erfaun Noorani, Saumitra Mishra, Daniele Magazzeni, Sanghamitra Dutta",http://arxiv.org/abs/2305.11997,,https://huggingface.co/papers/2305.11997,,,,2305.11997,5,0
 Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables,"Rick Wilming, Leo Kieslich, Benedict Clark, Stefan Haufe",http://arxiv.org/abs/2306.01464,,https://huggingface.co/papers/2306.01464,,,,2306.01464,4,1
 When do Minimax-fair Learning and Empirical Risk Minimization Coincide?,"Harvineet Singh, Matthäus Kleindessner, Volkan Cevher, Rumi Chunara, Chris Russell",,,,,,,,,
 Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows ,"Phillip Si, Zeyi Chen, Subham S Sahoo, Yair Schiff, Volodymyr Kuleshov",,,,,,,,,
@@ -977,7 +977,7 @@ On the Statistical Benefits of Temporal Difference Learning,"David Cheikhi, Dani
 Bayes-optimal Learning of Deep Random Networks of Extensive-width,"Hugo Cui, FLORENT KRZAKALA, Lenka Zdeborova",,,,,,,,,
 Adapting to game trees in zero-sum imperfect information games,"Côme Fiegel, Pierre Menard, Tadashi Kozuno, Remi Munos, Vianney Perchet, Michal Valko",http://arxiv.org/abs/2212.12567,,https://huggingface.co/papers/2212.12567,,,,2212.12567,6,0
 Adversarial Policies Beat Superhuman Go AIs,"Tony Wang, Adam Gleave, Tom Tseng, Nora Belrose, Kellin Pelrine, Joseph Miller, Michael Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell",http://arxiv.org/abs/2211.00241,,https://huggingface.co/papers/2211.00241,,,,2211.00241,11,1
-Pretraining Language Models with Human Preferences,"Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher Buckley, Jason Phang, Samuel Bowman, Ethan Perez",http://arxiv.org/abs/2302.08582,,https://huggingface.co/papers/2302.08582,,,,2302.08582,8,1
 Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples,"Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui XUE, Ruhui Ma, Haibing Guan",http://arxiv.org/abs/2302.04578,https://github.com/mist-project/mist.git,https://huggingface.co/papers/2302.04578,,,,2302.04578,9,0
 A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs,"Mikael Henaff, Minqi Jiang, Roberta Raileanu",http://arxiv.org/abs/2306.03236,,https://huggingface.co/papers/2306.03236,,,,2306.03236,3,0
 Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies,"Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai",http://arxiv.org/abs/2208.10264,,https://huggingface.co/papers/2208.10264,,,,2208.10264,3,0
@@ -989,7 +989,7 @@ Buying Information for Stochastic Optimization,"Mingchen Ma, Christos Tzamos",ht
 Towards Theoretical Understanding of Inverse Reinforcement Learning,"Alberto Maria Metelli, Filippo Lazzati, Marcello Restelli",http://arxiv.org/abs/2304.12966,,https://huggingface.co/papers/2304.12966,,,,2304.12966,3,0
 Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond,"Jaeyoung Cha, Jaewook Lee, Chulhee Yun",http://arxiv.org/abs/2303.07160,,https://huggingface.co/papers/2303.07160,,,,2303.07160,3,0
 Delayed Feedback in Kernel Bandits,"Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke",http://arxiv.org/abs/2302.00392,,https://huggingface.co/papers/2302.00392,,,,2302.00392,4,0
-Sharper Bounds for $\ell_p$ Sensitivity Sampling,"David Woodruff, Taisuke Yasuda",http://arxiv.org/abs/2306.00732,,https://huggingface.co/papers/2306.00732,,,,2306.00732,2,0
 Hyena Hierarchy: Towards Larger Convolutional Language Models,"Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Re",http://arxiv.org/abs/2302.10866,,https://huggingface.co/papers/2302.10866,,,,2302.10866,9,0
 Delving into Noisy Label Detection with Clean Data,"Chenglin Yu, Xinsong Ma, Weiwei Liu",,,,,,,,,
 GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration,"Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",http://arxiv.org/abs/2301.12686,,https://huggingface.co/papers/2301.12686,,,,2301.12686,7,1
@@ -1056,7 +1056,7 @@ POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models,"Kor
 Towards Omni-generalizable Neural Methods for Vehicle Routing Problems,"Jianan Zhou, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang",http://arxiv.org/abs/2305.19587,https://github.com/RoyalSkye/Omni-VRP,https://huggingface.co/papers/2305.19587,,,,2305.19587,5,0
 Protecting Language Generation Models via Invisible Watermarking,"Xuandong Zhao, Yu-Xiang Wang, Lei Li",http://arxiv.org/abs/2302.03162,,https://huggingface.co/papers/2302.03162,,,,2302.03162,3,0
 Global Optimization with Parametric Function Approximation,"Chong Liu, Yu-Xiang Wang",http://arxiv.org/abs/2211.09100,,https://huggingface.co/papers/2211.09100,,,,2211.09100,2,0
-Non-stationary Reinforcement Learning under General Function Approximation,"Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin LIANG",http://arxiv.org/abs/2306.00861,,https://huggingface.co/papers/2306.00861,,,,2306.00861,6,0
 Demystifying Disagreement-on-the-Line in High Dimensions,"Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed Hassani",http://arxiv.org/abs/2301.13371,,https://huggingface.co/papers/2301.13371,,,,2301.13371,5,0
 Multisample Flow Matching: Straightening Flows with Minibatch Couplings,"Aram-Alexandre Pooladian, Heli Ben-Hamu, Carles Domingo i Enrich, Brandon Amos, Yaron Lipman, Ricky T. Q. Chen",http://arxiv.org/abs/2304.14772,,https://huggingface.co/papers/2304.14772,,,,2304.14772,6,1
 Competitive Gradient Optimization,"Abhijeet Vyas, Brian Bullins, Kamyar Azizzadenesheli",http://arxiv.org/abs/2205.14232,,https://huggingface.co/papers/2205.14232,,,,2205.14232,2,0
@@ -1096,7 +1096,7 @@ Identifying Interpretable Subspaces in Image Representations,"Neha Mukund Kalibh
 LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian Walder, Spiridon Penev, Richard Nock",http://arxiv.org/abs/2301.11695,,https://huggingface.co/papers/2301.11695,,,,2301.11695,4,0
 R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
 High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
-COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,0
 Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,3
 HyperTuning:  Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
 Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,0
@@ -1115,7 +1115,7 @@ Bootstrapped Representations in Reinforcement Learning,"Charline Le Lan, Stephen
 Quantile Credit Assignment,"Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Remi Munos",,,,,,,,,
 Understanding Self-Predictive Learning for Reinforcement Learning,"Yunhao Tang, Zhaohan Guo, Pierre Richemond, Bernardo Avila Pires, Yash Chandak, Remi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, Andras Gyorgy, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko",http://arxiv.org/abs/2212.03319,,https://huggingface.co/papers/2212.03319,,,,2212.03319,16,0
 Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning,"Brett Daley, Martha White, Christopher Amato, Marlos C. Machado",http://arxiv.org/abs/2301.11321,,https://huggingface.co/papers/2301.11321,,,,2301.11321,4,0
-"For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal","Yingdong Hu, Renhao Wang, Li Li, Yang Gao",http://arxiv.org/abs/2304.04591,,https://huggingface.co/papers/2304.04591,,,,2304.04591,4,0
 Weakly Supervised Regression with Interval Targets,"Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, LEI FENG",,,,,,,,,
 Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents,"WENHAO XU, Xuefeng Gao, Xuedong He",http://arxiv.org/abs/2301.12601,,https://huggingface.co/papers/2301.12601,,,,2301.12601,3,0
 Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity,"Xuxing Chen, Minhui Huang, Shiqian Ma, Krishna Balasubramanian",http://arxiv.org/abs/2210.12839,,https://huggingface.co/papers/2210.12839,,,,2210.12839,4,0
@@ -1140,7 +1140,7 @@ Phase Transitions in the Detection of Correlated Databases,"Dor Elimelech, Wasim
 New metrics and search algorithms for weighted causal DAGs,"Davin Choo, Kirankumar Shiragur",http://arxiv.org/abs/2305.04445,,https://huggingface.co/papers/2305.04445,,,,2305.04445,2,0
 CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms,"Shengyi Huang, Rousslan Fernand Julien Dossa, Chang Ye, Jeff Braga, Dipam Chakraborty, Kinal Mehta, João Madeira Araujo",http://arxiv.org/abs/2111.08819,https://github.com/vwxyzjn/cleanrl,https://huggingface.co/papers/2111.08819,,,,2111.08819,4,1
 Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning,"Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Khan Emtiyaz, Mark Schmidt",http://arxiv.org/abs/2302.09738,https://github.com/yorkerlin/StructuredNGD-DL,https://huggingface.co/papers/2302.09738,,,,2302.09738,6,0
-Polarity Is All You Need to Learn and Transfer Faster,"Alice (Qingyang) Wang, Michael Powell, Eric Bridgeford, Ali Geisa, Joshua Vogelstein",http://arxiv.org/abs/2303.17589,,https://huggingface.co/papers/2303.17589,,,,2303.17589,5,0
 Scaling Vision Transformers to 22 Billion Parameters,"Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Collier, Alexey Gritsenko, Vighnesh N Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetic, Dustin Tran, Thomas Kipf, Mario Lucic, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby",http://arxiv.org/abs/2302.05442,,https://huggingface.co/papers/2302.05442,,,,2302.05442,22,0
 Toward Fair and Robust Estimation of Optimal Treatment Regimes,"Kwangho Kim, Jose Zubizarreta",,,,,,,,,
 Internally Rewarded Reinforcement Learning,"Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter",http://arxiv.org/abs/2302.00270,,https://huggingface.co/papers/2302.00270,,,,2302.00270,5,0
@@ -1154,7 +1154,7 @@ Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming,
 Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss,"Pierre Bréchet, Katerina Papagiannouli, Jing An, Guido Montufar",http://arxiv.org/abs/2303.03027,,https://huggingface.co/papers/2303.03027,,,,2303.03027,4,0
 Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach,"Yanwei Jia, Xun Yu Zhou",http://arxiv.org/abs/2108.06655,,https://huggingface.co/papers/2108.06655,,,,2108.06655,2,0
 VIMA: Robot Manipulation with Multimodal Prompts,"Yunfan Jiang, Agrim Gupta, Zichen Zhang, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei, Anima Anandkumar, Yuke Zhu, Jim Fan",,,,,,,,,
-StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes,"Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry, Sayan Ranu, N M Anoop Krishnan",http://arxiv.org/abs/2301.12477,,https://huggingface.co/papers/2301.12477,,,,2301.12477,5,0
 Multi-agent Online Scheduling: MMS Allocations for Indivisible Items,"Shengwei Zhou, Rufan Bai, Xiaowei Wu",http://arxiv.org/abs/2304.13405,,https://huggingface.co/papers/2304.13405,,,,2304.13405,3,0
 Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries,"Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Solja\v{c}i\'{c}, Akash Srivastava",http://arxiv.org/abs/2303.02484,,https://huggingface.co/papers/2303.02484,,,,2303.02484,8,0
 NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation,"Jianfeng Wang, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Thomas Lukasiewicz",,,,,,,,,
@@ -1178,13 +1178,13 @@ Instrumental Variable Estimation of Average Partial Causal Effects,"Yuta Kawakam
 Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process,"Yidong Ouyang, Liyan Xie, Guang Cheng",,,,,,,,,
 MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks,"Wenfang Sun, Yingjun Du, Xiantong Zhen, Fan Wang, Ling Wang, Cees Snoek",http://arxiv.org/abs/2305.10309,,https://huggingface.co/papers/2305.10309,,,,2305.10309,6,0
 Provable Dynamic Fusion for Low-Quality Multimodal Data,"qingyang zhang, Haitao Wu, Changqing Zhang, Qinghua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng",http://arxiv.org/abs/2306.02050,,https://huggingface.co/papers/2306.02050,,,,2306.02050,7,0
-Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering,"Erlin Pan, zhao kang",http://arxiv.org/abs/2305.02931,,https://huggingface.co/papers/2305.02931,,,,2305.02931,2,0
 SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation,"Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li",http://arxiv.org/abs/2211.14813,https://github.com/ArrowLuo/SegCLIP,https://huggingface.co/papers/2211.14813,,,,2211.14813,5,0
 Explainability as statistical inference,"Hugo Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei",http://arxiv.org/abs/2212.03131,,https://huggingface.co/papers/2212.03131,,,,2212.03131,4,0
 Learning Prescriptive ReLU Networks,"Wei Sun, Asterios Tsiourvas",http://arxiv.org/abs/2306.00651,,https://huggingface.co/papers/2306.00651,,,,2306.00651,2,0
 Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions,"Lin-Han Jia, Lan-Zhe Guo, Zhi Zhou, Jie-Jing Shao, Yuke Xiang, Yu-Feng Li",,,,,,,,,
 Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels,"Simone Bombari, Shayan Kiyani, Marco Mondelli",http://arxiv.org/abs/2302.01629,,https://huggingface.co/papers/2302.01629,,,,2302.01629,3,0
-Human-Timescale Adaptation in an Open-Ended Task Space,"Jakob Bauer, Kate Baumli, Feryal Behbahani, Avishkar Bhoopchand, Natalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Satinder Singh, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang",http://arxiv.org/abs/2301.07608,,https://huggingface.co/papers/2301.07608,,,,2301.07608,22,0
 Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Linear Speedup and Partial Participation,"Xiaoyun Li, Ping Li",,,,,,,,,
 ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang",http://arxiv.org/abs/2301.12040,,https://huggingface.co/papers/2301.12040,,,,2301.12040,4,0
 Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
@@ -1192,7 +1192,7 @@ Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wa
 Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
 Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,0
 Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
-Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,0
 Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
 Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice,"Yishay Mansour, Richard Nock, Robert C. Williamson",,,,,,,,,
 "Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods","Aleksandr Shevchenko, Kevin Kögler, Hamed Hassani, Marco Mondelli",http://arxiv.org/abs/2212.13468,,https://huggingface.co/papers/2212.13468,,,,2212.13468,4,0
@@ -1203,12 +1203,12 @@ Generalized Teacher Forcing for Learning Chaotic Dynamics,"Florian Hess, Zahra M
 HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption,"Seewoo Lee, Garam Lee, Jung Woo Kim, Junbum Shin, Mun-Kyu Lee",,,,,,,,,
 Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations,David Wipf,,,,,,,,,
 Direct Parameterization of Lipschitz-Bounded Deep Networks,"Ruigang Wang, Ian Manchester",http://arxiv.org/abs/2301.11526,https://github.com/acfr/LBDN,https://huggingface.co/papers/2301.11526,,,,2301.11526,2,0
-XAI Beyond Classification: Interpretable Neural Clustering,"Xi Peng, Yunfan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou",http://arxiv.org/abs/1808.07292,,https://huggingface.co/papers/1808.07292,,,,1808.07292,6,0
 Exploiting locality in high-dimensional Factorial hidden Markov models,"Lorenzo Rimella, Nick Whiteley",http://arxiv.org/abs/1902.01639,,https://huggingface.co/papers/1902.01639,,,,1902.01639,2,0
 Mitigating the Effects of Non-Identifiability on Inference for Bayesian Neural Networks with Latent Variables,"Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez",http://arxiv.org/abs/1911.00569,,https://huggingface.co/papers/1911.00569,,,,1911.00569,3,0
 Project and Forget: Solving Large-Scale Metric Constrained Problems,"Rishi Sonthalia, Anna C. Gilbert",http://arxiv.org/abs/2005.03853,,https://huggingface.co/papers/2005.03853,,,,2005.03853,2,0
 "Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence","Julie Nutini, Issam Laradji, Mark Schmidt",http://arxiv.org/abs/1712.08859,,https://huggingface.co/papers/1712.08859,,,,1712.08859,3,0
-Cluster-Specific Predictions with Multi-Task Gaussian Processes,"Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey",http://arxiv.org/abs/2011.07866,,https://huggingface.co/papers/2011.07866,,,,2011.07866,4,0
 Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials,"Daiqi Gao, Yufeng Liu, Donglin Zeng",,,,,,,,,
 Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks,"Aleksandr Shevchenko, Vyacheslav Kungurtsev, Marco Mondelli",http://arxiv.org/abs/2111.02278,,https://huggingface.co/papers/2111.02278,,,,2111.02278,3,0
 "Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism","Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos",http://arxiv.org/abs/2012.11579,,https://huggingface.co/papers/2012.11579,,,,2012.11579,4,0
@@ -1219,7 +1219,7 @@ Knowledge Hypergraph Embedding Meets Relational Algebra,"Bahare Fatemi, Perouz T
 Deep linear networks can benignly overfit when shallow ones do,"Niladri S. Chatterji, Phil Long",http://arxiv.org/abs/2209.09315,,https://huggingface.co/papers/2209.09315,,,,2209.09315,2,0
 Taming graph kernels with random features,Krzysztof Choromanski,http://arxiv.org/abs/2305.00156,,https://huggingface.co/papers/2305.00156,,,,2305.00156,1,0
 On Uni-Modal Feature Learning in Supervised Multi-Modal Learning,"Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, Hang Zhao",http://arxiv.org/abs/2305.01233,,https://huggingface.co/papers/2305.01233,,,,2305.01233,8,0
-CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations,"Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon",http://arxiv.org/abs/2305.01118,,https://huggingface.co/papers/2305.01118,,,,2305.01118,5,1
 CLIPood: Generalizing CLIP to Out-of-Distributions,"Yang Shu, Xingzhuo Guo, Jialong Wu, Ximei Wang, Jianmin Wang, Mingsheng Long",http://arxiv.org/abs/2302.00864,,https://huggingface.co/papers/2302.00864,,,,2302.00864,6,0
 Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning,"Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, Tarek Abdelzaher, Jiawei Han",http://arxiv.org/abs/2211.03044,,https://huggingface.co/papers/2211.03044,,,,2211.03044,6,1
 Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization,"Jiwoo Son, Minsu Kim, Hyeonah Kim, Jinkyoo Park",,,,,,,,,
@@ -1251,7 +1251,7 @@ Test-time Adaptation with Slot-Centric Models,"Mihir Prabhudesai, Anirudh Goyal,
 Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification,"Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan",,,,,,,,,
 Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least,"Siddharth Joshi, Baharan Mirzasoleiman",http://arxiv.org/abs/2302.09195,,https://huggingface.co/papers/2302.09195,,,,2302.09195,2,0
 Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs,"Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-yi Lee, Shao-Hua Sun",http://arxiv.org/abs/2301.12950,,https://huggingface.co/papers/2301.12950,,,,2301.12950,5,0
-Cooperative Open-ended Learning Framework for Zero-Shot Coordination,"Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan",http://arxiv.org/abs/2302.04831,,https://huggingface.co/papers/2302.04831,,,,2302.04831,7,0
 CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design,"Desi Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster",,,,,,,,,
 On the Identifiability and Estimation of Causal Location-Scale Noise Models,"Alexander Immer, Christoph Schultheiss, Julia Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx",http://arxiv.org/abs/2210.09054,,https://huggingface.co/papers/2210.09054,,,,2210.09054,6,0
 From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders,"Raanan Yehezkel Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik",http://arxiv.org/abs/2306.00624,,https://huggingface.co/papers/2306.00624,,,,2306.00624,4,0
@@ -1267,8 +1267,8 @@ Non-autoregressive Conditional Diffusion Models for Time Series Prediction,"Life
 Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions,"Leo Klarner, Tim G. J. Rudner, Michael Reutlinger, Torsten Schindler, Garrett Morris, Charlotte Deane, Yee-Whye Teh",,,,,,,,,
 SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series,"Iris Huijben, Arthur A. Nijdam, Sebastiaan Overeem, Merel Van Gilst, Ruud J. G. van Sloun",,,,,,,,,
 Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction,"Minghao Guo, Veronika Thost, Samuel Song, Adithya Balachandran, Payel Das, Jie Chen, Wojciech Matusik",,,,,,,,,
-A Closer Look at the Intervention Procedure of Concept Bottleneck Models,"Sungbin Shin, Yohan Jo, Sungsoo Ahn, Namhoon Lee",http://arxiv.org/abs/2302.14260,,https://huggingface.co/papers/2302.14260,,,,2302.14260,4,0
-Simple Hardware-Efficient Long Convolutions for Sequence Modeling,"Daniel Y Fu, Elliot L Epstein, Eric Nguyen, Michael Zhang, Tri Dao, Atri Rudra, Christopher Re",http://arxiv.org/abs/2302.06646,,https://huggingface.co/papers/2302.06646,,,,2302.06646,8,0
 Towards Controlled Data Augmentations for Active Learning,"Jianan Yang, Jianan Yang, Haobo Wang, Sai Wu, Gang Chen, Junbo Zhao",,,,,,,,,
 "Bigger, Better, Faster: Human-level Atari with human-level efficiency","Max Schwarzer, Johan Obando Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro",http://arxiv.org/abs/2305.19452,https://github.com/google-research/google-research/tree/master/bigger_better_faster,https://huggingface.co/papers/2305.19452,,,,2305.19452,6,3
 A Law of Robustness beyond Isoperimetry,"Yihan Wu, Heng Huang, Hongyang Zhang",http://arxiv.org/abs/2202.11592,,https://huggingface.co/papers/2202.11592,,,,2202.11592,3,0
@@ -1300,7 +1300,7 @@ ContraBAR: Contrastive Bayes-Adaptive Deep RL,"Era Choshen, Aviv Tamar",http://a
 Guiding Pretraining in Reinforcement Learning with Large Language Models,"Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas",http://arxiv.org/abs/2302.06692,,https://huggingface.co/papers/2302.06692,,,,2302.06692,8,0
 PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient,"Kaixin Wang, Zhou Daquan, Jiashi Feng, Shie Mannor",,,,,,,,,
 Differentially Private Sharpness-Aware Training,"Jinseong Park, Hoki Kim, Yujin Choi, Jaewook Lee",http://arxiv.org/abs/2306.05651,https://github.com/jinseongP/DPSAT,https://huggingface.co/papers/2306.05651,,,,2306.05651,4,0
-Provably and Practically Efficient Neural Contextual Bandits,Sudeep Salgia,http://arxiv.org/abs/2206.00099,,https://huggingface.co/papers/2206.00099,,,,2206.00099,3,0
 How Does Information Bottleneck Help Deep Learning?,"Kenji Kawaguchi, Zhun Deng, Xu Ji, Jiaoyang Huang",http://arxiv.org/abs/2305.18887,https://github.com/xu-ji/information-bottleneck,https://huggingface.co/papers/2305.18887,,,,2305.18887,4,0
 Why Is Public Pretraining Necessary for Private Model Training?,"Arun Ganesh, Mahdi Haghifam, Milad Nasresfahani, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Guha Thakurta, Lun Wang",http://arxiv.org/abs/2302.09483,,https://huggingface.co/papers/2302.09483,,,,2302.09483,8,0
 Learning Instance-Specific Augmentations by Capturing Local Invariances,"Ning Miao, Tom Rainforth, Emile Mathieu, Yann Dubois, Yee-Whye Teh, Adam Foster, Hyunjik Kim",http://arxiv.org/abs/2206.00051,,https://huggingface.co/papers/2206.00051,,,,2206.00051,7,0
@@ -1360,7 +1360,7 @@ Featured Graph Coarsening with Similarity Guarantees,"MANOJ KUMAR, Anurag Sharma
 Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability,"Jianing Zhu, Hengzhuang Li, Jiangchao Yao, Tongliang Liu, Jianliang Xu, Bo Han",http://arxiv.org/abs/2306.03715,https://github.com/tmlr-group/Unleashing-Mask,https://huggingface.co/papers/2306.03715,,,,2306.03715,6,0
 Conditional Graph Information Bottleneck for Molecular Relational Learning,"Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park",http://arxiv.org/abs/2305.01520,https://github.com/Namkyeong/CGIB,https://huggingface.co/papers/2305.01520,,,,2305.01520,6,0
 Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang",http://arxiv.org/abs/2305.14876,https://github.com/bboylyg/RNP,https://huggingface.co/papers/2305.14876,,,,2305.14876,7,0
-Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,0
 Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
 CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
 Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,0
@@ -1407,12 +1407,12 @@ Omnipredictors for Constrained Optimization,"Lunjia Hu, Inbal Livni Navon, Omer
 Bandit Online Linear Optimization with Hints and Queries,"Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit",,,,,,,,,
 Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective,"Tanya Marwah, Zachary Lipton, Jianfeng Lu, Andrej Risteski",http://arxiv.org/abs/2210.12101,,https://huggingface.co/papers/2210.12101,,,,2210.12101,4,0
 Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise,"Shiwei Zeng, Jie Shen",http://arxiv.org/abs/2306.00673,,https://huggingface.co/papers/2306.00673,,,,2306.00673,2,0
-Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes,"seyed amir saberi, Amir Najafi, Abolfazl Motahari, Babak Khalaj",http://arxiv.org/abs/2209.05953,,https://huggingface.co/papers/2209.05953,,,,2209.05953,4,0
 "Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps","Marco Cuturi, Michal Klein, Pierre Ablin",http://arxiv.org/abs/2302.04065,,https://huggingface.co/papers/2302.04065,,,,2302.04065,3,0
 Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance,"Zhao Song, Xin Yang, Yuanyuan Yang, Lichen Zhang",http://arxiv.org/abs/2210.11542,,https://huggingface.co/papers/2210.11542,,,,2210.11542,4,0
 Combinatorial Neural Bandits,"Taehyun Hwang, Kyuwook Chai, Min-hwan Oh",http://arxiv.org/abs/2306.00242,,https://huggingface.co/papers/2306.00242,,,,2306.00242,3,0
 Reward-Mixing MDPs with Few Contexts are Learnable,"Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor",,,,,,,,,
-Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling,"Adam Bouland, Yosheb Getachew, Yujia Jin, Aaron Sidford, Kevin Tian",http://arxiv.org/abs/2301.03763,,https://huggingface.co/papers/2301.03763,,,,2301.03763,5,0
 Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits,Chen Wang,http://arxiv.org/abs/2306.02208,,https://huggingface.co/papers/2306.02208,,,,2306.02208,1,0
 Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation,"Li'ang Li, Yifei duan, Guanghua Ji, Yongqiang Cai",http://arxiv.org/abs/2305.18460,,https://huggingface.co/papers/2305.18460,,,,2305.18460,4,0
 Dynamical Linear Bandits,"Marco Mussi, Alberto Maria Metelli, Marcello Restelli",http://arxiv.org/abs/2211.08997,,https://huggingface.co/papers/2211.08997,,,,2211.08997,3,0
@@ -1497,7 +1497,7 @@ Who Needs to Know? Minimal Knowledge for Optimal Coordination,"Niklas Lauffer, A
 Neural networks trained with SGD learn distributions of increasing complexity,"Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt",http://arxiv.org/abs/2211.11567,,https://huggingface.co/papers/2211.11567,,,,2211.11567,3,0
 Scaling Laws for Multilingual Neural Machine Translation,"Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia, Markus Freitag, Orhan Firat",http://arxiv.org/abs/2302.09650,,https://huggingface.co/papers/2302.09650,,,,2302.09650,5,0
 Explaining the effects of non-convergent MCMC in the training of Energy-Based Models,"Elisabeth Agoritsas, Giovanni Catania, Aurélien Decelle, Beatriz Seoane",,,,,,,,,
-A Three-regime Model of Network Pruning,"Yefan Zhou, Yaoqing Yang, Arin Chang, Michael Mahoney",http://arxiv.org/abs/2305.18383,,https://huggingface.co/papers/2305.18383,,,,2305.18383,4,0
 Metagenomic Binning using Connectivity-constrained Variational Autoencoders,"Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas D. Nielsen",,,,,,,,,
 SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning,"Dongseok Shim, Seungjae Lee, H. Kim",http://arxiv.org/abs/2301.11520,,https://huggingface.co/papers/2301.11520,,,,2301.11520,3,0
 Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation,"Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, Siu Ming Yiu, Ruihua Han",,,,,,,,,
@@ -1621,7 +1621,7 @@ On the Interplay Between Misspecification and Sub-optimality Gap in Linear Conte
 Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
 On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
 Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,0
-A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,0
 Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
 Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,0
 On the Robustness of Randomized Ensembles to Adversarial Perturbations,"Hassan Dbouk, Naresh Shanbhag",http://arxiv.org/abs/2302.01375,https://github.com/hsndbk4/BARRE,https://huggingface.co/papers/2302.01375,,,,2302.01375,2,0
@@ -1647,7 +1647,7 @@ LinSATNet: The Positive Linear Satisfiability Neural Networks,"Runzhong Wang, Yu
 On the Complexity of Bayesian Generalization,"Yu-Zhe Shi, Manjie Xu, John Hopcroft, Kun He, Josh Tenenbaum, Song-Chun Zhu, Ying Nian Wu, Wenjuan Han, Yixin Zhu",http://arxiv.org/abs/2211.11033,,https://huggingface.co/papers/2211.11033,,,,2211.11033,9,0
 QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark,"Xudong Lu, Kaisen Pan, Ge Yan, Jiaming Shan, Wenjie Wu, Junchi Yan",,,,,,,,,
 Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits,Markus Bläser,,,,,,,,,
-PAL: Program-aided Language Models,"Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig",http://arxiv.org/abs/2211.10435,,https://huggingface.co/papers/2211.10435,,,,2211.10435,8,1
 Tighter Bounds on the Expressivity of Transformer Encoders,"David Chiang, Peter Cholak, Anand Pillay",http://arxiv.org/abs/2301.10743,,https://huggingface.co/papers/2301.10743,,,,2301.10743,3,0
 Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation,"Joonhyuk Yang, Shin Dongpil, Hye Won Chung",http://arxiv.org/abs/2305.19666,,https://huggingface.co/papers/2305.19666,,,,2305.19666,3,0
 Causal Discovery with Latent Confounders Based on Higher-Order Cumulants,"Ruichu Cai, Zhiyi Huang, Wei Chen, Zhifeng Hao, Kun Zhang",http://arxiv.org/abs/2305.19582,,https://huggingface.co/papers/2305.19582,,,,2305.19582,5,0
@@ -1685,7 +1685,7 @@ Robustness in Multimodal Learning under Train-Test Modality Mismatch,"Brandon Mc
 Learning Representations without Compositional Assumptions,"Tennison Liu, Jeroen Berrevoets, Zhaozhi Qian, Mihaela van der Schaar",http://arxiv.org/abs/2305.19726,,https://huggingface.co/papers/2305.19726,,,,2305.19726,4,0
 Making Transformers Compute-lite for CPU inference,"Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh",,,,,,,,,
 Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers,"Grant Strimel, Yi Xie, Brian King, martin radfar, Ariya Rastrow, Athanasios Mouchtaris",http://arxiv.org/abs/2305.04159,,https://huggingface.co/papers/2305.04159,,,,2305.04159,6,0
-Expected Gradients of Maxout Networks and Consequences to Parameter Initialization,"Hanna Tseran, Guido Montufar",http://arxiv.org/abs/2301.06956,,https://huggingface.co/papers/2301.06956,,,,2301.06956,2,0
 Competing for Shareable Arms in Multi-Player Multi-Armed Bandits,"Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui",http://arxiv.org/abs/2305.19158,,https://huggingface.co/papers/2305.19158,,,,2305.19158,5,1
 Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs,"Raif Rustamov, Subhabrata Majumdar",http://arxiv.org/abs/2010.15285,,https://huggingface.co/papers/2010.15285,,,,2010.15285,2,1
 Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets,"Yurong Chen, Zhaohua Chen, Xiaotie Deng, Zhijian Duan, Haoran Sun, Qian Wang, Xiang Yan",http://arxiv.org/abs/2306.07709,,https://huggingface.co/papers/2306.07709,,,,2306.07709,7,0
@@ -1699,12 +1699,12 @@ Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization,
 One-Step Estimator for Permuted Sparse Recovery,"Hang Zhang, Ping Li",,,,,,,,,
 Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator,Alexander Shekhovtsov,,,,,,,,,
 Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection,"Lorenzo Perini, Paul Buerkner, Arto Klami",http://arxiv.org/abs/2210.10487,,https://huggingface.co/papers/2210.10487,,,,2210.10487,3,0
-Image generation with shortest path diffusion,"Ayan Das, Ayan Das, Stathi Fotiadis, Anil Batra, Farhang Nabiei, FengTing Liao, Sattar Vakili, Da-shan Shiu, Alberto Bernacchia",http://arxiv.org/abs/2306.00501,,https://huggingface.co/papers/2306.00501,,,,2306.00501,8,0
 Deep Anomaly Detection under Labeling Budget Constraints,"Aodong Li, Chen Qiu, Padhraic Smyth, Marius Kloft, Stephan Mandt, Maja Rudolph",http://arxiv.org/abs/2302.07832,,https://huggingface.co/papers/2302.07832,,,,2302.07832,6,0
 Transformed Distribution Matching for Missing Value Imputation,"He Zhao, Ke Sun, Amir Dezfouli, Edwin V Bonilla",http://arxiv.org/abs/2302.10363,,https://huggingface.co/papers/2302.10363,,,,2302.10363,4,0
 Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?,"Ruisi Cai, Zhenyu Zhang, Zhangyang “Atlas” Wang",http://arxiv.org/abs/2302.12480,,https://huggingface.co/papers/2302.12480,,,,2302.12480,3,0
 Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models,"Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel",,,,,,,,,
-Better Diffusion Models Further Improve Adversarial Training,"Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng YAN",http://arxiv.org/abs/2302.04638,https://github.com/wzekai99/DM-Improves-AT,https://huggingface.co/papers/2302.04638,,,,2302.04638,6,0
 On the Expressive Power of Geometric Graph Neural Networks,"Chaitanya Joshi, Cristian Bodnar, Simon Mathis, Taco Cohen, Pietro Lió",http://arxiv.org/abs/2301.09308,https://github.com/chaitjo/geometric-gnn-dojo,https://huggingface.co/papers/2301.09308,,,,2301.09308,5,0
 Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapalli,http://arxiv.org/abs/2306.04004,,https://huggingface.co/papers/2306.04004,,,,2306.04004,1,1
 Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,0
@@ -1780,7 +1780,7 @@ The Monge Gap: A Regularizer to Learn All Transport Maps,"Théo Uscidda, Marco C
 AbODE: Ab initio antibody design using conjoined ODEs,"Yogesh Verma, Markus Heinonen, Vikas K Garg",http://arxiv.org/abs/2306.01005,,https://huggingface.co/papers/2306.01005,,,,2306.01005,3,0
 Learning-augmented private algorithms for multiple quantile release,"Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii",http://arxiv.org/abs/2210.11222,,https://huggingface.co/papers/2210.11222,,,,2210.11222,4,0
 Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds,"Shengshi Li, Lin Yang",,,,,,,,,
-Variational Autoencoding Neural Operators,"Jacob H. Seidman, Georgios Kissas, George J. Pappas, Paris Perdikaris",http://arxiv.org/abs/2302.10351,,https://huggingface.co/papers/2302.10351,,,,2302.10351,4,0
 Efficient Parametric Approximations of Neural Network Function Space Distance,"Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse",http://arxiv.org/abs/2302.03519,,https://huggingface.co/papers/2302.03519,,,,2302.03519,4,0
 Theory on Forgetting and Generalization of Continual Learning,"Sen Lin, Peizhong Ju, Yingbin LIANG, Ness Shroff",http://arxiv.org/abs/2302.05836,,https://huggingface.co/papers/2302.05836,,,,2302.05836,4,0
 Trapdoor Normalization with Irreversible Ownership Verification,"Hanwen Liu, Zhenyu Weng, Yuesheng Zhu, Yadong Mu",,,,,,,,,

 Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication,"Ajay Jaiswal, Shiwei Liu, Tianlong Chen,  Ding, Zhangyang “Atlas” Wang",,,,,,,,,
 A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification,"Jiachen Sun, Jiongxiao Wang, Weili Nie, Zhiding Yu, Zhuoqing Morley Mao, Chaowei Xiao",,,,,,,,,
 COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects,"Anlan Yu, Ning Lyu, Jieming Yin, Zhiyuan Yan, Wujie Wen",,,,,,,,,
+A Closer Look at Self-Supervised Lightweight Vision Transformers,"Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu",http://arxiv.org/abs/2205.14443,https://github.com/wangsr126/mae-lite,https://huggingface.co/papers/2205.14443,,,,2205.14443,5,1
 Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space,"Anas Barakat, Ilyas Fatkhullin, Niao He",http://arxiv.org/abs/2306.01854,,https://huggingface.co/papers/2306.01854,,,,2306.01854,3,0
 Leveraging Offline Data in Online Reinforcement Learning,"Andrew Wagenmaker, Aldo Pacchiano",http://arxiv.org/abs/2211.04974,,https://huggingface.co/papers/2211.04974,,,,2211.04974,2,0
 Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression,"Mo Zhou, Rong Ge",http://arxiv.org/abs/2302.00257,,https://huggingface.co/papers/2302.00257,,,,2302.00257,2,0
 Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time,"Toon Vanderschueren, Alicia Curth, Wouter Verbeke, Mihaela van der Schaar",http://arxiv.org/abs/2306.04255,,https://huggingface.co/papers/2306.04255,,,,2306.04255,4,0
 AudioLDM: Text-to-Audio Generation with Latent Diffusion Models,"Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley",http://arxiv.org/abs/2301.12503,,https://huggingface.co/papers/2301.12503,,,,2301.12503,8,1
 Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature,"Khang Nguyen, Nong Hieu, Vinh NGUYEN, Nhat Ho, Stanley Osher, TAN NGUYEN",http://arxiv.org/abs/2211.15779,,https://huggingface.co/papers/2211.15779,,,,2211.15779,6,1
+Lifelong Language Pretraining with Distribution-Specialized Experts,"Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cui",http://arxiv.org/abs/2305.12281,,https://huggingface.co/papers/2305.12281,,,,2305.12281,7,1
 Delay-agnostic Asynchronous Coordinate Update Algorithm,"Xuyang Wu, Changxin Liu, Sindri Magnússon, Mikael Johansson",http://arxiv.org/abs/2305.08535,,https://huggingface.co/papers/2305.08535,,,,2305.08535,4,1
 Prototype-oriented unsupervised anomaly detection for multivariate time series,"yuxin li, Wenchao Chen, Bo Chen, Dongsheng Wang, Long Tian, Mingyuan Zhou",,,,,,,,,
 ClimaX: A foundation model for weather and climate,"Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover",http://arxiv.org/abs/2301.10343,,https://huggingface.co/papers/2301.10343,,,,2301.10343,5,1
 Conformal Prediction Sets for Graph Neural Networks,"Soroush H. Zargarbashi, Simone Antonelli, Aleksandar Bojchevski",,,,,,,,,
 Probabilistic Attention-to-Influence Neural Models for Event Sequences,"Xiao Shou, DEBARUN BHATTACHARJYA, Tian Gao, Dharmashankar Subramanian, Oktie Hassanzadeh, Kristin Bennett",,,,,,,,,
 Nearly-tight Bounds for Deep Kernel Learning,"Yi-Fan Zhang, Min-Ling Zhang",,,,,,,,,
+Generalized Disparate Impact for Configurable Fairness Solutions in ML,"Luca Giuliani, Eleonora Misino, Michele Lombardi",http://arxiv.org/abs/2305.18504,,https://huggingface.co/papers/2305.18504,,,,2305.18504,3,1
 Thompson Sampling with Less Exploration is Fast and Optimal,"Tianyuan Jin, XIANGLIN YANG, Xiaokui Xiao, Pan Xu",,,,,,,,,
 Do Machine Learning Models Learn Statistical Rules Inferred from Data?,"Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong",http://arxiv.org/abs/2303.01433,https://github.com/DebugML/sqrl,https://huggingface.co/papers/2303.01433,,,,2303.01433,4,1
 Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations,"Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao",,,,,,,,,
 GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,1
 Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
 Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
+SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
 Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,,
 Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1
 A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,0
 Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
 Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime,"Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar",http://arxiv.org/abs/2302.14154,,https://huggingface.co/papers/2302.14154,,,,2302.14154,4,0
 Double-Weighting for Covariate Shift Adaptation,"José I. Segovia-Martín, Santiago Mazuelas, Anqi Liu",http://arxiv.org/abs/2305.08637,,https://huggingface.co/papers/2305.08637,,,,2305.08637,3,0
+Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints,"Donghao Li, Ruiquan Huang, Cong Shen, Jing Yang",http://arxiv.org/abs/2306.06265,,https://huggingface.co/papers/2306.06265,,,,2306.06265,4,1
 PASTA: Pessimistic Assortment Optimization,"Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan Fang, Vahid Tarokh",http://arxiv.org/abs/2302.03821,,https://huggingface.co/papers/2302.03821,,,,2302.03821,6,0
 Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D,"Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Wei-Ying Ma, Yanyan Lan",,,,,,,,,
 Off-Policy Average Reward Actor-Critic with Deterministic Policy Search,"Naman Saxena, Subhojyoti Khastagir, Shishir Nadubettu Yadukumar, Shalabh Bhatnagar",http://arxiv.org/abs/2305.12239,,https://huggingface.co/papers/2305.12239,,,,2305.12239,4,0
 Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model,"Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang",http://arxiv.org/abs/2303.08613,,https://huggingface.co/papers/2303.08613,,,,2303.08613,4,1
 SLAMB: Accelerated Large Batch Training with Sparse Communication,"Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, TingWen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis",,,,,,,,,
 Efficient Quantum Algorithms for Quantum Optimal Control,"Xiantao Li, Chunhao Wang",http://arxiv.org/abs/2304.02613,,https://huggingface.co/papers/2304.02613,,,,2304.02613,2,0
+Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation,"Aditya Mate, Bryan Wilder, Aparna Taneja, Milind Tambe",http://arxiv.org/abs/2302.02570,,https://huggingface.co/papers/2302.02570,,,,2302.02570,4,1
 Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization,"Jian Cao, Myeongjong Kang, Felix Jimenez, Huiyan Sang, Florian Schaefer, Matthias Katzfuss",http://arxiv.org/abs/2301.13303,,https://huggingface.co/papers/2301.13303,,,,2301.13303,6,0
 Efficient exploration via epistemic-risk-seeking policy gradients,Brendan O'Donoghue,,,,,,,,,
 Probing the Deep Neural Manifold of Reinforcement Learning to Expose Volatility,"Ezgi Korkmaz, Jonah Brown-Cohen",,,,,,,,,
 Cut your Losses with Squentropy,"Like Hui, Misha Belkin, Stephen Wright",http://arxiv.org/abs/2302.03952,,https://huggingface.co/papers/2302.03952,,,,2302.03952,3,0
 Multi-Agent Learning from Learners,"MINE M CALISKAN, Francesco Chini, Setareh Maghsudi",,,,,,,,,
 Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning,"Matthias Gerstgrasser, David Parkes",,,,,,,,,
+Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees,"Faisal Hamman, Erfaun Noorani, Saumitra Mishra, Daniele Magazzeni, Sanghamitra Dutta",http://arxiv.org/abs/2305.11997,,https://huggingface.co/papers/2305.11997,,,,2305.11997,5,1
 Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables,"Rick Wilming, Leo Kieslich, Benedict Clark, Stefan Haufe",http://arxiv.org/abs/2306.01464,,https://huggingface.co/papers/2306.01464,,,,2306.01464,4,1
 When do Minimax-fair Learning and Empirical Risk Minimization Coincide?,"Harvineet Singh, Matthäus Kleindessner, Volkan Cevher, Rumi Chunara, Chris Russell",,,,,,,,,
 Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows ,"Phillip Si, Zeyi Chen, Subham S Sahoo, Yair Schiff, Volodymyr Kuleshov",,,,,,,,,
 Bayes-optimal Learning of Deep Random Networks of Extensive-width,"Hugo Cui, FLORENT KRZAKALA, Lenka Zdeborova",,,,,,,,,
 Adapting to game trees in zero-sum imperfect information games,"Côme Fiegel, Pierre Menard, Tadashi Kozuno, Remi Munos, Vianney Perchet, Michal Valko",http://arxiv.org/abs/2212.12567,,https://huggingface.co/papers/2212.12567,,,,2212.12567,6,0
 Adversarial Policies Beat Superhuman Go AIs,"Tony Wang, Adam Gleave, Tom Tseng, Nora Belrose, Kellin Pelrine, Joseph Miller, Michael Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell",http://arxiv.org/abs/2211.00241,,https://huggingface.co/papers/2211.00241,,,,2211.00241,11,1
+Pretraining Language Models with Human Preferences,"Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher Buckley, Jason Phang, Samuel Bowman, Ethan Perez",http://arxiv.org/abs/2302.08582,,https://huggingface.co/papers/2302.08582,,,,2302.08582,8,2
 Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples,"Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui XUE, Ruhui Ma, Haibing Guan",http://arxiv.org/abs/2302.04578,https://github.com/mist-project/mist.git,https://huggingface.co/papers/2302.04578,,,,2302.04578,9,0
 A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs,"Mikael Henaff, Minqi Jiang, Roberta Raileanu",http://arxiv.org/abs/2306.03236,,https://huggingface.co/papers/2306.03236,,,,2306.03236,3,0
 Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies,"Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai",http://arxiv.org/abs/2208.10264,,https://huggingface.co/papers/2208.10264,,,,2208.10264,3,0
 Towards Theoretical Understanding of Inverse Reinforcement Learning,"Alberto Maria Metelli, Filippo Lazzati, Marcello Restelli",http://arxiv.org/abs/2304.12966,,https://huggingface.co/papers/2304.12966,,,,2304.12966,3,0
 Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond,"Jaeyoung Cha, Jaewook Lee, Chulhee Yun",http://arxiv.org/abs/2303.07160,,https://huggingface.co/papers/2303.07160,,,,2303.07160,3,0
 Delayed Feedback in Kernel Bandits,"Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke",http://arxiv.org/abs/2302.00392,,https://huggingface.co/papers/2302.00392,,,,2302.00392,4,0
+Sharper Bounds for $\ell_p$ Sensitivity Sampling,"David Woodruff, Taisuke Yasuda",http://arxiv.org/abs/2306.00732,,https://huggingface.co/papers/2306.00732,,,,2306.00732,2,1
 Hyena Hierarchy: Towards Larger Convolutional Language Models,"Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Re",http://arxiv.org/abs/2302.10866,,https://huggingface.co/papers/2302.10866,,,,2302.10866,9,0
 Delving into Noisy Label Detection with Clean Data,"Chenglin Yu, Xinsong Ma, Weiwei Liu",,,,,,,,,
 GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration,"Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",http://arxiv.org/abs/2301.12686,,https://huggingface.co/papers/2301.12686,,,,2301.12686,7,1
 Towards Omni-generalizable Neural Methods for Vehicle Routing Problems,"Jianan Zhou, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang",http://arxiv.org/abs/2305.19587,https://github.com/RoyalSkye/Omni-VRP,https://huggingface.co/papers/2305.19587,,,,2305.19587,5,0
 Protecting Language Generation Models via Invisible Watermarking,"Xuandong Zhao, Yu-Xiang Wang, Lei Li",http://arxiv.org/abs/2302.03162,,https://huggingface.co/papers/2302.03162,,,,2302.03162,3,0
 Global Optimization with Parametric Function Approximation,"Chong Liu, Yu-Xiang Wang",http://arxiv.org/abs/2211.09100,,https://huggingface.co/papers/2211.09100,,,,2211.09100,2,0
+Non-stationary Reinforcement Learning under General Function Approximation,"Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin LIANG",http://arxiv.org/abs/2306.00861,,https://huggingface.co/papers/2306.00861,,,,2306.00861,6,1
 Demystifying Disagreement-on-the-Line in High Dimensions,"Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed Hassani",http://arxiv.org/abs/2301.13371,,https://huggingface.co/papers/2301.13371,,,,2301.13371,5,0
 Multisample Flow Matching: Straightening Flows with Minibatch Couplings,"Aram-Alexandre Pooladian, Heli Ben-Hamu, Carles Domingo i Enrich, Brandon Amos, Yaron Lipman, Ricky T. Q. Chen",http://arxiv.org/abs/2304.14772,,https://huggingface.co/papers/2304.14772,,,,2304.14772,6,1
 Competitive Gradient Optimization,"Abhijeet Vyas, Brian Bullins, Kamyar Azizzadenesheli",http://arxiv.org/abs/2205.14232,,https://huggingface.co/papers/2205.14232,,,,2205.14232,2,0
 LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian Walder, Spiridon Penev, Richard Nock",http://arxiv.org/abs/2301.11695,,https://huggingface.co/papers/2301.11695,,,,2301.11695,4,0
 R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
 High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
+COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
 Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,3
 HyperTuning:  Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
 Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,0
 Quantile Credit Assignment,"Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Remi Munos",,,,,,,,,
 Understanding Self-Predictive Learning for Reinforcement Learning,"Yunhao Tang, Zhaohan Guo, Pierre Richemond, Bernardo Avila Pires, Yash Chandak, Remi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, Andras Gyorgy, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko",http://arxiv.org/abs/2212.03319,,https://huggingface.co/papers/2212.03319,,,,2212.03319,16,0
 Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning,"Brett Daley, Martha White, Christopher Amato, Marlos C. Machado",http://arxiv.org/abs/2301.11321,,https://huggingface.co/papers/2301.11321,,,,2301.11321,4,0
+"For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal","Yingdong Hu, Renhao Wang, Li Li, Yang Gao",http://arxiv.org/abs/2304.04591,,https://huggingface.co/papers/2304.04591,,,,2304.04591,4,1
 Weakly Supervised Regression with Interval Targets,"Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, LEI FENG",,,,,,,,,
 Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents,"WENHAO XU, Xuefeng Gao, Xuedong He",http://arxiv.org/abs/2301.12601,,https://huggingface.co/papers/2301.12601,,,,2301.12601,3,0
 Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity,"Xuxing Chen, Minhui Huang, Shiqian Ma, Krishna Balasubramanian",http://arxiv.org/abs/2210.12839,,https://huggingface.co/papers/2210.12839,,,,2210.12839,4,0
 New metrics and search algorithms for weighted causal DAGs,"Davin Choo, Kirankumar Shiragur",http://arxiv.org/abs/2305.04445,,https://huggingface.co/papers/2305.04445,,,,2305.04445,2,0
 CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms,"Shengyi Huang, Rousslan Fernand Julien Dossa, Chang Ye, Jeff Braga, Dipam Chakraborty, Kinal Mehta, João Madeira Araujo",http://arxiv.org/abs/2111.08819,https://github.com/vwxyzjn/cleanrl,https://huggingface.co/papers/2111.08819,,,,2111.08819,4,1
 Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning,"Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Khan Emtiyaz, Mark Schmidt",http://arxiv.org/abs/2302.09738,https://github.com/yorkerlin/StructuredNGD-DL,https://huggingface.co/papers/2302.09738,,,,2302.09738,6,0
+Polarity Is All You Need to Learn and Transfer Faster,"Alice (Qingyang) Wang, Michael Powell, Eric Bridgeford, Ali Geisa, Joshua Vogelstein",http://arxiv.org/abs/2303.17589,,https://huggingface.co/papers/2303.17589,,,,2303.17589,5,1
 Scaling Vision Transformers to 22 Billion Parameters,"Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Collier, Alexey Gritsenko, Vighnesh N Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetic, Dustin Tran, Thomas Kipf, Mario Lucic, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby",http://arxiv.org/abs/2302.05442,,https://huggingface.co/papers/2302.05442,,,,2302.05442,22,0
 Toward Fair and Robust Estimation of Optimal Treatment Regimes,"Kwangho Kim, Jose Zubizarreta",,,,,,,,,
 Internally Rewarded Reinforcement Learning,"Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter",http://arxiv.org/abs/2302.00270,,https://huggingface.co/papers/2302.00270,,,,2302.00270,5,0
 Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss,"Pierre Bréchet, Katerina Papagiannouli, Jing An, Guido Montufar",http://arxiv.org/abs/2303.03027,,https://huggingface.co/papers/2303.03027,,,,2303.03027,4,0
 Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach,"Yanwei Jia, Xun Yu Zhou",http://arxiv.org/abs/2108.06655,,https://huggingface.co/papers/2108.06655,,,,2108.06655,2,0
 VIMA: Robot Manipulation with Multimodal Prompts,"Yunfan Jiang, Agrim Gupta, Zichen Zhang, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei, Anima Anandkumar, Yuke Zhu, Jim Fan",,,,,,,,,
+StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes,"Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry, Sayan Ranu, N M Anoop Krishnan",http://arxiv.org/abs/2301.12477,,https://huggingface.co/papers/2301.12477,,,,2301.12477,5,1
 Multi-agent Online Scheduling: MMS Allocations for Indivisible Items,"Shengwei Zhou, Rufan Bai, Xiaowei Wu",http://arxiv.org/abs/2304.13405,,https://huggingface.co/papers/2304.13405,,,,2304.13405,3,0
 Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries,"Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Solja\v{c}i\'{c}, Akash Srivastava",http://arxiv.org/abs/2303.02484,,https://huggingface.co/papers/2303.02484,,,,2303.02484,8,0
 NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation,"Jianfeng Wang, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Thomas Lukasiewicz",,,,,,,,,
 Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process,"Yidong Ouyang, Liyan Xie, Guang Cheng",,,,,,,,,
 MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks,"Wenfang Sun, Yingjun Du, Xiantong Zhen, Fan Wang, Ling Wang, Cees Snoek",http://arxiv.org/abs/2305.10309,,https://huggingface.co/papers/2305.10309,,,,2305.10309,6,0
 Provable Dynamic Fusion for Low-Quality Multimodal Data,"qingyang zhang, Haitao Wu, Changqing Zhang, Qinghua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng",http://arxiv.org/abs/2306.02050,,https://huggingface.co/papers/2306.02050,,,,2306.02050,7,0
+Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering,"Erlin Pan, zhao kang",http://arxiv.org/abs/2305.02931,,https://huggingface.co/papers/2305.02931,,,,2305.02931,2,1
 SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation,"Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li",http://arxiv.org/abs/2211.14813,https://github.com/ArrowLuo/SegCLIP,https://huggingface.co/papers/2211.14813,,,,2211.14813,5,0
 Explainability as statistical inference,"Hugo Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei",http://arxiv.org/abs/2212.03131,,https://huggingface.co/papers/2212.03131,,,,2212.03131,4,0
 Learning Prescriptive ReLU Networks,"Wei Sun, Asterios Tsiourvas",http://arxiv.org/abs/2306.00651,,https://huggingface.co/papers/2306.00651,,,,2306.00651,2,0
 Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions,"Lin-Han Jia, Lan-Zhe Guo, Zhi Zhou, Jie-Jing Shao, Yuke Xiang, Yu-Feng Li",,,,,,,,,
 Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels,"Simone Bombari, Shayan Kiyani, Marco Mondelli",http://arxiv.org/abs/2302.01629,,https://huggingface.co/papers/2302.01629,,,,2302.01629,3,0
+Human-Timescale Adaptation in an Open-Ended Task Space,"Jakob Bauer, Kate Baumli, Feryal Behbahani, Avishkar Bhoopchand, Natalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Satinder Singh, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang",http://arxiv.org/abs/2301.07608,,https://huggingface.co/papers/2301.07608,,,,2301.07608,22,1
 Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Linear Speedup and Partial Participation,"Xiaoyun Li, Ping Li",,,,,,,,,
 ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang",http://arxiv.org/abs/2301.12040,,https://huggingface.co/papers/2301.12040,,,,2301.12040,4,0
 Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
 Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
 Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,0
 Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
+Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,1
 Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
 Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice,"Yishay Mansour, Richard Nock, Robert C. Williamson",,,,,,,,,
 "Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods","Aleksandr Shevchenko, Kevin Kögler, Hamed Hassani, Marco Mondelli",http://arxiv.org/abs/2212.13468,,https://huggingface.co/papers/2212.13468,,,,2212.13468,4,0
 HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption,"Seewoo Lee, Garam Lee, Jung Woo Kim, Junbum Shin, Mun-Kyu Lee",,,,,,,,,
 Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations,David Wipf,,,,,,,,,
 Direct Parameterization of Lipschitz-Bounded Deep Networks,"Ruigang Wang, Ian Manchester",http://arxiv.org/abs/2301.11526,https://github.com/acfr/LBDN,https://huggingface.co/papers/2301.11526,,,,2301.11526,2,0
+XAI Beyond Classification: Interpretable Neural Clustering,"Xi Peng, Yunfan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou",http://arxiv.org/abs/1808.07292,,https://huggingface.co/papers/1808.07292,,,,1808.07292,6,1
 Exploiting locality in high-dimensional Factorial hidden Markov models,"Lorenzo Rimella, Nick Whiteley",http://arxiv.org/abs/1902.01639,,https://huggingface.co/papers/1902.01639,,,,1902.01639,2,0
 Mitigating the Effects of Non-Identifiability on Inference for Bayesian Neural Networks with Latent Variables,"Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez",http://arxiv.org/abs/1911.00569,,https://huggingface.co/papers/1911.00569,,,,1911.00569,3,0
 Project and Forget: Solving Large-Scale Metric Constrained Problems,"Rishi Sonthalia, Anna C. Gilbert",http://arxiv.org/abs/2005.03853,,https://huggingface.co/papers/2005.03853,,,,2005.03853,2,0
 "Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence","Julie Nutini, Issam Laradji, Mark Schmidt",http://arxiv.org/abs/1712.08859,,https://huggingface.co/papers/1712.08859,,,,1712.08859,3,0
+Cluster-Specific Predictions with Multi-Task Gaussian Processes,"Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey",http://arxiv.org/abs/2011.07866,,https://huggingface.co/papers/2011.07866,,,,2011.07866,4,1
 Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials,"Daiqi Gao, Yufeng Liu, Donglin Zeng",,,,,,,,,
 Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks,"Aleksandr Shevchenko, Vyacheslav Kungurtsev, Marco Mondelli",http://arxiv.org/abs/2111.02278,,https://huggingface.co/papers/2111.02278,,,,2111.02278,3,0
 "Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism","Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos",http://arxiv.org/abs/2012.11579,,https://huggingface.co/papers/2012.11579,,,,2012.11579,4,0
 Deep linear networks can benignly overfit when shallow ones do,"Niladri S. Chatterji, Phil Long",http://arxiv.org/abs/2209.09315,,https://huggingface.co/papers/2209.09315,,,,2209.09315,2,0
 Taming graph kernels with random features,Krzysztof Choromanski,http://arxiv.org/abs/2305.00156,,https://huggingface.co/papers/2305.00156,,,,2305.00156,1,0
 On Uni-Modal Feature Learning in Supervised Multi-Modal Learning,"Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, Hang Zhao",http://arxiv.org/abs/2305.01233,,https://huggingface.co/papers/2305.01233,,,,2305.01233,8,0
+CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations,"Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon",http://arxiv.org/abs/2305.01118,,https://huggingface.co/papers/2305.01118,,,,2305.01118,5,2
 CLIPood: Generalizing CLIP to Out-of-Distributions,"Yang Shu, Xingzhuo Guo, Jialong Wu, Ximei Wang, Jianmin Wang, Mingsheng Long",http://arxiv.org/abs/2302.00864,,https://huggingface.co/papers/2302.00864,,,,2302.00864,6,0
 Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning,"Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, Tarek Abdelzaher, Jiawei Han",http://arxiv.org/abs/2211.03044,,https://huggingface.co/papers/2211.03044,,,,2211.03044,6,1
 Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization,"Jiwoo Son, Minsu Kim, Hyeonah Kim, Jinkyoo Park",,,,,,,,,
 Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification,"Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan",,,,,,,,,
 Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least,"Siddharth Joshi, Baharan Mirzasoleiman",http://arxiv.org/abs/2302.09195,,https://huggingface.co/papers/2302.09195,,,,2302.09195,2,0
 Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs,"Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-yi Lee, Shao-Hua Sun",http://arxiv.org/abs/2301.12950,,https://huggingface.co/papers/2301.12950,,,,2301.12950,5,0
+Cooperative Open-ended Learning Framework for Zero-Shot Coordination,"Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan",http://arxiv.org/abs/2302.04831,,https://huggingface.co/papers/2302.04831,,,,2302.04831,7,1
 CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design,"Desi Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster",,,,,,,,,
 On the Identifiability and Estimation of Causal Location-Scale Noise Models,"Alexander Immer, Christoph Schultheiss, Julia Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx",http://arxiv.org/abs/2210.09054,,https://huggingface.co/papers/2210.09054,,,,2210.09054,6,0
 From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders,"Raanan Yehezkel Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik",http://arxiv.org/abs/2306.00624,,https://huggingface.co/papers/2306.00624,,,,2306.00624,4,0
 Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions,"Leo Klarner, Tim G. J. Rudner, Michael Reutlinger, Torsten Schindler, Garrett Morris, Charlotte Deane, Yee-Whye Teh",,,,,,,,,
 SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series,"Iris Huijben, Arthur A. Nijdam, Sebastiaan Overeem, Merel Van Gilst, Ruud J. G. van Sloun",,,,,,,,,
 Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction,"Minghao Guo, Veronika Thost, Samuel Song, Adithya Balachandran, Payel Das, Jie Chen, Wojciech Matusik",,,,,,,,,
+A Closer Look at the Intervention Procedure of Concept Bottleneck Models,"Sungbin Shin, Yohan Jo, Sungsoo Ahn, Namhoon Lee",http://arxiv.org/abs/2302.14260,,https://huggingface.co/papers/2302.14260,,,,2302.14260,4,1
+Simple Hardware-Efficient Long Convolutions for Sequence Modeling,"Daniel Y Fu, Elliot L Epstein, Eric Nguyen, Michael Zhang, Tri Dao, Atri Rudra, Christopher Re",http://arxiv.org/abs/2302.06646,,https://huggingface.co/papers/2302.06646,,,,2302.06646,8,1
 Towards Controlled Data Augmentations for Active Learning,"Jianan Yang, Jianan Yang, Haobo Wang, Sai Wu, Gang Chen, Junbo Zhao",,,,,,,,,
 "Bigger, Better, Faster: Human-level Atari with human-level efficiency","Max Schwarzer, Johan Obando Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro",http://arxiv.org/abs/2305.19452,https://github.com/google-research/google-research/tree/master/bigger_better_faster,https://huggingface.co/papers/2305.19452,,,,2305.19452,6,3
 A Law of Robustness beyond Isoperimetry,"Yihan Wu, Heng Huang, Hongyang Zhang",http://arxiv.org/abs/2202.11592,,https://huggingface.co/papers/2202.11592,,,,2202.11592,3,0
 Guiding Pretraining in Reinforcement Learning with Large Language Models,"Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas",http://arxiv.org/abs/2302.06692,,https://huggingface.co/papers/2302.06692,,,,2302.06692,8,0
 PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient,"Kaixin Wang, Zhou Daquan, Jiashi Feng, Shie Mannor",,,,,,,,,
 Differentially Private Sharpness-Aware Training,"Jinseong Park, Hoki Kim, Yujin Choi, Jaewook Lee",http://arxiv.org/abs/2306.05651,https://github.com/jinseongP/DPSAT,https://huggingface.co/papers/2306.05651,,,,2306.05651,4,0
+Provably and Practically Efficient Neural Contextual Bandits,Sudeep Salgia,http://arxiv.org/abs/2206.00099,,https://huggingface.co/papers/2206.00099,,,,2206.00099,3,1
 How Does Information Bottleneck Help Deep Learning?,"Kenji Kawaguchi, Zhun Deng, Xu Ji, Jiaoyang Huang",http://arxiv.org/abs/2305.18887,https://github.com/xu-ji/information-bottleneck,https://huggingface.co/papers/2305.18887,,,,2305.18887,4,0
 Why Is Public Pretraining Necessary for Private Model Training?,"Arun Ganesh, Mahdi Haghifam, Milad Nasresfahani, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Guha Thakurta, Lun Wang",http://arxiv.org/abs/2302.09483,,https://huggingface.co/papers/2302.09483,,,,2302.09483,8,0
 Learning Instance-Specific Augmentations by Capturing Local Invariances,"Ning Miao, Tom Rainforth, Emile Mathieu, Yann Dubois, Yee-Whye Teh, Adam Foster, Hyunjik Kim",http://arxiv.org/abs/2206.00051,,https://huggingface.co/papers/2206.00051,,,,2206.00051,7,0
 Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability,"Jianing Zhu, Hengzhuang Li, Jiangchao Yao, Tongliang Liu, Jianliang Xu, Bo Han",http://arxiv.org/abs/2306.03715,https://github.com/tmlr-group/Unleashing-Mask,https://huggingface.co/papers/2306.03715,,,,2306.03715,6,0
 Conditional Graph Information Bottleneck for Molecular Relational Learning,"Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park",http://arxiv.org/abs/2305.01520,https://github.com/Namkyeong/CGIB,https://huggingface.co/papers/2305.01520,,,,2305.01520,6,0
 Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang",http://arxiv.org/abs/2305.14876,https://github.com/bboylyg/RNP,https://huggingface.co/papers/2305.14876,,,,2305.14876,7,0
+Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1
 Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
 CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
 Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,0
 Bandit Online Linear Optimization with Hints and Queries,"Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit",,,,,,,,,
 Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective,"Tanya Marwah, Zachary Lipton, Jianfeng Lu, Andrej Risteski",http://arxiv.org/abs/2210.12101,,https://huggingface.co/papers/2210.12101,,,,2210.12101,4,0
 Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise,"Shiwei Zeng, Jie Shen",http://arxiv.org/abs/2306.00673,,https://huggingface.co/papers/2306.00673,,,,2306.00673,2,0
+Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes,"seyed amir saberi, Amir Najafi, Abolfazl Motahari, Babak Khalaj",http://arxiv.org/abs/2209.05953,,https://huggingface.co/papers/2209.05953,,,,2209.05953,4,1
 "Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps","Marco Cuturi, Michal Klein, Pierre Ablin",http://arxiv.org/abs/2302.04065,,https://huggingface.co/papers/2302.04065,,,,2302.04065,3,0
 Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance,"Zhao Song, Xin Yang, Yuanyuan Yang, Lichen Zhang",http://arxiv.org/abs/2210.11542,,https://huggingface.co/papers/2210.11542,,,,2210.11542,4,0
 Combinatorial Neural Bandits,"Taehyun Hwang, Kyuwook Chai, Min-hwan Oh",http://arxiv.org/abs/2306.00242,,https://huggingface.co/papers/2306.00242,,,,2306.00242,3,0
 Reward-Mixing MDPs with Few Contexts are Learnable,"Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor",,,,,,,,,
+Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling,"Adam Bouland, Yosheb Getachew, Yujia Jin, Aaron Sidford, Kevin Tian",http://arxiv.org/abs/2301.03763,,https://huggingface.co/papers/2301.03763,,,,2301.03763,5,1
 Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits,Chen Wang,http://arxiv.org/abs/2306.02208,,https://huggingface.co/papers/2306.02208,,,,2306.02208,1,0
 Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation,"Li'ang Li, Yifei duan, Guanghua Ji, Yongqiang Cai",http://arxiv.org/abs/2305.18460,,https://huggingface.co/papers/2305.18460,,,,2305.18460,4,0
 Dynamical Linear Bandits,"Marco Mussi, Alberto Maria Metelli, Marcello Restelli",http://arxiv.org/abs/2211.08997,,https://huggingface.co/papers/2211.08997,,,,2211.08997,3,0
 Neural networks trained with SGD learn distributions of increasing complexity,"Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt",http://arxiv.org/abs/2211.11567,,https://huggingface.co/papers/2211.11567,,,,2211.11567,3,0
 Scaling Laws for Multilingual Neural Machine Translation,"Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia, Markus Freitag, Orhan Firat",http://arxiv.org/abs/2302.09650,,https://huggingface.co/papers/2302.09650,,,,2302.09650,5,0
 Explaining the effects of non-convergent MCMC in the training of Energy-Based Models,"Elisabeth Agoritsas, Giovanni Catania, Aurélien Decelle, Beatriz Seoane",,,,,,,,,
+A Three-regime Model of Network Pruning,"Yefan Zhou, Yaoqing Yang, Arin Chang, Michael Mahoney",http://arxiv.org/abs/2305.18383,,https://huggingface.co/papers/2305.18383,,,,2305.18383,4,1
 Metagenomic Binning using Connectivity-constrained Variational Autoencoders,"Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas D. Nielsen",,,,,,,,,
 SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning,"Dongseok Shim, Seungjae Lee, H. Kim",http://arxiv.org/abs/2301.11520,,https://huggingface.co/papers/2301.11520,,,,2301.11520,3,0
 Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation,"Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, Siu Ming Yiu, Ruihua Han",,,,,,,,,
 Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
 On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
 Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,0
+A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,1
 Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
 Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,0
 On the Robustness of Randomized Ensembles to Adversarial Perturbations,"Hassan Dbouk, Naresh Shanbhag",http://arxiv.org/abs/2302.01375,https://github.com/hsndbk4/BARRE,https://huggingface.co/papers/2302.01375,,,,2302.01375,2,0
 On the Complexity of Bayesian Generalization,"Yu-Zhe Shi, Manjie Xu, John Hopcroft, Kun He, Josh Tenenbaum, Song-Chun Zhu, Ying Nian Wu, Wenjuan Han, Yixin Zhu",http://arxiv.org/abs/2211.11033,,https://huggingface.co/papers/2211.11033,,,,2211.11033,9,0
 QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark,"Xudong Lu, Kaisen Pan, Ge Yan, Jiaming Shan, Wenjie Wu, Junchi Yan",,,,,,,,,
 Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits,Markus Bläser,,,,,,,,,
+PAL: Program-aided Language Models,"Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig",http://arxiv.org/abs/2211.10435,,https://huggingface.co/papers/2211.10435,,,,2211.10435,8,2
 Tighter Bounds on the Expressivity of Transformer Encoders,"David Chiang, Peter Cholak, Anand Pillay",http://arxiv.org/abs/2301.10743,,https://huggingface.co/papers/2301.10743,,,,2301.10743,3,0
 Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation,"Joonhyuk Yang, Shin Dongpil, Hye Won Chung",http://arxiv.org/abs/2305.19666,,https://huggingface.co/papers/2305.19666,,,,2305.19666,3,0
 Causal Discovery with Latent Confounders Based on Higher-Order Cumulants,"Ruichu Cai, Zhiyi Huang, Wei Chen, Zhifeng Hao, Kun Zhang",http://arxiv.org/abs/2305.19582,,https://huggingface.co/papers/2305.19582,,,,2305.19582,5,0
 Learning Representations without Compositional Assumptions,"Tennison Liu, Jeroen Berrevoets, Zhaozhi Qian, Mihaela van der Schaar",http://arxiv.org/abs/2305.19726,,https://huggingface.co/papers/2305.19726,,,,2305.19726,4,0
 Making Transformers Compute-lite for CPU inference,"Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh",,,,,,,,,
 Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers,"Grant Strimel, Yi Xie, Brian King, martin radfar, Ariya Rastrow, Athanasios Mouchtaris",http://arxiv.org/abs/2305.04159,,https://huggingface.co/papers/2305.04159,,,,2305.04159,6,0
+Expected Gradients of Maxout Networks and Consequences to Parameter Initialization,"Hanna Tseran, Guido Montufar",http://arxiv.org/abs/2301.06956,,https://huggingface.co/papers/2301.06956,,,,2301.06956,2,1
 Competing for Shareable Arms in Multi-Player Multi-Armed Bandits,"Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui",http://arxiv.org/abs/2305.19158,,https://huggingface.co/papers/2305.19158,,,,2305.19158,5,1
 Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs,"Raif Rustamov, Subhabrata Majumdar",http://arxiv.org/abs/2010.15285,,https://huggingface.co/papers/2010.15285,,,,2010.15285,2,1
 Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets,"Yurong Chen, Zhaohua Chen, Xiaotie Deng, Zhijian Duan, Haoran Sun, Qian Wang, Xiang Yan",http://arxiv.org/abs/2306.07709,,https://huggingface.co/papers/2306.07709,,,,2306.07709,7,0
 One-Step Estimator for Permuted Sparse Recovery,"Hang Zhang, Ping Li",,,,,,,,,
 Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator,Alexander Shekhovtsov,,,,,,,,,
 Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection,"Lorenzo Perini, Paul Buerkner, Arto Klami",http://arxiv.org/abs/2210.10487,,https://huggingface.co/papers/2210.10487,,,,2210.10487,3,0
+Image generation with shortest path diffusion,"Ayan Das, Ayan Das, Stathi Fotiadis, Anil Batra, Farhang Nabiei, FengTing Liao, Sattar Vakili, Da-shan Shiu, Alberto Bernacchia",http://arxiv.org/abs/2306.00501,,https://huggingface.co/papers/2306.00501,,,,2306.00501,8,2
 Deep Anomaly Detection under Labeling Budget Constraints,"Aodong Li, Chen Qiu, Padhraic Smyth, Marius Kloft, Stephan Mandt, Maja Rudolph",http://arxiv.org/abs/2302.07832,,https://huggingface.co/papers/2302.07832,,,,2302.07832,6,0
 Transformed Distribution Matching for Missing Value Imputation,"He Zhao, Ke Sun, Amir Dezfouli, Edwin V Bonilla",http://arxiv.org/abs/2302.10363,,https://huggingface.co/papers/2302.10363,,,,2302.10363,4,0
 Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?,"Ruisi Cai, Zhenyu Zhang, Zhangyang “Atlas” Wang",http://arxiv.org/abs/2302.12480,,https://huggingface.co/papers/2302.12480,,,,2302.12480,3,0
 Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models,"Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel",,,,,,,,,
+Better Diffusion Models Further Improve Adversarial Training,"Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng YAN",http://arxiv.org/abs/2302.04638,https://github.com/wzekai99/DM-Improves-AT,https://huggingface.co/papers/2302.04638,,,,2302.04638,6,1
 On the Expressive Power of Geometric Graph Neural Networks,"Chaitanya Joshi, Cristian Bodnar, Simon Mathis, Taco Cohen, Pietro Lió",http://arxiv.org/abs/2301.09308,https://github.com/chaitjo/geometric-gnn-dojo,https://huggingface.co/papers/2301.09308,,,,2301.09308,5,0
 Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapalli,http://arxiv.org/abs/2306.04004,,https://huggingface.co/papers/2306.04004,,,,2306.04004,1,1
 Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,0
 AbODE: Ab initio antibody design using conjoined ODEs,"Yogesh Verma, Markus Heinonen, Vikas K Garg",http://arxiv.org/abs/2306.01005,,https://huggingface.co/papers/2306.01005,,,,2306.01005,3,0
 Learning-augmented private algorithms for multiple quantile release,"Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii",http://arxiv.org/abs/2210.11222,,https://huggingface.co/papers/2210.11222,,,,2210.11222,4,0
 Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds,"Shengshi Li, Lin Yang",,,,,,,,,
+Variational Autoencoding Neural Operators,"Jacob H. Seidman, Georgios Kissas, George J. Pappas, Paris Perdikaris",http://arxiv.org/abs/2302.10351,,https://huggingface.co/papers/2302.10351,,,,2302.10351,4,1
 Efficient Parametric Approximations of Neural Network Function Space Distance,"Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse",http://arxiv.org/abs/2302.03519,,https://huggingface.co/papers/2302.03519,,,,2302.03519,4,0
 Theory on Forgetting and Generalization of Continual Learning,"Sen Lin, Peizhong Ju, Yingbin LIANG, Ness Shroff",http://arxiv.org/abs/2302.05836,,https://huggingface.co/papers/2302.05836,,,,2302.05836,4,0
 Trapdoor Normalization with Irreversible Ownership Verification,"Hanwen Liu, Zhenyu Weng, Yuesheng Zhu, Yadong Mu",,,,,,,,,