Commit e92c33d · 1 parent: 88971bc · committed by hysts (HF Staff)

Upload papers.csv with huggingface_hub

Files changed (1)
  1. papers.csv +34 -34
papers.csv CHANGED
@@ -172,7 +172,7 @@ Adaptive Identification of Populations with Treatment Benefit in Clinical Trials
172
  Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication,"Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Ding, Zhangyang “Atlas” Wang",,,,,,,,,
173
  A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification,"Jiachen Sun, Jiongxiao Wang, Weili Nie, Zhiding Yu, Zhuoqing Morley Mao, Chaowei Xiao",,,,,,,,,
174
  COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects,"Anlan Yu, Ning Lyu, Jieming Yin, Zhiyuan Yan, Wujie Wen",,,,,,,,,
175
- A Closer Look at Self-Supervised Lightweight Vision Transformers,"Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu",http://arxiv.org/abs/2205.14443,https://github.com/wangsr126/mae-lite,https://huggingface.co/papers/2205.14443,,,,2205.14443,5,0
176
  Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space,"Anas Barakat, Ilyas Fatkhullin, Niao He",http://arxiv.org/abs/2306.01854,,https://huggingface.co/papers/2306.01854,,,,2306.01854,3,0
177
  Leveraging Offline Data in Online Reinforcement Learning,"Andrew Wagenmaker, Aldo Pacchiano",http://arxiv.org/abs/2211.04974,,https://huggingface.co/papers/2211.04974,,,,2211.04974,2,0
178
  Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression,"Mo Zhou, Rong Ge",http://arxiv.org/abs/2302.00257,,https://huggingface.co/papers/2302.00257,,,,2302.00257,2,0
@@ -492,7 +492,7 @@ A Picture of the Space of Typical Learnable Tasks,"Rahul Ramesh, Jialin Mao, Ita
492
  Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time,"Toon Vanderschueren, Alicia Curth, Wouter Verbeke, Mihaela van der Schaar",http://arxiv.org/abs/2306.04255,,https://huggingface.co/papers/2306.04255,,,,2306.04255,4,0
493
  AudioLDM: Text-to-Audio Generation with Latent Diffusion Models,"Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley",http://arxiv.org/abs/2301.12503,,https://huggingface.co/papers/2301.12503,,,,2301.12503,8,1
494
  Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature,"Khang Nguyen, Nong Hieu, Vinh NGUYEN, Nhat Ho, Stanley Osher, TAN NGUYEN",http://arxiv.org/abs/2211.15779,,https://huggingface.co/papers/2211.15779,,,,2211.15779,6,1
495
- Lifelong Language Pretraining with Distribution-Specialized Experts,"Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cui",http://arxiv.org/abs/2305.12281,,https://huggingface.co/papers/2305.12281,,,,2305.12281,7,0
496
  Delay-agnostic Asynchronous Coordinate Update Algorithm,"Xuyang Wu, Changxin Liu, Sindri Magnússon, Mikael Johansson",http://arxiv.org/abs/2305.08535,,https://huggingface.co/papers/2305.08535,,,,2305.08535,4,1
497
  Prototype-oriented unsupervised anomaly detection for multivariate time series,"yuxin li, Wenchao Chen, Bo Chen, Dongsheng Wang, Long Tian, Mingyuan Zhou",,,,,,,,,
498
  ClimaX: A foundation model for weather and climate,"Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover",http://arxiv.org/abs/2301.10343,,https://huggingface.co/papers/2301.10343,,,,2301.10343,5,1
@@ -597,7 +597,7 @@ Flash: Concept Drift Adaptation in Federated Learning,"Kunjal Panchal, Sunav Cho
597
  Conformal Prediction Sets for Graph Neural Networks,"Soroush H. Zargarbashi, Simone Antonelli, Aleksandar Bojchevski",,,,,,,,,
598
  Probabilistic Attention-to-Influence Neural Models for Event Sequences,"Xiao Shou, DEBARUN BHATTACHARJYA, Tian Gao, Dharmashankar Subramanian, Oktie Hassanzadeh, Kristin Bennett",,,,,,,,,
599
  Nearly-tight Bounds for Deep Kernel Learning,"Yi-Fan Zhang, Min-Ling Zhang",,,,,,,,,
600
- Generalized Disparate Impact for Configurable Fairness Solutions in ML,"Luca Giuliani, Eleonora Misino, Michele Lombardi",http://arxiv.org/abs/2305.18504,,https://huggingface.co/papers/2305.18504,,,,2305.18504,3,0
601
  Thompson Sampling with Less Exploration is Fast and Optimal,"Tianyuan Jin, XIANGLIN YANG, Xiaokui Xiao, Pan Xu",,,,,,,,,
602
  Do Machine Learning Models Learn Statistical Rules Inferred from Data?,"Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong",http://arxiv.org/abs/2303.01433,https://github.com/DebugML/sqrl,https://huggingface.co/papers/2303.01433,,,,2303.01433,4,1
603
  Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations,"Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao",,,,,,,,,
@@ -608,7 +608,7 @@ In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Bo
608
  GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,1
609
  Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
610
  Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
611
- SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,0
612
  Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,,
613
  Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1
614
  A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,0
@@ -730,7 +730,7 @@ Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeye
730
  Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
731
  Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime,"Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar",http://arxiv.org/abs/2302.14154,,https://huggingface.co/papers/2302.14154,,,,2302.14154,4,0
732
  Double-Weighting for Covariate Shift Adaptation,"José I. Segovia-Martín, Santiago Mazuelas, Anqi Liu",http://arxiv.org/abs/2305.08637,,https://huggingface.co/papers/2305.08637,,,,2305.08637,3,0
733
- Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints,"Donghao Li, Ruiquan Huang, Cong Shen, Jing Yang",http://arxiv.org/abs/2306.06265,,https://huggingface.co/papers/2306.06265,,,,2306.06265,4,0
734
  PASTA: Pessimistic Assortment Optimization,"Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan Fang, Vahid Tarokh",http://arxiv.org/abs/2302.03821,,https://huggingface.co/papers/2302.03821,,,,2302.03821,6,0
735
  Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D,"Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Wei-Ying Ma, Yanyan Lan",,,,,,,,,
736
  Off-Policy Average Reward Actor-Critic with Deterministic Policy Search,"Naman Saxena, Subhojyoti Khastagir, Shishir Nadubettu Yadukumar, Shalabh Bhatnagar",http://arxiv.org/abs/2305.12239,,https://huggingface.co/papers/2305.12239,,,,2305.12239,4,0
@@ -833,7 +833,7 @@ High Probability Convergence of Stochastic Gradient Methods ,"Zijian Liu, Ta Duy
833
  Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model,"Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang",http://arxiv.org/abs/2303.08613,,https://huggingface.co/papers/2303.08613,,,,2303.08613,4,1
834
  SLAMB: Accelerated Large Batch Training with Sparse Communication,"Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, TingWen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis",,,,,,,,,
835
  Efficient Quantum Algorithms for Quantum Optimal Control,"Xiantao Li, Chunhao Wang",http://arxiv.org/abs/2304.02613,,https://huggingface.co/papers/2304.02613,,,,2304.02613,2,0
836
- Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation,"Aditya Mate, Bryan Wilder, Aparna Taneja, Milind Tambe",http://arxiv.org/abs/2302.02570,,https://huggingface.co/papers/2302.02570,,,,2302.02570,4,0
837
  Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization,"Jian Cao, Myeongjong Kang, Felix Jimenez, Huiyan Sang, Florian Schaefer, Matthias Katzfuss",http://arxiv.org/abs/2301.13303,,https://huggingface.co/papers/2301.13303,,,,2301.13303,6,0
838
  Efficient exploration via epistemic-risk-seeking policy gradients,Brendan O'Donoghue,,,,,,,,,
839
  Probing the Deep Neural Manifold of Reinforcement Learning to Expose Volatility,"Ezgi Korkmaz, Jonah Brown-Cohen",,,,,,,,,
@@ -871,7 +871,7 @@ Characterizing Multicalibration via Property Elicitation,"Georgy Noarov, Aaron R
871
  Cut your Losses with Squentropy,"Like Hui, Misha Belkin, Stephen Wright",http://arxiv.org/abs/2302.03952,,https://huggingface.co/papers/2302.03952,,,,2302.03952,3,0
872
  Multi-Agent Learning from Learners,"MINE M CALISKAN, Francesco Chini, Setareh Maghsudi",,,,,,,,,
873
  Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning,"Matthias Gerstgrasser, David Parkes",,,,,,,,,
874
- Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees,"Faisal Hamman, Erfaun Noorani, Saumitra Mishra, Daniele Magazzeni, Sanghamitra Dutta",http://arxiv.org/abs/2305.11997,,https://huggingface.co/papers/2305.11997,,,,2305.11997,5,0
875
  Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables,"Rick Wilming, Leo Kieslich, Benedict Clark, Stefan Haufe",http://arxiv.org/abs/2306.01464,,https://huggingface.co/papers/2306.01464,,,,2306.01464,4,1
876
  When do Minimax-fair Learning and Empirical Risk Minimization Coincide?,"Harvineet Singh, Matthäus Kleindessner, Volkan Cevher, Rumi Chunara, Chris Russell",,,,,,,,,
877
  Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows ,"Phillip Si, Zeyi Chen, Subham S Sahoo, Yair Schiff, Volodymyr Kuleshov",,,,,,,,,
@@ -977,7 +977,7 @@ On the Statistical Benefits of Temporal Difference Learning,"David Cheikhi, Dani
977
  Bayes-optimal Learning of Deep Random Networks of Extensive-width,"Hugo Cui, FLORENT KRZAKALA, Lenka Zdeborova",,,,,,,,,
978
  Adapting to game trees in zero-sum imperfect information games,"Côme Fiegel, Pierre Menard, Tadashi Kozuno, Remi Munos, Vianney Perchet, Michal Valko",http://arxiv.org/abs/2212.12567,,https://huggingface.co/papers/2212.12567,,,,2212.12567,6,0
979
  Adversarial Policies Beat Superhuman Go AIs,"Tony Wang, Adam Gleave, Tom Tseng, Nora Belrose, Kellin Pelrine, Joseph Miller, Michael Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell",http://arxiv.org/abs/2211.00241,,https://huggingface.co/papers/2211.00241,,,,2211.00241,11,1
980
- Pretraining Language Models with Human Preferences,"Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher Buckley, Jason Phang, Samuel Bowman, Ethan Perez",http://arxiv.org/abs/2302.08582,,https://huggingface.co/papers/2302.08582,,,,2302.08582,8,1
981
  Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples,"Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui XUE, Ruhui Ma, Haibing Guan",http://arxiv.org/abs/2302.04578,https://github.com/mist-project/mist.git,https://huggingface.co/papers/2302.04578,,,,2302.04578,9,0
982
  A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs,"Mikael Henaff, Minqi Jiang, Roberta Raileanu",http://arxiv.org/abs/2306.03236,,https://huggingface.co/papers/2306.03236,,,,2306.03236,3,0
983
  Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies,"Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai",http://arxiv.org/abs/2208.10264,,https://huggingface.co/papers/2208.10264,,,,2208.10264,3,0
@@ -989,7 +989,7 @@ Buying Information for Stochastic Optimization,"Mingchen Ma, Christos Tzamos",ht
989
  Towards Theoretical Understanding of Inverse Reinforcement Learning,"Alberto Maria Metelli, Filippo Lazzati, Marcello Restelli",http://arxiv.org/abs/2304.12966,,https://huggingface.co/papers/2304.12966,,,,2304.12966,3,0
990
  Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond,"Jaeyoung Cha, Jaewook Lee, Chulhee Yun",http://arxiv.org/abs/2303.07160,,https://huggingface.co/papers/2303.07160,,,,2303.07160,3,0
991
  Delayed Feedback in Kernel Bandits,"Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke",http://arxiv.org/abs/2302.00392,,https://huggingface.co/papers/2302.00392,,,,2302.00392,4,0
992
- Sharper Bounds for $\ell_p$ Sensitivity Sampling,"David Woodruff, Taisuke Yasuda",http://arxiv.org/abs/2306.00732,,https://huggingface.co/papers/2306.00732,,,,2306.00732,2,0
993
  Hyena Hierarchy: Towards Larger Convolutional Language Models,"Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Re",http://arxiv.org/abs/2302.10866,,https://huggingface.co/papers/2302.10866,,,,2302.10866,9,0
994
  Delving into Noisy Label Detection with Clean Data,"Chenglin Yu, Xinsong Ma, Weiwei Liu",,,,,,,,,
995
  GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration,"Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",http://arxiv.org/abs/2301.12686,,https://huggingface.co/papers/2301.12686,,,,2301.12686,7,1
@@ -1056,7 +1056,7 @@ POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models,"Kor
1056
  Towards Omni-generalizable Neural Methods for Vehicle Routing Problems,"Jianan Zhou, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang",http://arxiv.org/abs/2305.19587,https://github.com/RoyalSkye/Omni-VRP,https://huggingface.co/papers/2305.19587,,,,2305.19587,5,0
1057
  Protecting Language Generation Models via Invisible Watermarking,"Xuandong Zhao, Yu-Xiang Wang, Lei Li",http://arxiv.org/abs/2302.03162,,https://huggingface.co/papers/2302.03162,,,,2302.03162,3,0
1058
  Global Optimization with Parametric Function Approximation,"Chong Liu, Yu-Xiang Wang",http://arxiv.org/abs/2211.09100,,https://huggingface.co/papers/2211.09100,,,,2211.09100,2,0
1059
- Non-stationary Reinforcement Learning under General Function Approximation,"Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin LIANG",http://arxiv.org/abs/2306.00861,,https://huggingface.co/papers/2306.00861,,,,2306.00861,6,0
1060
  Demystifying Disagreement-on-the-Line in High Dimensions,"Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed Hassani",http://arxiv.org/abs/2301.13371,,https://huggingface.co/papers/2301.13371,,,,2301.13371,5,0
1061
  Multisample Flow Matching: Straightening Flows with Minibatch Couplings,"Aram-Alexandre Pooladian, Heli Ben-Hamu, Carles Domingo i Enrich, Brandon Amos, Yaron Lipman, Ricky T. Q. Chen",http://arxiv.org/abs/2304.14772,,https://huggingface.co/papers/2304.14772,,,,2304.14772,6,1
1062
  Competitive Gradient Optimization,"Abhijeet Vyas, Brian Bullins, Kamyar Azizzadenesheli",http://arxiv.org/abs/2205.14232,,https://huggingface.co/papers/2205.14232,,,,2205.14232,2,0
@@ -1096,7 +1096,7 @@ Identifying Interpretable Subspaces in Image Representations,"Neha Mukund Kalibh
1096
  LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian Walder, Spiridon Penev, Richard Nock",http://arxiv.org/abs/2301.11695,,https://huggingface.co/papers/2301.11695,,,,2301.11695,4,0
1097
  R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
1098
  High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
1099
- COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,0
1100
  Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,3
1101
  HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
1102
  Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,0
@@ -1115,7 +1115,7 @@ Bootstrapped Representations in Reinforcement Learning,"Charline Le Lan, Stephen
1115
  Quantile Credit Assignment,"Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Remi Munos",,,,,,,,,
1116
  Understanding Self-Predictive Learning for Reinforcement Learning,"Yunhao Tang, Zhaohan Guo, Pierre Richemond, Bernardo Avila Pires, Yash Chandak, Remi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, Andras Gyorgy, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko",http://arxiv.org/abs/2212.03319,,https://huggingface.co/papers/2212.03319,,,,2212.03319,16,0
1117
  Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning,"Brett Daley, Martha White, Christopher Amato, Marlos C. Machado",http://arxiv.org/abs/2301.11321,,https://huggingface.co/papers/2301.11321,,,,2301.11321,4,0
1118
- "For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal","Yingdong Hu, Renhao Wang, Li Li, Yang Gao",http://arxiv.org/abs/2304.04591,,https://huggingface.co/papers/2304.04591,,,,2304.04591,4,0
1119
  Weakly Supervised Regression with Interval Targets,"Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, LEI FENG",,,,,,,,,
1120
  Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents,"WENHAO XU, Xuefeng Gao, Xuedong He",http://arxiv.org/abs/2301.12601,,https://huggingface.co/papers/2301.12601,,,,2301.12601,3,0
1121
  Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity,"Xuxing Chen, Minhui Huang, Shiqian Ma, Krishna Balasubramanian",http://arxiv.org/abs/2210.12839,,https://huggingface.co/papers/2210.12839,,,,2210.12839,4,0
@@ -1140,7 +1140,7 @@ Phase Transitions in the Detection of Correlated Databases,"Dor Elimelech, Wasim
1140
  New metrics and search algorithms for weighted causal DAGs,"Davin Choo, Kirankumar Shiragur",http://arxiv.org/abs/2305.04445,,https://huggingface.co/papers/2305.04445,,,,2305.04445,2,0
1141
  CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms,"Shengyi Huang, Rousslan Fernand Julien Dossa, Chang Ye, Jeff Braga, Dipam Chakraborty, Kinal Mehta, João Madeira Araujo",http://arxiv.org/abs/2111.08819,https://github.com/vwxyzjn/cleanrl,https://huggingface.co/papers/2111.08819,,,,2111.08819,4,1
1142
  Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning,"Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Khan Emtiyaz, Mark Schmidt",http://arxiv.org/abs/2302.09738,https://github.com/yorkerlin/StructuredNGD-DL,https://huggingface.co/papers/2302.09738,,,,2302.09738,6,0
1143
- Polarity Is All You Need to Learn and Transfer Faster,"Alice (Qingyang) Wang, Michael Powell, Eric Bridgeford, Ali Geisa, Joshua Vogelstein",http://arxiv.org/abs/2303.17589,,https://huggingface.co/papers/2303.17589,,,,2303.17589,5,0
1144
  Scaling Vision Transformers to 22 Billion Parameters,"Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Collier, Alexey Gritsenko, Vighnesh N Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetic, Dustin Tran, Thomas Kipf, Mario Lucic, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby",http://arxiv.org/abs/2302.05442,,https://huggingface.co/papers/2302.05442,,,,2302.05442,22,0
1145
  Toward Fair and Robust Estimation of Optimal Treatment Regimes,"Kwangho Kim, Jose Zubizarreta",,,,,,,,,
1146
  Internally Rewarded Reinforcement Learning,"Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter",http://arxiv.org/abs/2302.00270,,https://huggingface.co/papers/2302.00270,,,,2302.00270,5,0
@@ -1154,7 +1154,7 @@ Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming,
1154
  Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss,"Pierre Bréchet, Katerina Papagiannouli, Jing An, Guido Montufar",http://arxiv.org/abs/2303.03027,,https://huggingface.co/papers/2303.03027,,,,2303.03027,4,0
1155
  Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach,"Yanwei Jia, Xun Yu Zhou",http://arxiv.org/abs/2108.06655,,https://huggingface.co/papers/2108.06655,,,,2108.06655,2,0
1156
  VIMA: Robot Manipulation with Multimodal Prompts,"Yunfan Jiang, Agrim Gupta, Zichen Zhang, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei, Anima Anandkumar, Yuke Zhu, Jim Fan",,,,,,,,,
1157
- StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes,"Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry, Sayan Ranu, N M Anoop Krishnan",http://arxiv.org/abs/2301.12477,,https://huggingface.co/papers/2301.12477,,,,2301.12477,5,0
1158
  Multi-agent Online Scheduling: MMS Allocations for Indivisible Items,"Shengwei Zhou, Rufan Bai, Xiaowei Wu",http://arxiv.org/abs/2304.13405,,https://huggingface.co/papers/2304.13405,,,,2304.13405,3,0
1159
  Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries,"Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Solja\v{c}i\'{c}, Akash Srivastava",http://arxiv.org/abs/2303.02484,,https://huggingface.co/papers/2303.02484,,,,2303.02484,8,0
1160
  NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation,"Jianfeng Wang, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Thomas Lukasiewicz",,,,,,,,,
@@ -1178,13 +1178,13 @@ Instrumental Variable Estimation of Average Partial Causal Effects,"Yuta Kawakam
1178
  Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process,"Yidong Ouyang, Liyan Xie, Guang Cheng",,,,,,,,,
1179
  MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks,"Wenfang Sun, Yingjun Du, Xiantong Zhen, Fan Wang, Ling Wang, Cees Snoek",http://arxiv.org/abs/2305.10309,,https://huggingface.co/papers/2305.10309,,,,2305.10309,6,0
1180
  Provable Dynamic Fusion for Low-Quality Multimodal Data,"qingyang zhang, Haitao Wu, Changqing Zhang, Qinghua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng",http://arxiv.org/abs/2306.02050,,https://huggingface.co/papers/2306.02050,,,,2306.02050,7,0
1181
- Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering,"Erlin Pan, zhao kang",http://arxiv.org/abs/2305.02931,,https://huggingface.co/papers/2305.02931,,,,2305.02931,2,0
1182
  SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation,"Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li",http://arxiv.org/abs/2211.14813,https://github.com/ArrowLuo/SegCLIP,https://huggingface.co/papers/2211.14813,,,,2211.14813,5,0
1183
  Explainability as statistical inference,"Hugo Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei",http://arxiv.org/abs/2212.03131,,https://huggingface.co/papers/2212.03131,,,,2212.03131,4,0
1184
  Learning Prescriptive ReLU Networks,"Wei Sun, Asterios Tsiourvas",http://arxiv.org/abs/2306.00651,,https://huggingface.co/papers/2306.00651,,,,2306.00651,2,0
1185
  Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions,"Lin-Han Jia, Lan-Zhe Guo, Zhi Zhou, Jie-Jing Shao, Yuke Xiang, Yu-Feng Li",,,,,,,,,
1186
  Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels,"Simone Bombari, Shayan Kiyani, Marco Mondelli",http://arxiv.org/abs/2302.01629,,https://huggingface.co/papers/2302.01629,,,,2302.01629,3,0
1187
- Human-Timescale Adaptation in an Open-Ended Task Space,"Jakob Bauer, Kate Baumli, Feryal Behbahani, Avishkar Bhoopchand, Natalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Satinder Singh, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang",http://arxiv.org/abs/2301.07608,,https://huggingface.co/papers/2301.07608,,,,2301.07608,22,0
1188
  Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Linear Speedup and Partial Participation,"Xiaoyun Li, Ping Li",,,,,,,,,
1189
  ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang",http://arxiv.org/abs/2301.12040,,https://huggingface.co/papers/2301.12040,,,,2301.12040,4,0
1190
  Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
@@ -1192,7 +1192,7 @@ Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wa
1192
  Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
1193
  Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,0
1194
  Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
1195
- Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,0
1196
  Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
1197
  Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice,"Yishay Mansour, Richard Nock, Robert C. Williamson",,,,,,,,,
1198
  "Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods","Aleksandr Shevchenko, Kevin Kögler, Hamed Hassani, Marco Mondelli",http://arxiv.org/abs/2212.13468,,https://huggingface.co/papers/2212.13468,,,,2212.13468,4,0
@@ -1203,12 +1203,12 @@ Generalized Teacher Forcing for Learning Chaotic Dynamics,"Florian Hess, Zahra M
1203
  HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption,"Seewoo Lee, Garam Lee, Jung Woo Kim, Junbum Shin, Mun-Kyu Lee",,,,,,,,,
1204
  Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations,David Wipf,,,,,,,,,
1205
  Direct Parameterization of Lipschitz-Bounded Deep Networks,"Ruigang Wang, Ian Manchester",http://arxiv.org/abs/2301.11526,https://github.com/acfr/LBDN,https://huggingface.co/papers/2301.11526,,,,2301.11526,2,0
1206
- XAI Beyond Classification: Interpretable Neural Clustering,"Xi Peng, Yunfan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou",http://arxiv.org/abs/1808.07292,,https://huggingface.co/papers/1808.07292,,,,1808.07292,6,0
1207
  Exploiting locality in high-dimensional Factorial hidden Markov models,"Lorenzo Rimella, Nick Whiteley",http://arxiv.org/abs/1902.01639,,https://huggingface.co/papers/1902.01639,,,,1902.01639,2,0
1208
  Mitigating the Effects of Non-Identifiability on Inference for Bayesian Neural Networks with Latent Variables,"Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez",http://arxiv.org/abs/1911.00569,,https://huggingface.co/papers/1911.00569,,,,1911.00569,3,0
1209
  Project and Forget: Solving Large-Scale Metric Constrained Problems,"Rishi Sonthalia, Anna C. Gilbert",http://arxiv.org/abs/2005.03853,,https://huggingface.co/papers/2005.03853,,,,2005.03853,2,0
1210
  "Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence","Julie Nutini, Issam Laradji, Mark Schmidt",http://arxiv.org/abs/1712.08859,,https://huggingface.co/papers/1712.08859,,,,1712.08859,3,0
1211
- Cluster-Specific Predictions with Multi-Task Gaussian Processes,"Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey",http://arxiv.org/abs/2011.07866,,https://huggingface.co/papers/2011.07866,,,,2011.07866,4,0
1212
  Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials,"Daiqi Gao, Yufeng Liu, Donglin Zeng",,,,,,,,,
1213
  Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks,"Aleksandr Shevchenko, Vyacheslav Kungurtsev, Marco Mondelli",http://arxiv.org/abs/2111.02278,,https://huggingface.co/papers/2111.02278,,,,2111.02278,3,0
1214
  "Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism","Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos",http://arxiv.org/abs/2012.11579,,https://huggingface.co/papers/2012.11579,,,,2012.11579,4,0
@@ -1219,7 +1219,7 @@ Knowledge Hypergraph Embedding Meets Relational Algebra,"Bahare Fatemi, Perouz T
1219
  Deep linear networks can benignly overfit when shallow ones do,"Niladri S. Chatterji, Phil Long",http://arxiv.org/abs/2209.09315,,https://huggingface.co/papers/2209.09315,,,,2209.09315,2,0
1220
  Taming graph kernels with random features,Krzysztof Choromanski,http://arxiv.org/abs/2305.00156,,https://huggingface.co/papers/2305.00156,,,,2305.00156,1,0
1221
  On Uni-Modal Feature Learning in Supervised Multi-Modal Learning,"Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, Hang Zhao",http://arxiv.org/abs/2305.01233,,https://huggingface.co/papers/2305.01233,,,,2305.01233,8,0
1222
- CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations,"Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon",http://arxiv.org/abs/2305.01118,,https://huggingface.co/papers/2305.01118,,,,2305.01118,5,1
1223
  CLIPood: Generalizing CLIP to Out-of-Distributions,"Yang Shu, Xingzhuo Guo, Jialong Wu, Ximei Wang, Jianmin Wang, Mingsheng Long",http://arxiv.org/abs/2302.00864,,https://huggingface.co/papers/2302.00864,,,,2302.00864,6,0
1224
  Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning,"Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, Tarek Abdelzaher, Jiawei Han",http://arxiv.org/abs/2211.03044,,https://huggingface.co/papers/2211.03044,,,,2211.03044,6,1
1225
  Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization,"Jiwoo Son, Minsu Kim, Hyeonah Kim, Jinkyoo Park",,,,,,,,,
@@ -1251,7 +1251,7 @@ Test-time Adaptation with Slot-Centric Models,"Mihir Prabhudesai, Anirudh Goyal,
1251
  Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification,"Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan",,,,,,,,,
1252
  Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least,"Siddharth Joshi, Baharan Mirzasoleiman",http://arxiv.org/abs/2302.09195,,https://huggingface.co/papers/2302.09195,,,,2302.09195,2,0
1253
  Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs,"Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-yi Lee, Shao-Hua Sun",http://arxiv.org/abs/2301.12950,,https://huggingface.co/papers/2301.12950,,,,2301.12950,5,0
1254
- Cooperative Open-ended Learning Framework for Zero-Shot Coordination,"Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan",http://arxiv.org/abs/2302.04831,,https://huggingface.co/papers/2302.04831,,,,2302.04831,7,0
1255
  CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design,"Desi Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster",,,,,,,,,
1256
  On the Identifiability and Estimation of Causal Location-Scale Noise Models,"Alexander Immer, Christoph Schultheiss, Julia Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx",http://arxiv.org/abs/2210.09054,,https://huggingface.co/papers/2210.09054,,,,2210.09054,6,0
1257
  From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders,"Raanan Yehezkel Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik",http://arxiv.org/abs/2306.00624,,https://huggingface.co/papers/2306.00624,,,,2306.00624,4,0
@@ -1267,8 +1267,8 @@ Non-autoregressive Conditional Diffusion Models for Time Series Prediction,"Life
1267
  Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions,"Leo Klarner, Tim G. J. Rudner, Michael Reutlinger, Torsten Schindler, Garrett Morris, Charlotte Deane, Yee-Whye Teh",,,,,,,,,
1268
  SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series,"Iris Huijben, Arthur A. Nijdam, Sebastiaan Overeem, Merel Van Gilst, Ruud J. G. van Sloun",,,,,,,,,
1269
  Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction,"Minghao Guo, Veronika Thost, Samuel Song, Adithya Balachandran, Payel Das, Jie Chen, Wojciech Matusik",,,,,,,,,
1270
- A Closer Look at the Intervention Procedure of Concept Bottleneck Models,"Sungbin Shin, Yohan Jo, Sungsoo Ahn, Namhoon Lee",http://arxiv.org/abs/2302.14260,,https://huggingface.co/papers/2302.14260,,,,2302.14260,4,0
1271
- Simple Hardware-Efficient Long Convolutions for Sequence Modeling,"Daniel Y Fu, Elliot L Epstein, Eric Nguyen, Michael Zhang, Tri Dao, Atri Rudra, Christopher Re",http://arxiv.org/abs/2302.06646,,https://huggingface.co/papers/2302.06646,,,,2302.06646,8,0
1272
  Towards Controlled Data Augmentations for Active Learning,"Jianan Yang, Jianan Yang, Haobo Wang, Sai Wu, Gang Chen, Junbo Zhao",,,,,,,,,
1273
  "Bigger, Better, Faster: Human-level Atari with human-level efficiency","Max Schwarzer, Johan Obando Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro",http://arxiv.org/abs/2305.19452,https://github.com/google-research/google-research/tree/master/bigger_better_faster,https://huggingface.co/papers/2305.19452,,,,2305.19452,6,3
1274
  A Law of Robustness beyond Isoperimetry,"Yihan Wu, Heng Huang, Hongyang Zhang",http://arxiv.org/abs/2202.11592,,https://huggingface.co/papers/2202.11592,,,,2202.11592,3,0
@@ -1300,7 +1300,7 @@ ContraBAR: Contrastive Bayes-Adaptive Deep RL,"Era Choshen, Aviv Tamar",http://a
1300
  Guiding Pretraining in Reinforcement Learning with Large Language Models,"Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas",http://arxiv.org/abs/2302.06692,,https://huggingface.co/papers/2302.06692,,,,2302.06692,8,0
1301
  PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient,"Kaixin Wang, Zhou Daquan, Jiashi Feng, Shie Mannor",,,,,,,,,
1302
  Differentially Private Sharpness-Aware Training,"Jinseong Park, Hoki Kim, Yujin Choi, Jaewook Lee",http://arxiv.org/abs/2306.05651,https://github.com/jinseongP/DPSAT,https://huggingface.co/papers/2306.05651,,,,2306.05651,4,0
1303
- Provably and Practically Efficient Neural Contextual Bandits,Sudeep Salgia,http://arxiv.org/abs/2206.00099,,https://huggingface.co/papers/2206.00099,,,,2206.00099,3,0
1304
  How Does Information Bottleneck Help Deep Learning?,"Kenji Kawaguchi, Zhun Deng, Xu Ji, Jiaoyang Huang",http://arxiv.org/abs/2305.18887,https://github.com/xu-ji/information-bottleneck,https://huggingface.co/papers/2305.18887,,,,2305.18887,4,0
1305
  Why Is Public Pretraining Necessary for Private Model Training?,"Arun Ganesh, Mahdi Haghifam, Milad Nasresfahani, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Guha Thakurta, Lun Wang",http://arxiv.org/abs/2302.09483,,https://huggingface.co/papers/2302.09483,,,,2302.09483,8,0
1306
  Learning Instance-Specific Augmentations by Capturing Local Invariances,"Ning Miao, Tom Rainforth, Emile Mathieu, Yann Dubois, Yee-Whye Teh, Adam Foster, Hyunjik Kim",http://arxiv.org/abs/2206.00051,,https://huggingface.co/papers/2206.00051,,,,2206.00051,7,0
@@ -1360,7 +1360,7 @@ Featured Graph Coarsening with Similarity Guarantees,"MANOJ KUMAR, Anurag Sharma
1360
  Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability,"Jianing Zhu, Hengzhuang Li, Jiangchao Yao, Tongliang Liu, Jianliang Xu, Bo Han",http://arxiv.org/abs/2306.03715,https://github.com/tmlr-group/Unleashing-Mask,https://huggingface.co/papers/2306.03715,,,,2306.03715,6,0
1361
  Conditional Graph Information Bottleneck for Molecular Relational Learning,"Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park",http://arxiv.org/abs/2305.01520,https://github.com/Namkyeong/CGIB,https://huggingface.co/papers/2305.01520,,,,2305.01520,6,0
1362
  Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang",http://arxiv.org/abs/2305.14876,https://github.com/bboylyg/RNP,https://huggingface.co/papers/2305.14876,,,,2305.14876,7,0
1363
- Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,0
1364
  Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
1365
  CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
1366
  Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,0
@@ -1407,12 +1407,12 @@ Omnipredictors for Constrained Optimization,"Lunjia Hu, Inbal Livni Navon, Omer
1407
  Bandit Online Linear Optimization with Hints and Queries,"Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit",,,,,,,,,
1408
  Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective,"Tanya Marwah, Zachary Lipton, Jianfeng Lu, Andrej Risteski",http://arxiv.org/abs/2210.12101,,https://huggingface.co/papers/2210.12101,,,,2210.12101,4,0
1409
  Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise,"Shiwei Zeng, Jie Shen",http://arxiv.org/abs/2306.00673,,https://huggingface.co/papers/2306.00673,,,,2306.00673,2,0
1410
- Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes,"seyed amir saberi, Amir Najafi, Abolfazl Motahari, Babak Khalaj",http://arxiv.org/abs/2209.05953,,https://huggingface.co/papers/2209.05953,,,,2209.05953,4,0
1411
  "Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps","Marco Cuturi, Michal Klein, Pierre Ablin",http://arxiv.org/abs/2302.04065,,https://huggingface.co/papers/2302.04065,,,,2302.04065,3,0
1412
  Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance,"Zhao Song, Xin Yang, Yuanyuan Yang, Lichen Zhang",http://arxiv.org/abs/2210.11542,,https://huggingface.co/papers/2210.11542,,,,2210.11542,4,0
1413
  Combinatorial Neural Bandits,"Taehyun Hwang, Kyuwook Chai, Min-hwan Oh",http://arxiv.org/abs/2306.00242,,https://huggingface.co/papers/2306.00242,,,,2306.00242,3,0
1414
  Reward-Mixing MDPs with Few Contexts are Learnable,"Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor",,,,,,,,,
1415
- Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling,"Adam Bouland, Yosheb Getachew, Yujia Jin, Aaron Sidford, Kevin Tian",http://arxiv.org/abs/2301.03763,,https://huggingface.co/papers/2301.03763,,,,2301.03763,5,0
1416
  Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits,Chen Wang,http://arxiv.org/abs/2306.02208,,https://huggingface.co/papers/2306.02208,,,,2306.02208,1,0
1417
  Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation,"Li'ang Li, Yifei duan, Guanghua Ji, Yongqiang Cai",http://arxiv.org/abs/2305.18460,,https://huggingface.co/papers/2305.18460,,,,2305.18460,4,0
1418
  Dynamical Linear Bandits,"Marco Mussi, Alberto Maria Metelli, Marcello Restelli",http://arxiv.org/abs/2211.08997,,https://huggingface.co/papers/2211.08997,,,,2211.08997,3,0
@@ -1497,7 +1497,7 @@ Who Needs to Know? Minimal Knowledge for Optimal Coordination,"Niklas Lauffer, A
1497
  Neural networks trained with SGD learn distributions of increasing complexity,"Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt",http://arxiv.org/abs/2211.11567,,https://huggingface.co/papers/2211.11567,,,,2211.11567,3,0
1498
  Scaling Laws for Multilingual Neural Machine Translation,"Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia, Markus Freitag, Orhan Firat",http://arxiv.org/abs/2302.09650,,https://huggingface.co/papers/2302.09650,,,,2302.09650,5,0
1499
  Explaining the effects of non-convergent MCMC in the training of Energy-Based Models,"Elisabeth Agoritsas, Giovanni Catania, Aurélien Decelle, Beatriz Seoane",,,,,,,,,
1500
- A Three-regime Model of Network Pruning,"Yefan Zhou, Yaoqing Yang, Arin Chang, Michael Mahoney",http://arxiv.org/abs/2305.18383,,https://huggingface.co/papers/2305.18383,,,,2305.18383,4,0
1501
  Metagenomic Binning using Connectivity-constrained Variational Autoencoders,"Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas D. Nielsen",,,,,,,,,
1502
  SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning,"Dongseok Shim, Seungjae Lee, H. Kim",http://arxiv.org/abs/2301.11520,,https://huggingface.co/papers/2301.11520,,,,2301.11520,3,0
1503
  Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation,"Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, Siu Ming Yiu, Ruihua Han",,,,,,,,,
@@ -1621,7 +1621,7 @@ On the Interplay Between Misspecification and Sub-optimality Gap in Linear Conte
1621
  Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
1622
  On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
1623
  Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,0
1624
- A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,0
1625
  Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
1626
  Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,0
1627
  On the Robustness of Randomized Ensembles to Adversarial Perturbations,"Hassan Dbouk, Naresh Shanbhag",http://arxiv.org/abs/2302.01375,https://github.com/hsndbk4/BARRE,https://huggingface.co/papers/2302.01375,,,,2302.01375,2,0
@@ -1647,7 +1647,7 @@ LinSATNet: The Positive Linear Satisfiability Neural Networks,"Runzhong Wang, Yu
1647
  On the Complexity of Bayesian Generalization,"Yu-Zhe Shi, Manjie Xu, John Hopcroft, Kun He, Josh Tenenbaum, Song-Chun Zhu, Ying Nian Wu, Wenjuan Han, Yixin Zhu",http://arxiv.org/abs/2211.11033,,https://huggingface.co/papers/2211.11033,,,,2211.11033,9,0
1648
  QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark,"Xudong Lu, Kaisen Pan, Ge Yan, Jiaming Shan, Wenjie Wu, Junchi Yan",,,,,,,,,
1649
  Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits,Markus Bläser,,,,,,,,,
1650
- PAL: Program-aided Language Models,"Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig",http://arxiv.org/abs/2211.10435,,https://huggingface.co/papers/2211.10435,,,,2211.10435,8,1
1651
  Tighter Bounds on the Expressivity of Transformer Encoders,"David Chiang, Peter Cholak, Anand Pillay",http://arxiv.org/abs/2301.10743,,https://huggingface.co/papers/2301.10743,,,,2301.10743,3,0
1652
  Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation,"Joonhyuk Yang, Shin Dongpil, Hye Won Chung",http://arxiv.org/abs/2305.19666,,https://huggingface.co/papers/2305.19666,,,,2305.19666,3,0
1653
  Causal Discovery with Latent Confounders Based on Higher-Order Cumulants,"Ruichu Cai, Zhiyi Huang, Wei Chen, Zhifeng Hao, Kun Zhang",http://arxiv.org/abs/2305.19582,,https://huggingface.co/papers/2305.19582,,,,2305.19582,5,0
@@ -1685,7 +1685,7 @@ Robustness in Multimodal Learning under Train-Test Modality Mismatch,"Brandon Mc
1685
  Learning Representations without Compositional Assumptions,"Tennison Liu, Jeroen Berrevoets, Zhaozhi Qian, Mihaela van der Schaar",http://arxiv.org/abs/2305.19726,,https://huggingface.co/papers/2305.19726,,,,2305.19726,4,0
1686
  Making Transformers Compute-lite for CPU inference,"Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh",,,,,,,,,
1687
  Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers,"Grant Strimel, Yi Xie, Brian King, martin radfar, Ariya Rastrow, Athanasios Mouchtaris",http://arxiv.org/abs/2305.04159,,https://huggingface.co/papers/2305.04159,,,,2305.04159,6,0
1688
- Expected Gradients of Maxout Networks and Consequences to Parameter Initialization,"Hanna Tseran, Guido Montufar",http://arxiv.org/abs/2301.06956,,https://huggingface.co/papers/2301.06956,,,,2301.06956,2,0
1689
  Competing for Shareable Arms in Multi-Player Multi-Armed Bandits,"Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui",http://arxiv.org/abs/2305.19158,,https://huggingface.co/papers/2305.19158,,,,2305.19158,5,1
1690
  Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs,"Raif Rustamov, Subhabrata Majumdar",http://arxiv.org/abs/2010.15285,,https://huggingface.co/papers/2010.15285,,,,2010.15285,2,1
1691
  Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets,"Yurong Chen, Zhaohua Chen, Xiaotie Deng, Zhijian Duan, Haoran Sun, Qian Wang, Xiang Yan",http://arxiv.org/abs/2306.07709,,https://huggingface.co/papers/2306.07709,,,,2306.07709,7,0
@@ -1699,12 +1699,12 @@ Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization,
1699
  One-Step Estimator for Permuted Sparse Recovery,"Hang Zhang, Ping Li",,,,,,,,,
1700
  Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator,Alexander Shekhovtsov,,,,,,,,,
1701
  Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection,"Lorenzo Perini, Paul Buerkner, Arto Klami",http://arxiv.org/abs/2210.10487,,https://huggingface.co/papers/2210.10487,,,,2210.10487,3,0
1702
- Image generation with shortest path diffusion,"Ayan Das, Ayan Das, Stathi Fotiadis, Anil Batra, Farhang Nabiei, FengTing Liao, Sattar Vakili, Da-shan Shiu, Alberto Bernacchia",http://arxiv.org/abs/2306.00501,,https://huggingface.co/papers/2306.00501,,,,2306.00501,8,0
1703
  Deep Anomaly Detection under Labeling Budget Constraints,"Aodong Li, Chen Qiu, Padhraic Smyth, Marius Kloft, Stephan Mandt, Maja Rudolph",http://arxiv.org/abs/2302.07832,,https://huggingface.co/papers/2302.07832,,,,2302.07832,6,0
1704
  Transformed Distribution Matching for Missing Value Imputation,"He Zhao, Ke Sun, Amir Dezfouli, Edwin V Bonilla",http://arxiv.org/abs/2302.10363,,https://huggingface.co/papers/2302.10363,,,,2302.10363,4,0
1705
  Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?,"Ruisi Cai, Zhenyu Zhang, Zhangyang “Atlas” Wang",http://arxiv.org/abs/2302.12480,,https://huggingface.co/papers/2302.12480,,,,2302.12480,3,0
1706
  Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models,"Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel",,,,,,,,,
1707
- Better Diffusion Models Further Improve Adversarial Training,"Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng YAN",http://arxiv.org/abs/2302.04638,https://github.com/wzekai99/DM-Improves-AT,https://huggingface.co/papers/2302.04638,,,,2302.04638,6,0
1708
  On the Expressive Power of Geometric Graph Neural Networks,"Chaitanya Joshi, Cristian Bodnar, Simon Mathis, Taco Cohen, Pietro Lió",http://arxiv.org/abs/2301.09308,https://github.com/chaitjo/geometric-gnn-dojo,https://huggingface.co/papers/2301.09308,,,,2301.09308,5,0
1709
  Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapalli,http://arxiv.org/abs/2306.04004,,https://huggingface.co/papers/2306.04004,,,,2306.04004,1,1
1710
  Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,0
@@ -1780,7 +1780,7 @@ The Monge Gap: A Regularizer to Learn All Transport Maps,"Théo Uscidda, Marco C
1780
  AbODE: Ab initio antibody design using conjoined ODEs,"Yogesh Verma, Markus Heinonen, Vikas K Garg",http://arxiv.org/abs/2306.01005,,https://huggingface.co/papers/2306.01005,,,,2306.01005,3,0
1781
  Learning-augmented private algorithms for multiple quantile release,"Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii",http://arxiv.org/abs/2210.11222,,https://huggingface.co/papers/2210.11222,,,,2210.11222,4,0
1782
  Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds,"Shengshi Li, Lin Yang",,,,,,,,,
1783
- Variational Autoencoding Neural Operators,"Jacob H. Seidman, Georgios Kissas, George J. Pappas, Paris Perdikaris",http://arxiv.org/abs/2302.10351,,https://huggingface.co/papers/2302.10351,,,,2302.10351,4,0
1784
  Efficient Parametric Approximations of Neural Network Function Space Distance,"Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse",http://arxiv.org/abs/2302.03519,,https://huggingface.co/papers/2302.03519,,,,2302.03519,4,0
1785
  Theory on Forgetting and Generalization of Continual Learning,"Sen Lin, Peizhong Ju, Yingbin LIANG, Ness Shroff",http://arxiv.org/abs/2302.05836,,https://huggingface.co/papers/2302.05836,,,,2302.05836,4,0
1786
  Trapdoor Normalization with Irreversible Ownership Verification,"Hanwen Liu, Zhenyu Weng, Yuesheng Zhu, Yadong Mu",,,,,,,,,
 
172
  Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication,"Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Ding, Zhangyang “Atlas” Wang",,,,,,,,,
173
  A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification,"Jiachen Sun, Jiongxiao Wang, Weili Nie, Zhiding Yu, Zhuoqing Morley Mao, Chaowei Xiao",,,,,,,,,
174
  COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects,"Anlan Yu, Ning Lyu, Jieming Yin, Zhiyuan Yan, Wujie Wen",,,,,,,,,
175
+ A Closer Look at Self-Supervised Lightweight Vision Transformers,"Shaoru Wang, Jin Gao, Zeming Li, Xiaoqin Zhang, Weiming Hu",http://arxiv.org/abs/2205.14443,https://github.com/wangsr126/mae-lite,https://huggingface.co/papers/2205.14443,,,,2205.14443,5,1
176
  Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space,"Anas Barakat, Ilyas Fatkhullin, Niao He",http://arxiv.org/abs/2306.01854,,https://huggingface.co/papers/2306.01854,,,,2306.01854,3,0
177
  Leveraging Offline Data in Online Reinforcement Learning,"Andrew Wagenmaker, Aldo Pacchiano",http://arxiv.org/abs/2211.04974,,https://huggingface.co/papers/2211.04974,,,,2211.04974,2,0
178
  Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression,"Mo Zhou, Rong Ge",http://arxiv.org/abs/2302.00257,,https://huggingface.co/papers/2302.00257,,,,2302.00257,2,0
 
492
  Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time,"Toon Vanderschueren, Alicia Curth, Wouter Verbeke, Mihaela van der Schaar",http://arxiv.org/abs/2306.04255,,https://huggingface.co/papers/2306.04255,,,,2306.04255,4,0
493
  AudioLDM: Text-to-Audio Generation with Latent Diffusion Models,"Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley",http://arxiv.org/abs/2301.12503,,https://huggingface.co/papers/2301.12503,,,,2301.12503,8,1
494
  Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature,"Khang Nguyen, Nong Hieu, Vinh NGUYEN, Nhat Ho, Stanley Osher, TAN NGUYEN",http://arxiv.org/abs/2211.15779,,https://huggingface.co/papers/2211.15779,,,,2211.15779,6,1
495
+ Lifelong Language Pretraining with Distribution-Specialized Experts,"Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cui",http://arxiv.org/abs/2305.12281,,https://huggingface.co/papers/2305.12281,,,,2305.12281,7,1
496
  Delay-agnostic Asynchronous Coordinate Update Algorithm,"Xuyang Wu, Changxin Liu, Sindri Magnússon, Mikael Johansson",http://arxiv.org/abs/2305.08535,,https://huggingface.co/papers/2305.08535,,,,2305.08535,4,1
497
  Prototype-oriented unsupervised anomaly detection for multivariate time series,"yuxin li, Wenchao Chen, Bo Chen, Dongsheng Wang, Long Tian, Mingyuan Zhou",,,,,,,,,
498
  ClimaX: A foundation model for weather and climate,"Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover",http://arxiv.org/abs/2301.10343,,https://huggingface.co/papers/2301.10343,,,,2301.10343,5,1
 
597
  Conformal Prediction Sets for Graph Neural Networks,"Soroush H. Zargarbashi, Simone Antonelli, Aleksandar Bojchevski",,,,,,,,,
598
  Probabilistic Attention-to-Influence Neural Models for Event Sequences,"Xiao Shou, DEBARUN BHATTACHARJYA, Tian Gao, Dharmashankar Subramanian, Oktie Hassanzadeh, Kristin Bennett",,,,,,,,,
599
  Nearly-tight Bounds for Deep Kernel Learning,"Yi-Fan Zhang, Min-Ling Zhang",,,,,,,,,
600
+ Generalized Disparate Impact for Configurable Fairness Solutions in ML,"Luca Giuliani, Eleonora Misino, Michele Lombardi",http://arxiv.org/abs/2305.18504,,https://huggingface.co/papers/2305.18504,,,,2305.18504,3,1
601
  Thompson Sampling with Less Exploration is Fast and Optimal,"Tianyuan Jin, XIANGLIN YANG, Xiaokui Xiao, Pan Xu",,,,,,,,,
602
  Do Machine Learning Models Learn Statistical Rules Inferred from Data?,"Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong",http://arxiv.org/abs/2303.01433,https://github.com/DebugML/sqrl,https://huggingface.co/papers/2303.01433,,,,2303.01433,4,1
603
  Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations,"Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao",,,,,,,,,
 
608
  GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,1
609
  Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
610
  Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
611
+ SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
612
  Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,,
613
  Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1
614
  A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,0
 
730
  Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
731
  Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime,"Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar",http://arxiv.org/abs/2302.14154,,https://huggingface.co/papers/2302.14154,,,,2302.14154,4,0
732
  Double-Weighting for Covariate Shift Adaptation,"José I. Segovia-Martín, Santiago Mazuelas, Anqi Liu",http://arxiv.org/abs/2305.08637,,https://huggingface.co/papers/2305.08637,,,,2305.08637,3,0
733
+ Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints,"Donghao Li, Ruiquan Huang, Cong Shen, Jing Yang",http://arxiv.org/abs/2306.06265,,https://huggingface.co/papers/2306.06265,,,,2306.06265,4,1
734
  PASTA: Pessimistic Assortment Optimization,"Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan Fang, Vahid Tarokh",http://arxiv.org/abs/2302.03821,,https://huggingface.co/papers/2302.03821,,,,2302.03821,6,0
735
  Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D,"Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Wei-Ying Ma, Yanyan Lan",,,,,,,,,
736
  Off-Policy Average Reward Actor-Critic with Deterministic Policy Search,"Naman Saxena, Subhojyoti Khastagir, Shishir Nadubettu Yadukumar, Shalabh Bhatnagar",http://arxiv.org/abs/2305.12239,,https://huggingface.co/papers/2305.12239,,,,2305.12239,4,0
 
833
  Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model,"Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang",http://arxiv.org/abs/2303.08613,,https://huggingface.co/papers/2303.08613,,,,2303.08613,4,1
834
  SLAMB: Accelerated Large Batch Training with Sparse Communication,"Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, TingWen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis",,,,,,,,,
835
  Efficient Quantum Algorithms for Quantum Optimal Control,"Xiantao Li, Chunhao Wang",http://arxiv.org/abs/2304.02613,,https://huggingface.co/papers/2304.02613,,,,2304.02613,2,0
836
+ Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation,"Aditya Mate, Bryan Wilder, Aparna Taneja, Milind Tambe",http://arxiv.org/abs/2302.02570,,https://huggingface.co/papers/2302.02570,,,,2302.02570,4,1
837
  Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization,"Jian Cao, Myeongjong Kang, Felix Jimenez, Huiyan Sang, Florian Schaefer, Matthias Katzfuss",http://arxiv.org/abs/2301.13303,,https://huggingface.co/papers/2301.13303,,,,2301.13303,6,0
838
  Efficient exploration via epistemic-risk-seeking policy gradients,Brendan O'Donoghue,,,,,,,,,
839
  Probing the Deep Neural Manifold of Reinforcement Learning to Expose Volatility,"Ezgi Korkmaz, Jonah Brown-Cohen",,,,,,,,,
 
871
  Cut your Losses with Squentropy,"Like Hui, Misha Belkin, Stephen Wright",http://arxiv.org/abs/2302.03952,,https://huggingface.co/papers/2302.03952,,,,2302.03952,3,0
872
  Multi-Agent Learning from Learners,"MINE M CALISKAN, Francesco Chini, Setareh Maghsudi",,,,,,,,,
873
  Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning,"Matthias Gerstgrasser, David Parkes",,,,,,,,,
874
+ Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees,"Faisal Hamman, Erfaun Noorani, Saumitra Mishra, Daniele Magazzeni, Sanghamitra Dutta",http://arxiv.org/abs/2305.11997,,https://huggingface.co/papers/2305.11997,,,,2305.11997,5,1
875
  Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables,"Rick Wilming, Leo Kieslich, Benedict Clark, Stefan Haufe",http://arxiv.org/abs/2306.01464,,https://huggingface.co/papers/2306.01464,,,,2306.01464,4,1
876
  When do Minimax-fair Learning and Empirical Risk Minimization Coincide?,"Harvineet Singh, Matthäus Kleindessner, Volkan Cevher, Rumi Chunara, Chris Russell",,,,,,,,,
877
  Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows,"Phillip Si, Zeyi Chen, Subham S Sahoo, Yair Schiff, Volodymyr Kuleshov",,,,,,,,,
 
977
  Bayes-optimal Learning of Deep Random Networks of Extensive-width,"Hugo Cui, FLORENT KRZAKALA, Lenka Zdeborova",,,,,,,,,
978
  Adapting to game trees in zero-sum imperfect information games,"Côme Fiegel, Pierre Menard, Tadashi Kozuno, Remi Munos, Vianney Perchet, Michal Valko",http://arxiv.org/abs/2212.12567,,https://huggingface.co/papers/2212.12567,,,,2212.12567,6,0
979
  Adversarial Policies Beat Superhuman Go AIs,"Tony Wang, Adam Gleave, Tom Tseng, Nora Belrose, Kellin Pelrine, Joseph Miller, Michael Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell",http://arxiv.org/abs/2211.00241,,https://huggingface.co/papers/2211.00241,,,,2211.00241,11,1
980
+ Pretraining Language Models with Human Preferences,"Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher Buckley, Jason Phang, Samuel Bowman, Ethan Perez",http://arxiv.org/abs/2302.08582,,https://huggingface.co/papers/2302.08582,,,,2302.08582,8,2
981
  Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples,"Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui XUE, Ruhui Ma, Haibing Guan",http://arxiv.org/abs/2302.04578,https://github.com/mist-project/mist.git,https://huggingface.co/papers/2302.04578,,,,2302.04578,9,0
982
  A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs,"Mikael Henaff, Minqi Jiang, Roberta Raileanu",http://arxiv.org/abs/2306.03236,,https://huggingface.co/papers/2306.03236,,,,2306.03236,3,0
983
  Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies,"Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai",http://arxiv.org/abs/2208.10264,,https://huggingface.co/papers/2208.10264,,,,2208.10264,3,0
 
989
  Towards Theoretical Understanding of Inverse Reinforcement Learning,"Alberto Maria Metelli, Filippo Lazzati, Marcello Restelli",http://arxiv.org/abs/2304.12966,,https://huggingface.co/papers/2304.12966,,,,2304.12966,3,0
990
  Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond,"Jaeyoung Cha, Jaewook Lee, Chulhee Yun",http://arxiv.org/abs/2303.07160,,https://huggingface.co/papers/2303.07160,,,,2303.07160,3,0
991
  Delayed Feedback in Kernel Bandits,"Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke",http://arxiv.org/abs/2302.00392,,https://huggingface.co/papers/2302.00392,,,,2302.00392,4,0
992
+ Sharper Bounds for $\ell_p$ Sensitivity Sampling,"David Woodruff, Taisuke Yasuda",http://arxiv.org/abs/2306.00732,,https://huggingface.co/papers/2306.00732,,,,2306.00732,2,1
993
  Hyena Hierarchy: Towards Larger Convolutional Language Models,"Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Re",http://arxiv.org/abs/2302.10866,,https://huggingface.co/papers/2302.10866,,,,2302.10866,9,0
994
  Delving into Noisy Label Detection with Clean Data,"Chenglin Yu, Xinsong Ma, Weiwei Liu",,,,,,,,,
995
  GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration,"Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",http://arxiv.org/abs/2301.12686,,https://huggingface.co/papers/2301.12686,,,,2301.12686,7,1
 
1056
  Towards Omni-generalizable Neural Methods for Vehicle Routing Problems,"Jianan Zhou, Yaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang",http://arxiv.org/abs/2305.19587,https://github.com/RoyalSkye/Omni-VRP,https://huggingface.co/papers/2305.19587,,,,2305.19587,5,0
1057
  Protecting Language Generation Models via Invisible Watermarking,"Xuandong Zhao, Yu-Xiang Wang, Lei Li",http://arxiv.org/abs/2302.03162,,https://huggingface.co/papers/2302.03162,,,,2302.03162,3,0
1058
  Global Optimization with Parametric Function Approximation,"Chong Liu, Yu-Xiang Wang",http://arxiv.org/abs/2211.09100,,https://huggingface.co/papers/2211.09100,,,,2211.09100,2,0
1059
+ Non-stationary Reinforcement Learning under General Function Approximation,"Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin LIANG",http://arxiv.org/abs/2306.00861,,https://huggingface.co/papers/2306.00861,,,,2306.00861,6,1
1060
  Demystifying Disagreement-on-the-Line in High Dimensions,"Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed Hassani",http://arxiv.org/abs/2301.13371,,https://huggingface.co/papers/2301.13371,,,,2301.13371,5,0
1061
  Multisample Flow Matching: Straightening Flows with Minibatch Couplings,"Aram-Alexandre Pooladian, Heli Ben-Hamu, Carles Domingo i Enrich, Brandon Amos, Yaron Lipman, Ricky T. Q. Chen",http://arxiv.org/abs/2304.14772,,https://huggingface.co/papers/2304.14772,,,,2304.14772,6,1
1062
  Competitive Gradient Optimization,"Abhijeet Vyas, Brian Bullins, Kamyar Azizzadenesheli",http://arxiv.org/abs/2205.14232,,https://huggingface.co/papers/2205.14232,,,,2205.14232,2,0
 
1096
  LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian Walder, Spiridon Penev, Richard Nock",http://arxiv.org/abs/2301.11695,,https://huggingface.co/papers/2301.11695,,,,2301.11695,4,0
1097
  R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
1098
  High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
1099
+ COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
1100
  Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,3
1101
  HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
1102
  Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,0
 
1115
  Quantile Credit Assignment,"Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Remi Munos",,,,,,,,,
1116
  Understanding Self-Predictive Learning for Reinforcement Learning,"Yunhao Tang, Zhaohan Guo, Pierre Richemond, Bernardo Avila Pires, Yash Chandak, Remi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, Andras Gyorgy, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko",http://arxiv.org/abs/2212.03319,,https://huggingface.co/papers/2212.03319,,,,2212.03319,16,0
1117
  Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning,"Brett Daley, Martha White, Christopher Amato, Marlos C. Machado",http://arxiv.org/abs/2301.11321,,https://huggingface.co/papers/2301.11321,,,,2301.11321,4,0
1118
+ "For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal","Yingdong Hu, Renhao Wang, Li Li, Yang Gao",http://arxiv.org/abs/2304.04591,,https://huggingface.co/papers/2304.04591,,,,2304.04591,4,1
1119
  Weakly Supervised Regression with Interval Targets,"Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, LEI FENG",,,,,,,,,
1120
  Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents,"WENHAO XU, Xuefeng Gao, Xuedong He",http://arxiv.org/abs/2301.12601,,https://huggingface.co/papers/2301.12601,,,,2301.12601,3,0
1121
  Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity,"Xuxing Chen, Minhui Huang, Shiqian Ma, Krishna Balasubramanian",http://arxiv.org/abs/2210.12839,,https://huggingface.co/papers/2210.12839,,,,2210.12839,4,0
 
1140
  New metrics and search algorithms for weighted causal DAGs,"Davin Choo, Kirankumar Shiragur",http://arxiv.org/abs/2305.04445,,https://huggingface.co/papers/2305.04445,,,,2305.04445,2,0
1141
  CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms,"Shengyi Huang, Rousslan Fernand Julien Dossa, Chang Ye, Jeff Braga, Dipam Chakraborty, Kinal Mehta, João Madeira Araujo",http://arxiv.org/abs/2111.08819,https://github.com/vwxyzjn/cleanrl,https://huggingface.co/papers/2111.08819,,,,2111.08819,4,1
1142
  Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning,"Wu Lin, Valentin Duruisseaux, Melvin Leok, Frank Nielsen, Khan Emtiyaz, Mark Schmidt",http://arxiv.org/abs/2302.09738,https://github.com/yorkerlin/StructuredNGD-DL,https://huggingface.co/papers/2302.09738,,,,2302.09738,6,0
1143
+ Polarity Is All You Need to Learn and Transfer Faster,"Alice (Qingyang) Wang, Michael Powell, Eric Bridgeford, Ali Geisa, Joshua Vogelstein",http://arxiv.org/abs/2303.17589,,https://huggingface.co/papers/2303.17589,,,,2303.17589,5,1
1144
  Scaling Vision Transformers to 22 Billion Parameters,"Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Collier, Alexey Gritsenko, Vighnesh N Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetic, Dustin Tran, Thomas Kipf, Mario Lucic, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby",http://arxiv.org/abs/2302.05442,,https://huggingface.co/papers/2302.05442,,,,2302.05442,22,0
1145
  Toward Fair and Robust Estimation of Optimal Treatment Regimes,"Kwangho Kim, Jose Zubizarreta",,,,,,,,,
1146
  Internally Rewarded Reinforcement Learning,"Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter",http://arxiv.org/abs/2302.00270,,https://huggingface.co/papers/2302.00270,,,,2302.00270,5,0
 
1154
  Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss,"Pierre Bréchet, Katerina Papagiannouli, Jing An, Guido Montufar",http://arxiv.org/abs/2303.03027,,https://huggingface.co/papers/2303.03027,,,,2303.03027,4,0
1155
  Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach,"Yanwei Jia, Xun Yu Zhou",http://arxiv.org/abs/2108.06655,,https://huggingface.co/papers/2108.06655,,,,2108.06655,2,0
1156
  VIMA: Robot Manipulation with Multimodal Prompts,"Yunfan Jiang, Agrim Gupta, Zichen Zhang, Guanzhi Wang, Yongqiang Dou, Yanjun Chen, Li Fei-Fei, Anima Anandkumar, Yuke Zhu, Jim Fan",,,,,,,,,
1157
+ StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes,"Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry, Sayan Ranu, N M Anoop Krishnan",http://arxiv.org/abs/2301.12477,,https://huggingface.co/papers/2301.12477,,,,2301.12477,5,1
1158
  Multi-agent Online Scheduling: MMS Allocations for Indivisible Items,"Shengwei Zhou, Rufan Bai, Xiaowei Wu",http://arxiv.org/abs/2304.13405,,https://huggingface.co/papers/2304.13405,,,,2304.13405,3,0
1159
  Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries,"Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Soljačić, Akash Srivastava",http://arxiv.org/abs/2303.02484,,https://huggingface.co/papers/2303.02484,,,,2303.02484,8,0
1160
  NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation,"Jianfeng Wang, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Thomas Lukasiewicz",,,,,,,,,
 
1178
  Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process,"Yidong Ouyang, Liyan Xie, Guang Cheng",,,,,,,,,
1179
  MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks,"Wenfang Sun, Yingjun Du, Xiantong Zhen, Fan Wang, Ling Wang, Cees Snoek",http://arxiv.org/abs/2305.10309,,https://huggingface.co/papers/2305.10309,,,,2305.10309,6,0
1180
  Provable Dynamic Fusion for Low-Quality Multimodal Data,"qingyang zhang, Haitao Wu, Changqing Zhang, Qinghua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng",http://arxiv.org/abs/2306.02050,,https://huggingface.co/papers/2306.02050,,,,2306.02050,7,0
1181
+ Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering,"Erlin Pan, zhao kang",http://arxiv.org/abs/2305.02931,,https://huggingface.co/papers/2305.02931,,,,2305.02931,2,1
1182
  SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation,"Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li",http://arxiv.org/abs/2211.14813,https://github.com/ArrowLuo/SegCLIP,https://huggingface.co/papers/2211.14813,,,,2211.14813,5,0
1183
  Explainability as statistical inference,"Hugo Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei",http://arxiv.org/abs/2212.03131,,https://huggingface.co/papers/2212.03131,,,,2212.03131,4,0
1184
  Learning Prescriptive ReLU Networks,"Wei Sun, Asterios Tsiourvas",http://arxiv.org/abs/2306.00651,,https://huggingface.co/papers/2306.00651,,,,2306.00651,2,0
1185
  Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions,"Lin-Han Jia, Lan-Zhe Guo, Zhi Zhou, Jie-Jing Shao, Yuke Xiang, Yu-Feng Li",,,,,,,,,
1186
  Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels,"Simone Bombari, Shayan Kiyani, Marco Mondelli",http://arxiv.org/abs/2302.01629,,https://huggingface.co/papers/2302.01629,,,,2302.01629,3,0
1187
+ Human-Timescale Adaptation in an Open-Ended Task Space,"Jakob Bauer, Kate Baumli, Feryal Behbahani, Avishkar Bhoopchand, Natalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez-Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Satinder Singh, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang",http://arxiv.org/abs/2301.07608,,https://huggingface.co/papers/2301.07608,,,,2301.07608,22,1
1188
  Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Linear Speedup and Partial Participation,"Xiaoyun Li, Ping Li",,,,,,,,,
1189
  ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang",http://arxiv.org/abs/2301.12040,,https://huggingface.co/papers/2301.12040,,,,2301.12040,4,0
1190
  Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
 
1192
  Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
1193
  Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,0
1194
  Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
1195
+ Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,1
1196
  Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
1197
  Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice,"Yishay Mansour, Richard Nock, Robert C. Williamson",,,,,,,,,
1198
  "Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods","Aleksandr Shevchenko, Kevin Kögler, Hamed Hassani, Marco Mondelli",http://arxiv.org/abs/2212.13468,,https://huggingface.co/papers/2212.13468,,,,2212.13468,4,0
 
1203
  HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption,"Seewoo Lee, Garam Lee, Jung Woo Kim, Junbum Shin, Mun-Kyu Lee",,,,,,,,,
1204
  Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations,David Wipf,,,,,,,,,
1205
  Direct Parameterization of Lipschitz-Bounded Deep Networks,"Ruigang Wang, Ian Manchester",http://arxiv.org/abs/2301.11526,https://github.com/acfr/LBDN,https://huggingface.co/papers/2301.11526,,,,2301.11526,2,0
1206
+ XAI Beyond Classification: Interpretable Neural Clustering,"Xi Peng, Yunfan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou",http://arxiv.org/abs/1808.07292,,https://huggingface.co/papers/1808.07292,,,,1808.07292,6,1
1207
  Exploiting locality in high-dimensional Factorial hidden Markov models,"Lorenzo Rimella, Nick Whiteley",http://arxiv.org/abs/1902.01639,,https://huggingface.co/papers/1902.01639,,,,1902.01639,2,0
1208
  Mitigating the Effects of Non-Identifiability on Inference for Bayesian Neural Networks with Latent Variables,"Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez",http://arxiv.org/abs/1911.00569,,https://huggingface.co/papers/1911.00569,,,,1911.00569,3,0
1209
  Project and Forget: Solving Large-Scale Metric Constrained Problems,"Rishi Sonthalia, Anna C. Gilbert",http://arxiv.org/abs/2005.03853,,https://huggingface.co/papers/2005.03853,,,,2005.03853,2,0
1210
  "Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence","Julie Nutini, Issam Laradji, Mark Schmidt",http://arxiv.org/abs/1712.08859,,https://huggingface.co/papers/1712.08859,,,,1712.08859,3,0
1211
+ Cluster-Specific Predictions with Multi-Task Gaussian Processes,"Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey",http://arxiv.org/abs/2011.07866,,https://huggingface.co/papers/2011.07866,,,,2011.07866,4,1
1212
  Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials,"Daiqi Gao, Yufeng Liu, Donglin Zeng",,,,,,,,,
1213
  Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks,"Aleksandr Shevchenko, Vyacheslav Kungurtsev, Marco Mondelli",http://arxiv.org/abs/2111.02278,,https://huggingface.co/papers/2111.02278,,,,2111.02278,3,0
1214
  "Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism","Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos",http://arxiv.org/abs/2012.11579,,https://huggingface.co/papers/2012.11579,,,,2012.11579,4,0
 
1219
  Deep linear networks can benignly overfit when shallow ones do,"Niladri S. Chatterji, Phil Long",http://arxiv.org/abs/2209.09315,,https://huggingface.co/papers/2209.09315,,,,2209.09315,2,0
1220
  Taming graph kernels with random features,Krzysztof Choromanski,http://arxiv.org/abs/2305.00156,,https://huggingface.co/papers/2305.00156,,,,2305.00156,1,0
1221
  On Uni-Modal Feature Learning in Supervised Multi-Modal Learning,"Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, Hang Zhao",http://arxiv.org/abs/2305.01233,,https://huggingface.co/papers/2305.01233,,,,2305.01233,8,0
1222
+ CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations,"Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon",http://arxiv.org/abs/2305.01118,,https://huggingface.co/papers/2305.01118,,,,2305.01118,5,2
1223
  CLIPood: Generalizing CLIP to Out-of-Distributions,"Yang Shu, Xingzhuo Guo, Jialong Wu, Ximei Wang, Jianmin Wang, Mingsheng Long",http://arxiv.org/abs/2302.00864,,https://huggingface.co/papers/2302.00864,,,,2302.00864,6,0
1224
  Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning,"Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, Tarek Abdelzaher, Jiawei Han",http://arxiv.org/abs/2211.03044,,https://huggingface.co/papers/2211.03044,,,,2211.03044,6,1
1225
  Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization,"Jiwoo Son, Minsu Kim, Hyeonah Kim, Jinkyoo Park",,,,,,,,,
 
1251
  Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification,"Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan",,,,,,,,,
1252
  Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least,"Siddharth Joshi, Baharan Mirzasoleiman",http://arxiv.org/abs/2302.09195,,https://huggingface.co/papers/2302.09195,,,,2302.09195,2,0
1253
  Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs,"Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-yi Lee, Shao-Hua Sun",http://arxiv.org/abs/2301.12950,,https://huggingface.co/papers/2301.12950,,,,2301.12950,5,0
1254
+ Cooperative Open-ended Learning Framework for Zero-Shot Coordination,"Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan",http://arxiv.org/abs/2302.04831,,https://huggingface.co/papers/2302.04831,,,,2302.04831,7,1
1255
  CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design,"Desi Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster",,,,,,,,,
1256
  On the Identifiability and Estimation of Causal Location-Scale Noise Models,"Alexander Immer, Christoph Schultheiss, Julia Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx",http://arxiv.org/abs/2210.09054,,https://huggingface.co/papers/2210.09054,,,,2210.09054,6,0
1257
  From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders,"Raanan Yehezkel Rohekar, Shami Nisimov, Yaniv Gurwicz, Gal Novik",http://arxiv.org/abs/2306.00624,,https://huggingface.co/papers/2306.00624,,,,2306.00624,4,0
 
1267
  Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions,"Leo Klarner, Tim G. J. Rudner, Michael Reutlinger, Torsten Schindler, Garrett Morris, Charlotte Deane, Yee-Whye Teh",,,,,,,,,
1268
  SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series,"Iris Huijben, Arthur A. Nijdam, Sebastiaan Overeem, Merel Van Gilst, Ruud J. G. van Sloun",,,,,,,,,
1269
  Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction,"Minghao Guo, Veronika Thost, Samuel Song, Adithya Balachandran, Payel Das, Jie Chen, Wojciech Matusik",,,,,,,,,
1270
+ A Closer Look at the Intervention Procedure of Concept Bottleneck Models,"Sungbin Shin, Yohan Jo, Sungsoo Ahn, Namhoon Lee",http://arxiv.org/abs/2302.14260,,https://huggingface.co/papers/2302.14260,,,,2302.14260,4,1
1271
+ Simple Hardware-Efficient Long Convolutions for Sequence Modeling,"Daniel Y Fu, Elliot L Epstein, Eric Nguyen, Michael Zhang, Tri Dao, Atri Rudra, Christopher Re",http://arxiv.org/abs/2302.06646,,https://huggingface.co/papers/2302.06646,,,,2302.06646,8,1
1272
  Towards Controlled Data Augmentations for Active Learning,"Jianan Yang, Haobo Wang, Sai Wu, Gang Chen, Junbo Zhao",,,,,,,,,
1273
  "Bigger, Better, Faster: Human-level Atari with human-level efficiency","Max Schwarzer, Johan Obando Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro",http://arxiv.org/abs/2305.19452,https://github.com/google-research/google-research/tree/master/bigger_better_faster,https://huggingface.co/papers/2305.19452,,,,2305.19452,6,3
1274
  A Law of Robustness beyond Isoperimetry,"Yihan Wu, Heng Huang, Hongyang Zhang",http://arxiv.org/abs/2202.11592,,https://huggingface.co/papers/2202.11592,,,,2202.11592,3,0
 
1300
  Guiding Pretraining in Reinforcement Learning with Large Language Models,"Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas",http://arxiv.org/abs/2302.06692,,https://huggingface.co/papers/2302.06692,,,,2302.06692,8,0
1301
  PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient,"Kaixin Wang, Zhou Daquan, Jiashi Feng, Shie Mannor",,,,,,,,,
1302
  Differentially Private Sharpness-Aware Training,"Jinseong Park, Hoki Kim, Yujin Choi, Jaewook Lee",http://arxiv.org/abs/2306.05651,https://github.com/jinseongP/DPSAT,https://huggingface.co/papers/2306.05651,,,,2306.05651,4,0
1303
+ Provably and Practically Efficient Neural Contextual Bandits,Sudeep Salgia,http://arxiv.org/abs/2206.00099,,https://huggingface.co/papers/2206.00099,,,,2206.00099,3,1
1304
  How Does Information Bottleneck Help Deep Learning?,"Kenji Kawaguchi, Zhun Deng, Xu Ji, Jiaoyang Huang",http://arxiv.org/abs/2305.18887,https://github.com/xu-ji/information-bottleneck,https://huggingface.co/papers/2305.18887,,,,2305.18887,4,0
1305
  Why Is Public Pretraining Necessary for Private Model Training?,"Arun Ganesh, Mahdi Haghifam, Milad Nasresfahani, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Guha Thakurta, Lun Wang",http://arxiv.org/abs/2302.09483,,https://huggingface.co/papers/2302.09483,,,,2302.09483,8,0
1306
  Learning Instance-Specific Augmentations by Capturing Local Invariances,"Ning Miao, Tom Rainforth, Emile Mathieu, Yann Dubois, Yee-Whye Teh, Adam Foster, Hyunjik Kim",http://arxiv.org/abs/2206.00051,,https://huggingface.co/papers/2206.00051,,,,2206.00051,7,0
 
1360
  Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability,"Jianing Zhu, Hengzhuang Li, Jiangchao Yao, Tongliang Liu, Jianliang Xu, Bo Han",http://arxiv.org/abs/2306.03715,https://github.com/tmlr-group/Unleashing-Mask,https://huggingface.co/papers/2306.03715,,,,2306.03715,6,0
1361
  Conditional Graph Information Bottleneck for Molecular Relational Learning,"Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park",http://arxiv.org/abs/2305.01520,https://github.com/Namkyeong/CGIB,https://huggingface.co/papers/2305.01520,,,,2305.01520,6,0
1362
  Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang",http://arxiv.org/abs/2305.14876,https://github.com/bboylyg/RNP,https://huggingface.co/papers/2305.14876,,,,2305.14876,7,0
1363
+ Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1
1364
  Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
1365
  CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
1366
  Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,0
 
1407
  Bandit Online Linear Optimization with Hints and Queries,"Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit",,,,,,,,,
1408
  Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective,"Tanya Marwah, Zachary Lipton, Jianfeng Lu, Andrej Risteski",http://arxiv.org/abs/2210.12101,,https://huggingface.co/papers/2210.12101,,,,2210.12101,4,0
1409
  Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise,"Shiwei Zeng, Jie Shen",http://arxiv.org/abs/2306.00673,,https://huggingface.co/papers/2306.00673,,,,2306.00673,2,0
1410
+ Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes,"seyed amir saberi, Amir Najafi, Abolfazl Motahari, Babak Khalaj",http://arxiv.org/abs/2209.05953,,https://huggingface.co/papers/2209.05953,,,,2209.05953,4,1
1411
  "Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps","Marco Cuturi, Michal Klein, Pierre Ablin",http://arxiv.org/abs/2302.04065,,https://huggingface.co/papers/2302.04065,,,,2302.04065,3,0
1412
  Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance,"Zhao Song, Xin Yang, Yuanyuan Yang, Lichen Zhang",http://arxiv.org/abs/2210.11542,,https://huggingface.co/papers/2210.11542,,,,2210.11542,4,0
1413
  Combinatorial Neural Bandits,"Taehyun Hwang, Kyuwook Chai, Min-hwan Oh",http://arxiv.org/abs/2306.00242,,https://huggingface.co/papers/2306.00242,,,,2306.00242,3,0
1414
  Reward-Mixing MDPs with Few Contexts are Learnable,"Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor",,,,,,,,,
1415
+ Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling,"Adam Bouland, Yosheb Getachew, Yujia Jin, Aaron Sidford, Kevin Tian",http://arxiv.org/abs/2301.03763,,https://huggingface.co/papers/2301.03763,,,,2301.03763,5,1
1416
  Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits,Chen Wang,http://arxiv.org/abs/2306.02208,,https://huggingface.co/papers/2306.02208,,,,2306.02208,1,0
1417
  Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation,"Li'ang Li, Yifei duan, Guanghua Ji, Yongqiang Cai",http://arxiv.org/abs/2305.18460,,https://huggingface.co/papers/2305.18460,,,,2305.18460,4,0
1418
  Dynamical Linear Bandits,"Marco Mussi, Alberto Maria Metelli, Marcello Restelli",http://arxiv.org/abs/2211.08997,,https://huggingface.co/papers/2211.08997,,,,2211.08997,3,0
 
1497
  Neural networks trained with SGD learn distributions of increasing complexity,"Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt",http://arxiv.org/abs/2211.11567,,https://huggingface.co/papers/2211.11567,,,,2211.11567,3,0
1498
  Scaling Laws for Multilingual Neural Machine Translation,"Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia, Markus Freitag, Orhan Firat",http://arxiv.org/abs/2302.09650,,https://huggingface.co/papers/2302.09650,,,,2302.09650,5,0
1499
  Explaining the effects of non-convergent MCMC in the training of Energy-Based Models,"Elisabeth Agoritsas, Giovanni Catania, Aurélien Decelle, Beatriz Seoane",,,,,,,,,
1500
+ A Three-regime Model of Network Pruning,"Yefan Zhou, Yaoqing Yang, Arin Chang, Michael Mahoney",http://arxiv.org/abs/2305.18383,,https://huggingface.co/papers/2305.18383,,,,2305.18383,4,1
1501
  Metagenomic Binning using Connectivity-constrained Variational Autoencoders,"Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas D. Nielsen",,,,,,,,,
1502
  SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning,"Dongseok Shim, Seungjae Lee, H. Kim",http://arxiv.org/abs/2301.11520,,https://huggingface.co/papers/2301.11520,,,,2301.11520,3,0
1503
  Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation,"Qianru Zhang, Chao Huang, Lianghao Xia, Zheng Wang, Siu Ming Yiu, Ruihua Han",,,,,,,,,
 
1621
  Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
1622
  On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
1623
  Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,0
1624
+ A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,1
1625
  Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
1626
  Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,0
1627
  On the Robustness of Randomized Ensembles to Adversarial Perturbations,"Hassan Dbouk, Naresh Shanbhag",http://arxiv.org/abs/2302.01375,https://github.com/hsndbk4/BARRE,https://huggingface.co/papers/2302.01375,,,,2302.01375,2,0
 
1647
  On the Complexity of Bayesian Generalization,"Yu-Zhe Shi, Manjie Xu, John Hopcroft, Kun He, Josh Tenenbaum, Song-Chun Zhu, Ying Nian Wu, Wenjuan Han, Yixin Zhu",http://arxiv.org/abs/2211.11033,,https://huggingface.co/papers/2211.11033,,,,2211.11033,9,0
1648
  QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark,"Xudong Lu, Kaisen Pan, Ge Yan, Jiaming Shan, Wenjie Wu, Junchi Yan",,,,,,,,,
1649
  Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits,Markus Bläser,,,,,,,,,
1650
+ PAL: Program-aided Language Models,"Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig",http://arxiv.org/abs/2211.10435,,https://huggingface.co/papers/2211.10435,,,,2211.10435,8,2
1651
  Tighter Bounds on the Expressivity of Transformer Encoders,"David Chiang, Peter Cholak, Anand Pillay",http://arxiv.org/abs/2301.10743,,https://huggingface.co/papers/2301.10743,,,,2301.10743,3,0
1652
  Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation,"Joonhyuk Yang, Shin Dongpil, Hye Won Chung",http://arxiv.org/abs/2305.19666,,https://huggingface.co/papers/2305.19666,,,,2305.19666,3,0
1653
  Causal Discovery with Latent Confounders Based on Higher-Order Cumulants,"Ruichu Cai, Zhiyi Huang, Wei Chen, Zhifeng Hao, Kun Zhang",http://arxiv.org/abs/2305.19582,,https://huggingface.co/papers/2305.19582,,,,2305.19582,5,0
 
1685
  Learning Representations without Compositional Assumptions,"Tennison Liu, Jeroen Berrevoets, Zhaozhi Qian, Mihaela van der Schaar",http://arxiv.org/abs/2305.19726,,https://huggingface.co/papers/2305.19726,,,,2305.19726,4,0
1686
  Making Transformers Compute-lite for CPU inference,"Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh",,,,,,,,,
1687
  Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers,"Grant Strimel, Yi Xie, Brian King, martin radfar, Ariya Rastrow, Athanasios Mouchtaris",http://arxiv.org/abs/2305.04159,,https://huggingface.co/papers/2305.04159,,,,2305.04159,6,0
1688
+ Expected Gradients of Maxout Networks and Consequences to Parameter Initialization,"Hanna Tseran, Guido Montufar",http://arxiv.org/abs/2301.06956,,https://huggingface.co/papers/2301.06956,,,,2301.06956,2,1
1689
  Competing for Shareable Arms in Multi-Player Multi-Armed Bandits,"Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui",http://arxiv.org/abs/2305.19158,,https://huggingface.co/papers/2305.19158,,,,2305.19158,5,1
1690
  Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs,"Raif Rustamov, Subhabrata Majumdar",http://arxiv.org/abs/2010.15285,,https://huggingface.co/papers/2010.15285,,,,2010.15285,2,1
1691
  Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets,"Yurong Chen, Zhaohua Chen, Xiaotie Deng, Zhijian Duan, Haoran Sun, Qian Wang, Xiang Yan",http://arxiv.org/abs/2306.07709,,https://huggingface.co/papers/2306.07709,,,,2306.07709,7,0
 
1699
  One-Step Estimator for Permuted Sparse Recovery,"Hang Zhang, Ping Li",,,,,,,,,
1700
  Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator,Alexander Shekhovtsov,,,,,,,,,
1701
  Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection,"Lorenzo Perini, Paul Buerkner, Arto Klami",http://arxiv.org/abs/2210.10487,,https://huggingface.co/papers/2210.10487,,,,2210.10487,3,0
1702
+ Image generation with shortest path diffusion,"Ayan Das, Stathi Fotiadis, Anil Batra, Farhang Nabiei, FengTing Liao, Sattar Vakili, Da-shan Shiu, Alberto Bernacchia",http://arxiv.org/abs/2306.00501,,https://huggingface.co/papers/2306.00501,,,,2306.00501,8,2
1703
  Deep Anomaly Detection under Labeling Budget Constraints,"Aodong Li, Chen Qiu, Padhraic Smyth, Marius Kloft, Stephan Mandt, Maja Rudolph",http://arxiv.org/abs/2302.07832,,https://huggingface.co/papers/2302.07832,,,,2302.07832,6,0
1704
  Transformed Distribution Matching for Missing Value Imputation,"He Zhao, Ke Sun, Amir Dezfouli, Edwin V Bonilla",http://arxiv.org/abs/2302.10363,,https://huggingface.co/papers/2302.10363,,,,2302.10363,4,0
1705
  Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?,"Ruisi Cai, Zhenyu Zhang, Zhangyang “Atlas” Wang",http://arxiv.org/abs/2302.12480,,https://huggingface.co/papers/2302.12480,,,,2302.12480,3,0
1706
  Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models,"Nikhil Kandpal, Brian Lester, Mohammed Muqeeth, Anisha Mascarenhas, Monty Evans, Vishal Baskaran, Tenghao Huang, Haokun Liu, Colin Raffel",,,,,,,,,
1707
+ Better Diffusion Models Further Improve Adversarial Training,"Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng YAN",http://arxiv.org/abs/2302.04638,https://github.com/wzekai99/DM-Improves-AT,https://huggingface.co/papers/2302.04638,,,,2302.04638,6,1
1708
  On the Expressive Power of Geometric Graph Neural Networks,"Chaitanya Joshi, Cristian Bodnar, Simon Mathis, Taco Cohen, Pietro Lió",http://arxiv.org/abs/2301.09308,https://github.com/chaitjo/geometric-gnn-dojo,https://huggingface.co/papers/2301.09308,,,,2301.09308,5,0
1709
  Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapalli,http://arxiv.org/abs/2306.04004,,https://huggingface.co/papers/2306.04004,,,,2306.04004,1,1
1710
  Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,0
 
1780
  AbODE: Ab initio antibody design using conjoined ODEs,"Yogesh Verma, Markus Heinonen, Vikas K Garg",http://arxiv.org/abs/2306.01005,,https://huggingface.co/papers/2306.01005,,,,2306.01005,3,0
1781
  Learning-augmented private algorithms for multiple quantile release,"Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii",http://arxiv.org/abs/2210.11222,,https://huggingface.co/papers/2210.11222,,,,2210.11222,4,0
1782
  Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds,"Shengshi Li, Lin Yang",,,,,,,,,
1783
+ Variational Autoencoding Neural Operators,"Jacob H. Seidman, Georgios Kissas, George J. Pappas, Paris Perdikaris",http://arxiv.org/abs/2302.10351,,https://huggingface.co/papers/2302.10351,,,,2302.10351,4,1
1784
  Efficient Parametric Approximations of Neural Network Function Space Distance,"Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse",http://arxiv.org/abs/2302.03519,,https://huggingface.co/papers/2302.03519,,,,2302.03519,4,0
1785
  Theory on Forgetting and Generalization of Continual Learning,"Sen Lin, Peizhong Ju, Yingbin LIANG, Ness Shroff",http://arxiv.org/abs/2302.05836,,https://huggingface.co/papers/2302.05836,,,,2302.05836,4,0
1786
  Trapdoor Normalization with Irreversible Ownership Verification,"Hanwen Liu, Zhenyu Weng, Yuesheng Zhu, Yadong Mu",,,,,,,,,