hysts HF Staff committed on
Commit 44b9f70 · 1 Parent(s): 8cca298

commit files to HF hub

Files changed (1)
  1. papers.csv +9 -9
papers.csv CHANGED
@@ -433,7 +433,7 @@ Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively
433
  Efficient List-Decodable Regression using Batches,"Abhimanyu Das, Ayush Jain, Weihao Kong, Rajat Sen",http://arxiv.org/abs/2211.12743,,https://huggingface.co/papers/2211.12743,,,,2211.12743,4,0
434
  Proper Scoring Rules for Survival Analysis,Hiroki Yanagisawa,http://arxiv.org/abs/2305.00621,,https://huggingface.co/papers/2305.00621,,,,2305.00621,1,0
435
  GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks,"Yuwen Li, Miao Xiong, Bryan Hooi",http://arxiv.org/abs/2306.00015,,https://huggingface.co/papers/2306.00015,,,,2306.00015,3,0
436
- Large Language Models Can Be Easily Distracted by Irrelevant Context,"Haoyue Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou",http://arxiv.org/abs/2302.00093,,https://huggingface.co/papers/2302.00093,,,,2302.00093,8,0
437
  Temporally Consistent Transformers for Video Generation,"Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel",http://arxiv.org/abs/2210.02396,,https://huggingface.co/papers/2210.02396,,,,2210.02396,4,1
438
  Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback,"Wonyoung Kim, Garud Iyengar, Assaf Zeevi",http://arxiv.org/abs/2301.13791,,https://huggingface.co/papers/2301.13791,,,,2301.13791,3,0
439
  Scaling Laws for Generative Mixed-Modal Language Models,"Armen Aghajanyan, LILI YU, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, Susan Zhang, Stephen Roller, Naman Goyal, Omer Levy, Luke Zettlemoyer",http://arxiv.org/abs/2301.03728,,https://huggingface.co/papers/2301.03728,,,,2301.03728,10,0
@@ -557,7 +557,7 @@ Data Poisoning Attacks Against Multimodal Encoders,"Ziqing Yang, Xinlei He, Zhen
557
  FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation,"Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",,,,,,,,,
558
  Certified Robust Neural Networks: Generalization and Corruption Resistance,"Amine Bennouna, Ryan Lucas, Bart Van Parys",http://arxiv.org/abs/2303.02251,https://github.com/RyanLucas3/HR_Neural_Networks,https://huggingface.co/papers/2303.02251,,,,2303.02251,3,1
559
  "Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective","Michael Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel",,,,,,,,,
560
- Anti-Exploration by Random Network Distillation,"Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov",http://arxiv.org/abs/2301.13616,,https://huggingface.co/papers/2301.13616,,,,2301.13616,4,0
561
  Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes,"Liam Hodgkinson, Chris van der Heide, Fred Roosta, Michael Mahoney",http://arxiv.org/abs/2210.07612,,https://huggingface.co/papers/2210.07612,,,,2210.07612,4,1
562
  Sampling-Based Accuracy Testing of Posterior Estimators for General Inference,"Pablo Lemos, Adam Coogan, Laurence Perreault-Levasseur, Yashar Hezaveh",http://arxiv.org/abs/2302.03026,,https://huggingface.co/papers/2302.03026,,,,2302.03026,4,1
563
  Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models,"Parth Sangani, Arjun Kashettiwar, Pritish Chakraborty, Bhuvan Gangula, Sivasubramanian Durga, Ganesh Ramakrishnan, Rishabh Iyer, Abir De",,,,,,,,,
@@ -605,7 +605,7 @@ Hypothesis Transfer Learning with Surrogate Classification Losses: Generalizat
605
  Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows,"Seobin Park, Dongjin Kim, Sungyong Baik, Tae Hyun Kim",,,,,,,,,
606
  Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction,"Goergii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets",http://arxiv.org/abs/2202.00441,,https://huggingface.co/papers/2202.00441,,,,2202.00441,6,0
607
  In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Boudiaf, tom denton, Bart van Merrienboer, Vincent Dumoulin, Eleni Triantafillou",http://arxiv.org/abs/2302.06658,,https://huggingface.co/papers/2302.06658,,,,2302.06658,5,1
608
- GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,1
609
  Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
610
  Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
611
  SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
@@ -724,7 +724,7 @@ Training-Free Neural Active Learning with Initialization-Robustness Guarantees,"
724
  Unit Scaling: Out-of-the-Box Low-Precision Training,"Charlie Blake, Charlie Blake, Douglas Orr, Carlo Luschi",http://arxiv.org/abs/2303.11257,,https://huggingface.co/papers/2303.11257,,,,2303.11257,3,2
725
  NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data,"LIU SONGMING, Zhongkai Hao, Chengyang Ying, Hang Su, Ze Cheng, Jun Zhu",http://arxiv.org/abs/2305.18694,https://github.com/thu-ml/NUNO,https://huggingface.co/papers/2305.18694,,,,2305.18694,6,0
726
  Diffusion Models for Offline Black-Box Optimization,"Siddarth Krishnamoorthy, Satvik Mashkaria, Aditya Grover",,,,,,,,,
727
- The Flan Collection: Designing Data and Methods for Effective Instruction Tuning,"Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc Le, Barret Zoph, Jason Wei, Adam Roberts",http://arxiv.org/abs/2301.13688,https://github.com/google-research/FLAN/tree/main/flan/v2,https://huggingface.co/papers/2301.13688,,,,2301.13688,11,1
728
  Compositional Score Modeling for Simulation-Based Inference,"Tomas Geffner, George Papamakarios, Andriy Mnih",http://arxiv.org/abs/2209.14249,,https://huggingface.co/papers/2209.14249,,,,2209.14249,3,0
729
  Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeyev, Chenlai Shi, Yuhao Tan, Kseniia Dudnyk, Jian Zhou",http://arxiv.org/abs/2305.10699,,https://huggingface.co/papers/2305.10699,,,,2305.10699,5,0
730
  Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
@@ -1099,7 +1099,7 @@ High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors
1099
  COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
1100
  Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,7
1101
  HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
1102
- Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,0
1103
  Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
1104
  Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization,"Ameya Velingker, Maximilian Vötsch, David Woodruff, Samson Zhou",,,,,,,,,
1105
  Exphormer: Sparse Transformers for Graphs,"Hamed Shirzad, Ameya Velingker, Balaji Venkatachalam, Danica J Sutherland, Ali K Sinop",http://arxiv.org/abs/2303.06147,https://github.com/hamed1375/Exphormer,https://huggingface.co/papers/2303.06147,,,,2303.06147,5,1
@@ -1190,7 +1190,7 @@ ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Mingh
1190
  Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
1191
  Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wang, Sen Lin, Junshan Zhang",,,,,,,,,
1192
  Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
1193
- Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,0
1194
  Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
1195
  Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,1
1196
  Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
@@ -1363,7 +1363,7 @@ Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingju
1363
  Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1
1364
  Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
1365
  CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
1366
- Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,0
1367
  Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL,"Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez",,,,,,,,,
1368
  A Statistical Perspective on Retrieval-Based Models,"Soumya Basu, Ankit Singh Rawat, Manzil Zaheer",,,,,,,,,
1369
  PFNs4BO: Meta-Learning the surrogate model for Bayesian optimization from scratch using Transformers,"Samuel Gabriel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter",,,,,,,,,
@@ -1429,7 +1429,7 @@ Regularization-free Diffeomorphic Temporal Alignment Nets,"Ron Shapira Weber, Or
1429
  Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes,"Nico Stucki, Johannes C. Paetzold, Suprosanna Shit, bjoern menze, Ulrich Bauer",http://arxiv.org/abs/2211.15272,https://github.com/nstucki/Betti-matching,https://huggingface.co/papers/2211.15272,,,,2211.15272,5,0
1430
  FedDisco: Federated Learning with Discrepancy-Aware Collaboration,"Rui Ye, Mingkai Xu, Jianyu Wang, Chenxin Xu, Siheng Chen, Yan-Feng Wang",http://arxiv.org/abs/2305.19229,https://github.com/MediaBrain-SJTU/FedDisco,https://huggingface.co/papers/2305.19229,,,,2305.19229,6,0
1431
  Personalized Federated Learning with Inferred Collaboration Graphs,"Rui Ye, Zhenyang Ni, Fangzhao Wu, Siheng Chen, Yan-Feng Wang",,,,,,,,,
1432
- ModelDiff: A Framework for Comparing Learning Algorithms,"Harshay Shah, Sung Min (Sam) Park, Andrew Ilyas, Aleksander Madry",http://arxiv.org/abs/2211.12491,https://github.com/MadryLab/modeldiff,https://huggingface.co/papers/2211.12491,,,,2211.12491,4,0
1433
  Half-Hop: A graph upsampling approach for slowing down message passing,"Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Veličković, Eva Dyer",,,,,,,,,
1434
  Structural Re-weighting Improves Graph Domain Adaptation,"Shikun Liu, Tianchun Li, Yongbin Feng, Nhan Tran, Han Zhao, Qiang Qiu, Pan Li, Pan Li",http://arxiv.org/abs/2306.03221,,https://huggingface.co/papers/2306.03221,,,,2306.03221,7,0
1435
  InfoOT: Information Maximizing Optimal Transport,"Ching-Yao Chuang, Stefanie Jegelka, David Alvarez-Melis",http://arxiv.org/abs/2210.03164,,https://huggingface.co/papers/2210.03164,,,,2210.03164,3,0
@@ -1710,7 +1710,7 @@ Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapa
1710
  Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,1
1711
  Hierarchical Diffusion for Offline Decision Making,"Wenhao Li, Xiangfeng Wang, Bo Jin, Hongyuan Zha",,,,,,,,,
1712
  Generated Graph Detection,"Yihan Ma, Zhikun Zhang, Ning Yu, Xinlei He, Michael Backes, Yun Shen, Yang Zhang",http://arxiv.org/abs/2306.07758,,https://huggingface.co/papers/2306.07758,,,,2306.07758,7,0
1713
- Variational Open-Domain Question Answering,"Valentin Liévin, Andreas Geert Motzfeldt, Ida Jensen, Ole Winther",http://arxiv.org/abs/2210.06345,,https://huggingface.co/papers/2210.06345,,,,2210.06345,4,0
1714
  PromptBoosting: Black-Box Text Classification with Ten Forward Passes,"Bairu Hou, Joe O'Connor, Jacob Andreas, Shiyu Chang, Yang Zhang",http://arxiv.org/abs/2212.09257,,https://huggingface.co/papers/2212.09257,,,,2212.09257,5,0
1715
  Gradient-Free Structured Pruning with Unlabeled Data,"Azade Nova, Hanjun Dai, Dale Schuurmans",http://arxiv.org/abs/2303.04185,,https://huggingface.co/papers/2303.04185,,,,2303.04185,3,0
1716
  Text-To-Concept (and Back) via Cross-Model Alignment,"Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi",http://arxiv.org/abs/2305.06386,,https://huggingface.co/papers/2305.06386,,,,2305.06386,4,2
 
433
  Efficient List-Decodable Regression using Batches,"Abhimanyu Das, Ayush Jain, Weihao Kong, Rajat Sen",http://arxiv.org/abs/2211.12743,,https://huggingface.co/papers/2211.12743,,,,2211.12743,4,0
434
  Proper Scoring Rules for Survival Analysis,Hiroki Yanagisawa,http://arxiv.org/abs/2305.00621,,https://huggingface.co/papers/2305.00621,,,,2305.00621,1,0
435
  GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks,"Yuwen Li, Miao Xiong, Bryan Hooi",http://arxiv.org/abs/2306.00015,,https://huggingface.co/papers/2306.00015,,,,2306.00015,3,0
436
+ Large Language Models Can Be Easily Distracted by Irrelevant Context,"Haoyue Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou",http://arxiv.org/abs/2302.00093,,https://huggingface.co/papers/2302.00093,,,,2302.00093,8,1
437
  Temporally Consistent Transformers for Video Generation,"Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel",http://arxiv.org/abs/2210.02396,,https://huggingface.co/papers/2210.02396,,,,2210.02396,4,1
438
  Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback,"Wonyoung Kim, Garud Iyengar, Assaf Zeevi",http://arxiv.org/abs/2301.13791,,https://huggingface.co/papers/2301.13791,,,,2301.13791,3,0
439
  Scaling Laws for Generative Mixed-Modal Language Models,"Armen Aghajanyan, LILI YU, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, Susan Zhang, Stephen Roller, Naman Goyal, Omer Levy, Luke Zettlemoyer",http://arxiv.org/abs/2301.03728,,https://huggingface.co/papers/2301.03728,,,,2301.03728,10,0
 
557
  FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation,"Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",,,,,,,,,
558
  Certified Robust Neural Networks: Generalization and Corruption Resistance,"Amine Bennouna, Ryan Lucas, Bart Van Parys",http://arxiv.org/abs/2303.02251,https://github.com/RyanLucas3/HR_Neural_Networks,https://huggingface.co/papers/2303.02251,,,,2303.02251,3,1
559
  "Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective","Michael Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel",,,,,,,,,
560
+ Anti-Exploration by Random Network Distillation,"Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov",http://arxiv.org/abs/2301.13616,,https://huggingface.co/papers/2301.13616,,,,2301.13616,4,2
561
  Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes,"Liam Hodgkinson, Chris van der Heide, Fred Roosta, Michael Mahoney",http://arxiv.org/abs/2210.07612,,https://huggingface.co/papers/2210.07612,,,,2210.07612,4,1
562
  Sampling-Based Accuracy Testing of Posterior Estimators for General Inference,"Pablo Lemos, Adam Coogan, Laurence Perreault-Levasseur, Yashar Hezaveh",http://arxiv.org/abs/2302.03026,,https://huggingface.co/papers/2302.03026,,,,2302.03026,4,1
563
  Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models,"Parth Sangani, Arjun Kashettiwar, Pritish Chakraborty, Bhuvan Gangula, Sivasubramanian Durga, Ganesh Ramakrishnan, Rishabh Iyer, Abir De",,,,,,,,,
 
605
  Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows,"Seobin Park, Dongjin Kim, Sungyong Baik, Tae Hyun Kim",,,,,,,,,
606
  Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction,"Goergii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets",http://arxiv.org/abs/2202.00441,,https://huggingface.co/papers/2202.00441,,,,2202.00441,6,0
607
  In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Boudiaf, tom denton, Bart van Merrienboer, Vincent Dumoulin, Eleni Triantafillou",http://arxiv.org/abs/2302.06658,,https://huggingface.co/papers/2302.06658,,,,2302.06658,5,1
608
+ GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,2
609
  Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
610
  Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
611
  SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
 
724
  Unit Scaling: Out-of-the-Box Low-Precision Training,"Charlie Blake, Charlie Blake, Douglas Orr, Carlo Luschi",http://arxiv.org/abs/2303.11257,,https://huggingface.co/papers/2303.11257,,,,2303.11257,3,2
725
  NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data,"LIU SONGMING, Zhongkai Hao, Chengyang Ying, Hang Su, Ze Cheng, Jun Zhu",http://arxiv.org/abs/2305.18694,https://github.com/thu-ml/NUNO,https://huggingface.co/papers/2305.18694,,,,2305.18694,6,0
726
  Diffusion Models for Offline Black-Box Optimization,"Siddarth Krishnamoorthy, Satvik Mashkaria, Aditya Grover",,,,,,,,,
727
+ The Flan Collection: Designing Data and Methods for Effective Instruction Tuning,"Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc Le, Barret Zoph, Jason Wei, Adam Roberts",http://arxiv.org/abs/2301.13688,https://github.com/google-research/FLAN/tree/main/flan/v2,https://huggingface.co/papers/2301.13688,,,,2301.13688,11,2
728
  Compositional Score Modeling for Simulation-Based Inference,"Tomas Geffner, George Papamakarios, Andriy Mnih",http://arxiv.org/abs/2209.14249,,https://huggingface.co/papers/2209.14249,,,,2209.14249,3,0
729
  Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeyev, Chenlai Shi, Yuhao Tan, Kseniia Dudnyk, Jian Zhou",http://arxiv.org/abs/2305.10699,,https://huggingface.co/papers/2305.10699,,,,2305.10699,5,0
730
  Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
 
1099
  COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
1100
  Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,7
1101
  HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
1102
+ Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,1
1103
  Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
1104
  Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization,"Ameya Velingker, Maximilian Vötsch, David Woodruff, Samson Zhou",,,,,,,,,
1105
  Exphormer: Sparse Transformers for Graphs,"Hamed Shirzad, Ameya Velingker, Balaji Venkatachalam, Danica J Sutherland, Ali K Sinop",http://arxiv.org/abs/2303.06147,https://github.com/hamed1375/Exphormer,https://huggingface.co/papers/2303.06147,,,,2303.06147,5,1
 
1190
  Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
1191
  Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wang, Sen Lin, Junshan Zhang",,,,,,,,,
1192
  Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
1193
+ Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,1
1194
  Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
1195
  Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,1
1196
  Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
 
1363
  Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1
1364
  Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
1365
  CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
1366
+ Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,1
1367
  Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL,"Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez",,,,,,,,,
1368
  A Statistical Perspective on Retrieval-Based Models,"Soumya Basu, Ankit Singh Rawat, Manzil Zaheer",,,,,,,,,
1369
  PFNs4BO: Meta-Learning the surrogate model for Bayesian optimization from scratch using Transformers,"Samuel Gabriel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter",,,,,,,,,
 
1429
  Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes,"Nico Stucki, Johannes C. Paetzold, Suprosanna Shit, bjoern menze, Ulrich Bauer",http://arxiv.org/abs/2211.15272,https://github.com/nstucki/Betti-matching,https://huggingface.co/papers/2211.15272,,,,2211.15272,5,0
1430
  FedDisco: Federated Learning with Discrepancy-Aware Collaboration,"Rui Ye, Mingkai Xu, Jianyu Wang, Chenxin Xu, Siheng Chen, Yan-Feng Wang",http://arxiv.org/abs/2305.19229,https://github.com/MediaBrain-SJTU/FedDisco,https://huggingface.co/papers/2305.19229,,,,2305.19229,6,0
1431
  Personalized Federated Learning with Inferred Collaboration Graphs,"Rui Ye, Zhenyang Ni, Fangzhao Wu, Siheng Chen, Yan-Feng Wang",,,,,,,,,
1432
+ ModelDiff: A Framework for Comparing Learning Algorithms,"Harshay Shah, Sung Min (Sam) Park, Andrew Ilyas, Aleksander Madry",http://arxiv.org/abs/2211.12491,https://github.com/MadryLab/modeldiff,https://huggingface.co/papers/2211.12491,,,,2211.12491,4,1
1433
  Half-Hop: A graph upsampling approach for slowing down message passing,"Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Veličković, Eva Dyer",,,,,,,,,
1434
  Structural Re-weighting Improves Graph Domain Adaptation,"Shikun Liu, Tianchun Li, Yongbin Feng, Nhan Tran, Han Zhao, Qiang Qiu, Pan Li, Pan Li",http://arxiv.org/abs/2306.03221,,https://huggingface.co/papers/2306.03221,,,,2306.03221,7,0
1435
  InfoOT: Information Maximizing Optimal Transport,"Ching-Yao Chuang, Stefanie Jegelka, David Alvarez-Melis",http://arxiv.org/abs/2210.03164,,https://huggingface.co/papers/2210.03164,,,,2210.03164,3,0
 
1710
  Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,1
1711
  Hierarchical Diffusion for Offline Decision Making,"Wenhao Li, Xiangfeng Wang, Bo Jin, Hongyuan Zha",,,,,,,,,
1712
  Generated Graph Detection,"Yihan Ma, Zhikun Zhang, Ning Yu, Xinlei He, Michael Backes, Yun Shen, Yang Zhang",http://arxiv.org/abs/2306.07758,,https://huggingface.co/papers/2306.07758,,,,2306.07758,7,0
1713
+ Variational Open-Domain Question Answering,"Valentin Liévin, Andreas Geert Motzfeldt, Ida Jensen, Ole Winther",http://arxiv.org/abs/2210.06345,,https://huggingface.co/papers/2210.06345,,,,2210.06345,4,2
1714
  PromptBoosting: Black-Box Text Classification with Ten Forward Passes,"Bairu Hou, Joe O'Connor, Jacob Andreas, Shiyu Chang, Yang Zhang",http://arxiv.org/abs/2212.09257,,https://huggingface.co/papers/2212.09257,,,,2212.09257,5,0
1715
  Gradient-Free Structured Pruning with Unlabeled Data,"Azade Nova, Hanjun Dai, Dale Schuurmans",http://arxiv.org/abs/2303.04185,,https://huggingface.co/papers/2303.04185,,,,2303.04185,3,0
1716
  Text-To-Concept (and Back) via Cross-Model Alignment,"Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi",http://arxiv.org/abs/2305.06386,,https://huggingface.co/papers/2305.06386,,,,2305.06386,4,2