Spaces:
Sleeping
Sleeping
commit files to HF hub
Browse files- papers.csv +9 -9
papers.csv
CHANGED
@@ -433,7 +433,7 @@ Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively
|
|
433 |
Efficient List-Decodable Regression using Batches,"Abhimanyu Das, Ayush Jain, Weihao Kong, Rajat Sen",http://arxiv.org/abs/2211.12743,,https://huggingface.co/papers/2211.12743,,,,2211.12743,4,0
|
434 |
Proper Scoring Rules for Survival Analysis,Hiroki Yanagisawa,http://arxiv.org/abs/2305.00621,,https://huggingface.co/papers/2305.00621,,,,2305.00621,1,0
|
435 |
GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks,"Yuwen Li, Miao Xiong, Bryan Hooi",http://arxiv.org/abs/2306.00015,,https://huggingface.co/papers/2306.00015,,,,2306.00015,3,0
|
436 |
-
Large Language Models Can Be Easily Distracted by Irrelevant Context,"Haoyue Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou",http://arxiv.org/abs/2302.00093,,https://huggingface.co/papers/2302.00093,,,,2302.00093,8,
|
437 |
Temporally Consistent Transformers for Video Generation,"Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel",http://arxiv.org/abs/2210.02396,,https://huggingface.co/papers/2210.02396,,,,2210.02396,4,1
|
438 |
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback,"Wonyoung Kim, Garud Iyengar, Assaf Zeevi",http://arxiv.org/abs/2301.13791,,https://huggingface.co/papers/2301.13791,,,,2301.13791,3,0
|
439 |
Scaling Laws for Generative Mixed-Modal Language Models,"Armen Aghajanyan, LILI YU, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, Susan Zhang, Stephen Roller, Naman Goyal, Omer Levy, Luke Zettlemoyer",http://arxiv.org/abs/2301.03728,,https://huggingface.co/papers/2301.03728,,,,2301.03728,10,0
|
@@ -557,7 +557,7 @@ Data Poisoning Attacks Against Multimodal Encoders,"Ziqing Yang, Xinlei He, Zhen
|
|
557 |
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation,"Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",,,,,,,,,
|
558 |
Certified Robust Neural Networks: Generalization and Corruption Resistance,"Amine Bennouna, Ryan Lucas, Bart Van Parys",http://arxiv.org/abs/2303.02251,https://github.com/RyanLucas3/HR_Neural_Networks,https://huggingface.co/papers/2303.02251,,,,2303.02251,3,1
|
559 |
"Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective","Michael Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel",,,,,,,,,
|
560 |
-
Anti-Exploration by Random Network Distillation,"Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov",http://arxiv.org/abs/2301.13616,,https://huggingface.co/papers/2301.13616,,,,2301.13616,4,
|
561 |
Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes,"Liam Hodgkinson, Chris van der Heide, Fred Roosta, Michael Mahoney",http://arxiv.org/abs/2210.07612,,https://huggingface.co/papers/2210.07612,,,,2210.07612,4,1
|
562 |
Sampling-Based Accuracy Testing of Posterior Estimators for General Inference,"Pablo Lemos, Adam Coogan, Laurence Perreault-Levasseur, Yashar Hezaveh",http://arxiv.org/abs/2302.03026,,https://huggingface.co/papers/2302.03026,,,,2302.03026,4,1
|
563 |
Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models,"Parth Sangani, Arjun Kashettiwar, Pritish Chakraborty, Bhuvan Gangula, Sivasubramanian Durga, Ganesh Ramakrishnan, Rishabh Iyer, Abir De",,,,,,,,,
|
@@ -605,7 +605,7 @@ Hypothesis Transfer Learning with Surrogate Classification Losses: Generalizat
|
|
605 |
Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows,"Seobin Park, Dongjin Kim, Sungyong Baik, Tae Hyun Kim",,,,,,,,,
|
606 |
Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction,"Goergii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets",http://arxiv.org/abs/2202.00441,,https://huggingface.co/papers/2202.00441,,,,2202.00441,6,0
|
607 |
In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Boudiaf, tom denton, Bart van Merrienboer, Vincent Dumoulin, Eleni Triantafillou",http://arxiv.org/abs/2302.06658,,https://huggingface.co/papers/2302.06658,,,,2302.06658,5,1
|
608 |
-
GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,
|
609 |
Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
|
610 |
Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
|
611 |
SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
|
@@ -724,7 +724,7 @@ Training-Free Neural Active Learning with Initialization-Robustness Guarantees,"
|
|
724 |
Unit Scaling: Out-of-the-Box Low-Precision Training,"Charlie Blake, Charlie Blake, Douglas Orr, Carlo Luschi",http://arxiv.org/abs/2303.11257,,https://huggingface.co/papers/2303.11257,,,,2303.11257,3,2
|
725 |
NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data,"LIU SONGMING, Zhongkai Hao, Chengyang Ying, Hang Su, Ze Cheng, Jun Zhu",http://arxiv.org/abs/2305.18694,https://github.com/thu-ml/NUNO,https://huggingface.co/papers/2305.18694,,,,2305.18694,6,0
|
726 |
Diffusion Models for Offline Black-Box Optimization,"Siddarth Krishnamoorthy, Satvik Mashkaria, Aditya Grover",,,,,,,,,
|
727 |
-
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning,"Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc Le, Barret Zoph, Jason Wei, Adam Roberts",http://arxiv.org/abs/2301.13688,https://github.com/google-research/FLAN/tree/main/flan/v2,https://huggingface.co/papers/2301.13688,,,,2301.13688,11,
|
728 |
Compositional Score Modeling for Simulation-Based Inference,"Tomas Geffner, George Papamakarios, Andriy Mnih",http://arxiv.org/abs/2209.14249,,https://huggingface.co/papers/2209.14249,,,,2209.14249,3,0
|
729 |
Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeyev, Chenlai Shi, Yuhao Tan, Kseniia Dudnyk, Jian Zhou",http://arxiv.org/abs/2305.10699,,https://huggingface.co/papers/2305.10699,,,,2305.10699,5,0
|
730 |
Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
|
@@ -1099,7 +1099,7 @@ High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors
|
|
1099 |
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
|
1100 |
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,7
|
1101 |
HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
|
1102 |
-
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,
|
1103 |
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
|
1104 |
Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization,"Ameya Velingker, Maximilian Vötsch, David Woodruff, Samson Zhou",,,,,,,,,
|
1105 |
Exphormer: Sparse Transformers for Graphs,"Hamed Shirzad, Ameya Velingker, Balaji Venkatachalam, Danica J Sutherland, Ali K Sinop",http://arxiv.org/abs/2303.06147,https://github.com/hamed1375/Exphormer,https://huggingface.co/papers/2303.06147,,,,2303.06147,5,1
|
@@ -1190,7 +1190,7 @@ ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts,"Mingh
|
|
1190 |
Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
|
1191 |
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wang, Sen Lin, Junshan Zhang",,,,,,,,,
|
1192 |
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
|
1193 |
-
Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,
|
1194 |
Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
|
1195 |
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,1
|
1196 |
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
|
@@ -1363,7 +1363,7 @@ Reconstructive Neuron Pruning for Backdoor Defense,"Yige Li, XIXIANG LYU, Xingju
|
|
1363 |
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1
|
1364 |
Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
|
1365 |
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
|
1366 |
-
Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,
|
1367 |
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL,"Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez",,,,,,,,,
|
1368 |
A Statistical Perspective on Retrieval-Based Models,"Soumya Basu, Ankit Singh Rawat, Manzil Zaheer",,,,,,,,,
|
1369 |
PFNs4BO: Meta-Learning the surrogate model for Bayesian optimization from scratch using Transformers,"Samuel Gabriel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter",,,,,,,,,
|
@@ -1429,7 +1429,7 @@ Regularization-free Diffeomorphic Temporal Alignment Nets,"Ron Shapira Weber, Or
|
|
1429 |
Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes,"Nico Stucki, Johannes C. Paetzold, Suprosanna Shit, bjoern menze, Ulrich Bauer",http://arxiv.org/abs/2211.15272,https://github.com/nstucki/Betti-matching,https://huggingface.co/papers/2211.15272,,,,2211.15272,5,0
|
1430 |
FedDisco: Federated Learning with Discrepancy-Aware Collaboration,"Rui Ye, Mingkai Xu, Jianyu Wang, Chenxin Xu, Siheng Chen, Yan-Feng Wang",http://arxiv.org/abs/2305.19229,https://github.com/MediaBrain-SJTU/FedDisco,https://huggingface.co/papers/2305.19229,,,,2305.19229,6,0
|
1431 |
Personalized Federated Learning with Inferred Collaboration Graphs,"Rui Ye, Zhenyang Ni, Fangzhao Wu, Siheng Chen, Yan-Feng Wang",,,,,,,,,
|
1432 |
-
ModelDiff: A Framework for Comparing Learning Algorithms,"Harshay Shah, Sung Min (Sam) Park, Andrew Ilyas, Aleksander Madry",http://arxiv.org/abs/2211.12491,https://github.com/MadryLab/modeldiff,https://huggingface.co/papers/2211.12491,,,,2211.12491,4,
|
1433 |
Half-Hop: A graph upsampling approach for slowing down message passing,"Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Veličković, Eva Dyer",,,,,,,,,
|
1434 |
Structural Re-weighting Improves Graph Domain Adaptation,"Shikun Liu, Tianchun Li, Yongbin Feng, Nhan Tran, Han Zhao, Qiang Qiu, Pan Li, Pan Li",http://arxiv.org/abs/2306.03221,,https://huggingface.co/papers/2306.03221,,,,2306.03221,7,0
|
1435 |
InfoOT: Information Maximizing Optimal Transport,"Ching-Yao Chuang, Stefanie Jegelka, David Alvarez-Melis",http://arxiv.org/abs/2210.03164,,https://huggingface.co/papers/2210.03164,,,,2210.03164,3,0
|
@@ -1710,7 +1710,7 @@ Randomized Schur Complement Views for Graph Contrastive Learning,Vignesh Kothapa
|
|
1710 |
Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,1
|
1711 |
Hierarchical Diffusion for Offline Decision Making,"Wenhao Li, Xiangfeng Wang, Bo Jin, Hongyuan Zha",,,,,,,,,
|
1712 |
Generated Graph Detection,"Yihan Ma, Zhikun Zhang, Ning Yu, Xinlei He, Michael Backes, Yun Shen, Yang Zhang",http://arxiv.org/abs/2306.07758,,https://huggingface.co/papers/2306.07758,,,,2306.07758,7,0
|
1713 |
-
Variational Open-Domain Question Answering,"Valentin Liévin, Andreas Geert Motzfeldt, Ida Jensen, Ole Winther",http://arxiv.org/abs/2210.06345,,https://huggingface.co/papers/2210.06345,,,,2210.06345,4,
|
1714 |
PromptBoosting: Black-Box Text Classification with Ten Forward Passes,"Bairu Hou, Joe O'Connor, Jacob Andreas, Shiyu Chang, Yang Zhang",http://arxiv.org/abs/2212.09257,,https://huggingface.co/papers/2212.09257,,,,2212.09257,5,0
|
1715 |
Gradient-Free Structured Pruning with Unlabeled Data,"Azade Nova, Hanjun Dai, Dale Schuurmans",http://arxiv.org/abs/2303.04185,,https://huggingface.co/papers/2303.04185,,,,2303.04185,3,0
|
1716 |
Text-To-Concept (and Back) via Cross-Model Alignment,"Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi",http://arxiv.org/abs/2305.06386,,https://huggingface.co/papers/2305.06386,,,,2305.06386,4,2
|
|
|
433 |
Efficient List-Decodable Regression using Batches,"Abhimanyu Das, Ayush Jain, Weihao Kong, Rajat Sen",http://arxiv.org/abs/2211.12743,,https://huggingface.co/papers/2211.12743,,,,2211.12743,4,0
|
434 |
Proper Scoring Rules for Survival Analysis,Hiroki Yanagisawa,http://arxiv.org/abs/2305.00621,,https://huggingface.co/papers/2305.00621,,,,2305.00621,1,0
|
435 |
GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks,"Yuwen Li, Miao Xiong, Bryan Hooi",http://arxiv.org/abs/2306.00015,,https://huggingface.co/papers/2306.00015,,,,2306.00015,3,0
|
436 |
+
Large Language Models Can Be Easily Distracted by Irrelevant Context,"Haoyue Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou",http://arxiv.org/abs/2302.00093,,https://huggingface.co/papers/2302.00093,,,,2302.00093,8,1
|
437 |
Temporally Consistent Transformers for Video Generation,"Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel",http://arxiv.org/abs/2210.02396,,https://huggingface.co/papers/2210.02396,,,,2210.02396,4,1
|
438 |
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback,"Wonyoung Kim, Garud Iyengar, Assaf Zeevi",http://arxiv.org/abs/2301.13791,,https://huggingface.co/papers/2301.13791,,,,2301.13791,3,0
|
439 |
Scaling Laws for Generative Mixed-Modal Language Models,"Armen Aghajanyan, LILI YU, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, Susan Zhang, Stephen Roller, Naman Goyal, Omer Levy, Luke Zettlemoyer",http://arxiv.org/abs/2301.03728,,https://huggingface.co/papers/2301.03728,,,,2301.03728,10,0
|
|
|
557 |
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation,"Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon",,,,,,,,,
|
558 |
Certified Robust Neural Networks: Generalization and Corruption Resistance,"Amine Bennouna, Ryan Lucas, Bart Van Parys",http://arxiv.org/abs/2303.02251,https://github.com/RyanLucas3/HR_Neural_Networks,https://huggingface.co/papers/2303.02251,,,,2303.02251,3,1
|
559 |
"Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective","Michael Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel",,,,,,,,,
|
560 |
+
Anti-Exploration by Random Network Distillation,"Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov",http://arxiv.org/abs/2301.13616,,https://huggingface.co/papers/2301.13616,,,,2301.13616,4,2
|
561 |
Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes,"Liam Hodgkinson, Chris van der Heide, Fred Roosta, Michael Mahoney",http://arxiv.org/abs/2210.07612,,https://huggingface.co/papers/2210.07612,,,,2210.07612,4,1
|
562 |
Sampling-Based Accuracy Testing of Posterior Estimators for General Inference,"Pablo Lemos, Adam Coogan, Laurence Perreault-Levasseur, Yashar Hezaveh",http://arxiv.org/abs/2302.03026,,https://huggingface.co/papers/2302.03026,,,,2302.03026,4,1
|
563 |
Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models,"Parth Sangani, Arjun Kashettiwar, Pritish Chakraborty, Bhuvan Gangula, Sivasubramanian Durga, Ganesh Ramakrishnan, Rishabh Iyer, Abir De",,,,,,,,,
|
|
|
605 |
Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows,"Seobin Park, Dongjin Kim, Sungyong Baik, Tae Hyun Kim",,,,,,,,,
|
606 |
Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction,"Goergii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets",http://arxiv.org/abs/2202.00441,,https://huggingface.co/papers/2202.00441,,,,2202.00441,6,0
|
607 |
In Search for a Generalizable Method for Source Free Domain Adaptation,"Malik Boudiaf, tom denton, Bart van Merrienboer, Vincent Dumoulin, Eleni Triantafillou",http://arxiv.org/abs/2302.06658,,https://huggingface.co/papers/2302.06658,,,,2302.06658,5,1
|
608 |
+
GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency,"Min-Seop Kwak, Jiuhn Song, Seungryong Kim",http://arxiv.org/abs/2301.10941,,https://huggingface.co/papers/2301.10941,,,,2301.10941,3,2
|
609 |
Input uncertainty propagation through trained neural networks,"Paul Monchot, Loic Coquelin, Sébastien J. Petit, Sébastien Marmin, Erwann LE PENNEC, Nicolas Fischer",,,,,,,,,
|
610 |
Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference,"Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, Francois-Xavier Briol",http://arxiv.org/abs/2301.11674,,https://huggingface.co/papers/2301.11674,,,,2301.11674,5,0
|
611 |
SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
|
|
|
724 |
Unit Scaling: Out-of-the-Box Low-Precision Training,"Charlie Blake, Charlie Blake, Douglas Orr, Carlo Luschi",http://arxiv.org/abs/2303.11257,,https://huggingface.co/papers/2303.11257,,,,2303.11257,3,2
|
725 |
NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data,"LIU SONGMING, Zhongkai Hao, Chengyang Ying, Hang Su, Ze Cheng, Jun Zhu",http://arxiv.org/abs/2305.18694,https://github.com/thu-ml/NUNO,https://huggingface.co/papers/2305.18694,,,,2305.18694,6,0
|
726 |
Diffusion Models for Offline Black-Box Optimization,"Siddarth Krishnamoorthy, Satvik Mashkaria, Aditya Grover",,,,,,,,,
|
727 |
+
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning,"Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc Le, Barret Zoph, Jason Wei, Adam Roberts",http://arxiv.org/abs/2301.13688,https://github.com/google-research/FLAN/tree/main/flan/v2,https://huggingface.co/papers/2301.13688,,,,2301.13688,11,2
|
728 |
Compositional Score Modeling for Simulation-Based Inference,"Tomas Geffner, George Papamakarios, Andriy Mnih",http://arxiv.org/abs/2209.14249,,https://huggingface.co/papers/2209.14249,,,,2209.14249,3,0
|
729 |
Dirichlet Diffusion Score Model for Biological Sequence Generation,"Pavel Avdeyev, Chenlai Shi, Yuhao Tan, Kseniia Dudnyk, Jian Zhou",http://arxiv.org/abs/2305.10699,,https://huggingface.co/papers/2305.10699,,,,2305.10699,5,0
|
730 |
Leveraging Proxy of Training Data for Test-Time Adaptation,"Juwon Kang, Nayeong Kim, Donghyeon Kwon, Jungseul Ok, Suha Kwak",,,,,,,,,
|
|
|
1099 |
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
|
1100 |
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,7
|
1101 |
HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
|
1102 |
+
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,1
|
1103 |
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
|
1104 |
Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization,"Ameya Velingker, Maximilian Vötsch, David Woodruff, Samson Zhou",,,,,,,,,
|
1105 |
Exphormer: Sparse Transformers for Graphs,"Hamed Shirzad, Ameya Velingker, Balaji Venkatachalam, Danica J Sutherland, Ali K Sinop",http://arxiv.org/abs/2303.06147,https://github.com/hamed1375/Exphormer,https://huggingface.co/papers/2303.06147,,,,2303.06147,5,1
|
|
|
1190 |
Specializing Smaller Language Models towards Multi-Step Reasoning,"Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal, Tushar Khot",http://arxiv.org/abs/2301.12726,,https://huggingface.co/papers/2301.12726,,,,2301.12726,5,1
|
1191 |
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap,"Hang Wang, Sen Lin, Junshan Zhang",,,,,,,,,
|
1192 |
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models,"Dongjun Kim, Yeongmin Kim, Se Jung Kwon, Wanmo Kang, IL CHUL MOON",http://arxiv.org/abs/2211.17091,https://github.com/alsdudrla10/DG,https://huggingface.co/papers/2211.17091,,,,2211.17091,5,0
|
1193 |
+
Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees,"Shenghao Yang, Kimon Fountoulakis",http://arxiv.org/abs/2301.13187,,https://huggingface.co/papers/2301.13187,,,,2301.13187,2,1
|
1194 |
Robust Budget Pacing with a Single Sample,"Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang",http://arxiv.org/abs/2302.02006,,https://huggingface.co/papers/2302.02006,,,,2302.02006,5,0
|
1195 |
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark,"Alexander Pan, Jun Shern Chan, Andy Zou, Nathaniel Li, Scott Emmons, Hanlin Zhang, Steven Basart, Thomas Woodside, Dan Hendrycks",http://arxiv.org/abs/2304.03279,,https://huggingface.co/papers/2304.03279,,,,2304.03279,10,1
|
1196 |
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?,"Victor Boutin, Thomas FEL, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre",http://arxiv.org/abs/2301.11722,,https://huggingface.co/papers/2301.11722,,,,2301.11722,7,1
|
|
|
1363 |
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization,"Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su",http://arxiv.org/abs/2210.07658,,https://huggingface.co/papers/2210.07658,,,,2210.07658,6,1
|
1364 |
Multi-View Masked World Models for Visual Robotic Manipulation,"Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel",http://arxiv.org/abs/2302.02408,,https://huggingface.co/papers/2302.02408,,,,2302.02408,6,0
|
1365 |
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets,"Zachary Novack, Julian McAuley, Zachary Lipton, Saurabh Garg",http://arxiv.org/abs/2302.02551,https://github.com/acmi-lab/CHILS,https://huggingface.co/papers/2302.02551,,,,2302.02551,4,1
|
1366 |
+
Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization,"Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang",http://arxiv.org/abs/2305.11965,,https://huggingface.co/papers/2305.11965,,,,2305.11965,6,1
|
1367 |
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL,"Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez",,,,,,,,,
|
1368 |
A Statistical Perspective on Retrieval-Based Models,"Soumya Basu, Ankit Singh Rawat, Manzil Zaheer",,,,,,,,,
|
1369 |
PFNs4BO: Meta-Learning the surrogate model for Bayesian optimization from scratch using Transformers,"Samuel Gabriel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter",,,,,,,,,
|
|
|
1429 |
Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes,"Nico Stucki, Johannes C. Paetzold, Suprosanna Shit, bjoern menze, Ulrich Bauer",http://arxiv.org/abs/2211.15272,https://github.com/nstucki/Betti-matching,https://huggingface.co/papers/2211.15272,,,,2211.15272,5,0
|
1430 |
FedDisco: Federated Learning with Discrepancy-Aware Collaboration,"Rui Ye, Mingkai Xu, Jianyu Wang, Chenxin Xu, Siheng Chen, Yan-Feng Wang",http://arxiv.org/abs/2305.19229,https://github.com/MediaBrain-SJTU/FedDisco,https://huggingface.co/papers/2305.19229,,,,2305.19229,6,0
|
1431 |
Personalized Federated Learning with Inferred Collaboration Graphs,"Rui Ye, Zhenyang Ni, Fangzhao Wu, Siheng Chen, Yan-Feng Wang",,,,,,,,,
|
1432 |
+
ModelDiff: A Framework for Comparing Learning Algorithms,"Harshay Shah, Sung Min (Sam) Park, Andrew Ilyas, Aleksander Madry",http://arxiv.org/abs/2211.12491,https://github.com/MadryLab/modeldiff,https://huggingface.co/papers/2211.12491,,,,2211.12491,4,1
|
1433 |
Half-Hop: A graph upsampling approach for slowing down message passing,"Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Veličković, Eva Dyer",,,,,,,,,
|
1434 |
Structural Re-weighting Improves Graph Domain Adaptation,"Shikun Liu, Tianchun Li, Yongbin Feng, Nhan Tran, Han Zhao, Qiang Qiu, Pan Li, Pan Li",http://arxiv.org/abs/2306.03221,,https://huggingface.co/papers/2306.03221,,,,2306.03221,7,0
|
1435 |
InfoOT: Information Maximizing Optimal Transport,"Ching-Yao Chuang, Stefanie Jegelka, David Alvarez-Melis",http://arxiv.org/abs/2210.03164,,https://huggingface.co/papers/2210.03164,,,,2210.03164,3,0
|
|
|
1710 |
Path Neural Networks: Expressive and Accurate Graph Neural Networks,"Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis",http://arxiv.org/abs/2306.05955,,https://huggingface.co/papers/2306.05955,,,,2306.05955,4,1
|
1711 |
Hierarchical Diffusion for Offline Decision Making,"Wenhao Li, Xiangfeng Wang, Bo Jin, Hongyuan Zha",,,,,,,,,
|
1712 |
Generated Graph Detection,"Yihan Ma, Zhikun Zhang, Ning Yu, Xinlei He, Michael Backes, Yun Shen, Yang Zhang",http://arxiv.org/abs/2306.07758,,https://huggingface.co/papers/2306.07758,,,,2306.07758,7,0
|
1713 |
+
Variational Open-Domain Question Answering,"Valentin Liévin, Andreas Geert Motzfeldt, Ida Jensen, Ole Winther",http://arxiv.org/abs/2210.06345,,https://huggingface.co/papers/2210.06345,,,,2210.06345,4,2
|
1714 |
PromptBoosting: Black-Box Text Classification with Ten Forward Passes,"Bairu Hou, Joe O'Connor, Jacob Andreas, Shiyu Chang, Yang Zhang",http://arxiv.org/abs/2212.09257,,https://huggingface.co/papers/2212.09257,,,,2212.09257,5,0
|
1715 |
Gradient-Free Structured Pruning with Unlabeled Data,"Azade Nova, Hanjun Dai, Dale Schuurmans",http://arxiv.org/abs/2303.04185,,https://huggingface.co/papers/2303.04185,,,,2303.04185,3,0
|
1716 |
Text-To-Concept (and Back) via Cross-Model Alignment,"Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi",http://arxiv.org/abs/2305.06386,,https://huggingface.co/papers/2305.06386,,,,2305.06386,4,2
|