hysts HF Staff commited on
Commit
a5b9ea4
·
1 Parent(s): 44b9f70

commit files to HF hub

Browse files
Files changed (1) hide show
  1. papers.csv +5 -5
papers.csv CHANGED
@@ -1097,7 +1097,7 @@ LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian
1097
  R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
1098
  High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
1099
  COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
1100
- Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,7
1101
  HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
1102
  Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,1
1103
  Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
@@ -1465,7 +1465,7 @@ OpenFE: Automated Feature Generation with Expert-level Performance,"Tianping Zha
1465
  Weighted Sampling without Replacement for Deep Top-$k$ Classification,"Dieqiao Feng, Yuanqi Du, Carla Gomes, Bart Selman",,,,,,,,,
1466
  A Flexible Diffusion Model,"weitao du, He Zhang, Tao Yang, Yuanqi Du",http://arxiv.org/abs/2206.10365,,https://huggingface.co/papers/2206.10365,,,,2206.10365,4,0
1467
  Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks,"Dominik Schnaus, Jongseok Lee, Daniel Cremers, Rudolph Triebel",,,,,,,,,
1468
- Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs,"Michael Kirchhof, Enkelejda Kasneci, Seong Joon Oh",http://arxiv.org/abs/2302.02865,https://github.com/mkirchhof/Probabilistic_Contrastive_Learning,https://huggingface.co/papers/2302.02865,,,,2302.02865,3,0
1469
  Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains,"Matthew Dowling, Yuan Zhao, Memming Park",http://arxiv.org/abs/2306.01802,,https://huggingface.co/papers/2306.01802,,,,2306.01802,3,0
1470
  Learning Control by Iterative Inversion,"Gal Leibovich, Guy Jacob, Or Avner, Gal Novik, Aviv Tamar",http://arxiv.org/abs/2211.01724,,https://huggingface.co/papers/2211.01724,,,,2211.01724,5,2
1471
  Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning,"Tongzhou Wang, Antonio Torralba, Phillip Isola, Amy Zhang",http://arxiv.org/abs/2304.01203,,https://huggingface.co/papers/2304.01203,,,,2304.01203,4,1
@@ -1490,7 +1490,7 @@ Infinite Action Contextual Bandits with Reusable Data Exhaust,"Mark Rucker, Ying
1490
  Regret Minimization and Convergence to Equilibria in General-sum Markov Games,"Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour",http://arxiv.org/abs/2207.14211,,https://huggingface.co/papers/2207.14211,,,,2207.14211,5,0
1491
  Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion,"Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Francesco Trovò, Nicola Gatti",http://arxiv.org/abs/2303.01296,,https://huggingface.co/papers/2303.01296,,,,2303.01296,6,0
1492
  Distributed Linear Bandits under Communication Constraints,"Sudeep Salgia, Qing Zhao",http://arxiv.org/abs/2211.02212,,https://huggingface.co/papers/2211.02212,,,,2211.02212,2,0
1493
- Online Mechanism Design for Information Acquisition,"Federico Cacciamani, Matteo Castiglioni, Nicola Gatti",http://arxiv.org/abs/2302.02873,,https://huggingface.co/papers/2302.02873,,,,2302.02873,3,0
1494
  Federated Online and Bandit Convex Optimization,"Kumar Kshitij Patel, Lingxiao Wang, Aadirupa Saha, Nati Srebro",,,,,,,,,
1495
  Statistical Foundations of Prior-Data Fitted Networks,Thomas Nagler,http://arxiv.org/abs/2305.11097,,https://huggingface.co/papers/2305.11097,,,,2305.11097,1,0
1496
  Who Needs to Know? Minimal Knowledge for Optimal Coordination,"Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell",http://arxiv.org/abs/2306.09309,,https://huggingface.co/papers/2306.09309,,,,2306.09309,5,1
@@ -1590,7 +1590,7 @@ Automatically Auditing Large Language Models via Discrete Optimization,"Erik Jon
1590
  Data Structures for Density Estimation,"Anders Aamand, Alexandr Andoni, Justin Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal",,,,,,,,,
1591
  Provably Invariant Learning without Domain Information,"Xiaoyu Tan, Yong LIN, Shengyu Zhu, Chao Qu, Xihe Qiu, Xu Yinghui, Peng Cui, Yuan Qi",,,,,,,,,
1592
  Online Platt Scaling with Calibeating,"Chirag Gupta, Aaditya Ramdas",http://arxiv.org/abs/2305.00070,,https://huggingface.co/papers/2305.00070,,,,2305.00070,2,0
1593
- An Effective Meaningful Way to Evaluate Survival Models,"Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner",http://arxiv.org/abs/2306.01196,,https://huggingface.co/papers/2306.01196,,,,2306.01196,8,0
1594
  Evaluating Unsupervised Denoising Requires Unsupervised Metrics,"Adrià Marcos Morales, Matan Leibovich, Sreyas Mohan, Joshua Vincent, Piyush Haluai, Mai Tan, Peter Crozier, Carlos Fernandez-Granda",http://arxiv.org/abs/2210.05553,,https://huggingface.co/papers/2210.05553,,,,2210.05553,8,0
1595
  Hidden symmetries of ReLU networks,"Elisenda Grigsby, Kathryn Lindsey, David Rolnick",http://arxiv.org/abs/2306.06179,,https://huggingface.co/papers/2306.06179,,,,2306.06179,3,0
1596
  Modeling Dynamic Environments with Scene Graph Memory,"Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Emily Jin, Chengshu Li, Ruohan Zhang, Li Fei-Fei, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín",http://arxiv.org/abs/2305.17537,,https://huggingface.co/papers/2305.17537,,,,2305.17537,10,2
@@ -1620,7 +1620,7 @@ CRISP: Curriculum based Sequential neural decoders for Polar code family,"S Ashw
1620
  On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits,"Weitong Zhang, Jiafan He, Jiafan He, Zhiyuan Fan, Quanquan Gu",http://arxiv.org/abs/2303.09390,,https://huggingface.co/papers/2303.09390,,,,2303.09390,4,1
1621
  Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
1622
  On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
1623
- Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,0
1624
  A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,1
1625
  Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
1626
  Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,1
 
1097
  R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
1098
  High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
1099
  COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
1100
+ Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,8
1101
  HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
1102
  Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,1
1103
  Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
 
1465
  Weighted Sampling without Replacement for Deep Top-$k$ Classification,"Dieqiao Feng, Yuanqi Du, Carla Gomes, Bart Selman",,,,,,,,,
1466
  A Flexible Diffusion Model,"weitao du, He Zhang, Tao Yang, Yuanqi Du",http://arxiv.org/abs/2206.10365,,https://huggingface.co/papers/2206.10365,,,,2206.10365,4,0
1467
  Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks,"Dominik Schnaus, Jongseok Lee, Daniel Cremers, Rudolph Triebel",,,,,,,,,
1468
+ Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs,"Michael Kirchhof, Enkelejda Kasneci, Seong Joon Oh",http://arxiv.org/abs/2302.02865,https://github.com/mkirchhof/Probabilistic_Contrastive_Learning,https://huggingface.co/papers/2302.02865,,,,2302.02865,3,1
1469
  Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains,"Matthew Dowling, Yuan Zhao, Memming Park",http://arxiv.org/abs/2306.01802,,https://huggingface.co/papers/2306.01802,,,,2306.01802,3,0
1470
  Learning Control by Iterative Inversion,"Gal Leibovich, Guy Jacob, Or Avner, Gal Novik, Aviv Tamar",http://arxiv.org/abs/2211.01724,,https://huggingface.co/papers/2211.01724,,,,2211.01724,5,2
1471
  Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning,"Tongzhou Wang, Antonio Torralba, Phillip Isola, Amy Zhang",http://arxiv.org/abs/2304.01203,,https://huggingface.co/papers/2304.01203,,,,2304.01203,4,1
 
1490
  Regret Minimization and Convergence to Equilibria in General-sum Markov Games,"Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour",http://arxiv.org/abs/2207.14211,,https://huggingface.co/papers/2207.14211,,,,2207.14211,5,0
1491
  Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion,"Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Francesco Trovò, Nicola Gatti",http://arxiv.org/abs/2303.01296,,https://huggingface.co/papers/2303.01296,,,,2303.01296,6,0
1492
  Distributed Linear Bandits under Communication Constraints,"Sudeep Salgia, Qing Zhao",http://arxiv.org/abs/2211.02212,,https://huggingface.co/papers/2211.02212,,,,2211.02212,2,0
1493
+ Online Mechanism Design for Information Acquisition,"Federico Cacciamani, Matteo Castiglioni, Nicola Gatti",http://arxiv.org/abs/2302.02873,,https://huggingface.co/papers/2302.02873,,,,2302.02873,3,1
1494
  Federated Online and Bandit Convex Optimization,"Kumar Kshitij Patel, Lingxiao Wang, Aadirupa Saha, Nati Srebro",,,,,,,,,
1495
  Statistical Foundations of Prior-Data Fitted Networks,Thomas Nagler,http://arxiv.org/abs/2305.11097,,https://huggingface.co/papers/2305.11097,,,,2305.11097,1,0
1496
  Who Needs to Know? Minimal Knowledge for Optimal Coordination,"Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell",http://arxiv.org/abs/2306.09309,,https://huggingface.co/papers/2306.09309,,,,2306.09309,5,1
 
1590
  Data Structures for Density Estimation,"Anders Aamand, Alexandr Andoni, Justin Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal",,,,,,,,,
1591
  Provably Invariant Learning without Domain Information,"Xiaoyu Tan, Yong LIN, Shengyu Zhu, Chao Qu, Xihe Qiu, Xu Yinghui, Peng Cui, Yuan Qi",,,,,,,,,
1592
  Online Platt Scaling with Calibeating,"Chirag Gupta, Aaditya Ramdas",http://arxiv.org/abs/2305.00070,,https://huggingface.co/papers/2305.00070,,,,2305.00070,2,0
1593
+ An Effective Meaningful Way to Evaluate Survival Models,"Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner",http://arxiv.org/abs/2306.01196,,https://huggingface.co/papers/2306.01196,,,,2306.01196,8,1
1594
  Evaluating Unsupervised Denoising Requires Unsupervised Metrics,"Adrià Marcos Morales, Matan Leibovich, Sreyas Mohan, Joshua Vincent, Piyush Haluai, Mai Tan, Peter Crozier, Carlos Fernandez-Granda",http://arxiv.org/abs/2210.05553,,https://huggingface.co/papers/2210.05553,,,,2210.05553,8,0
1595
  Hidden symmetries of ReLU networks,"Elisenda Grigsby, Kathryn Lindsey, David Rolnick",http://arxiv.org/abs/2306.06179,,https://huggingface.co/papers/2306.06179,,,,2306.06179,3,0
1596
  Modeling Dynamic Environments with Scene Graph Memory,"Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Emily Jin, Chengshu Li, Ruohan Zhang, Li Fei-Fei, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín",http://arxiv.org/abs/2305.17537,,https://huggingface.co/papers/2305.17537,,,,2305.17537,10,2
 
1620
  On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits,"Weitong Zhang, Jiafan He, Jiafan He, Zhiyuan Fan, Quanquan Gu",http://arxiv.org/abs/2303.09390,,https://huggingface.co/papers/2303.09390,,,,2303.09390,4,1
1621
  Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
1622
  On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
1623
+ Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,1
1624
  A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,1
1625
  Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
1626
  Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,1