commit files to HF hub
papers.csv CHANGED (+5 -5)
@@ -1097,7 +1097,7 @@ LegendreTron: Uprising Proper Multiclass Loss Learning,"Kevin H. Lam, Christian
 R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents,"Daniel D. Johnson, Daniel Tarlow, Christian Walder",,,,,,,,,
 High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors,"Shivam Gupta, Jasper Lee, Eric Price",http://arxiv.org/abs/2302.02497,,https://huggingface.co/papers/2302.02497,,,,2302.02497,3,0
 COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models,"Jinqi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan",http://arxiv.org/abs/2305.17235,,https://huggingface.co/papers/2305.17235,,,,2305.17235,6,1
-Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,
+Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling,"Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'Brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, USVSN Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar van der Wal",http://arxiv.org/abs/2304.01373,https://github.com/EleutherAI/pythia,https://huggingface.co/papers/2304.01373,,,,2304.01373,13,8
 HyperTuning: Toward Adapting Large Language Models without Back-propagation,"Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen",,,,,,,,,
 Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models,"Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2302.00618,,https://huggingface.co/papers/2302.00618,,,,2302.00618,6,1
 Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise,"Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen",http://arxiv.org/abs/2212.11685,https://github.com/microsoft/ProphetNet/tree/master/GENIE,https://huggingface.co/papers/2212.11685,,,,2212.11685,8,0
@@ -1465,7 +1465,7 @@ OpenFE: Automated Feature Generation with Expert-level Performance,"Tianping Zha
 Weighted Sampling without Replacement for Deep Top-$k$ Classification,"Dieqiao Feng, Yuanqi Du, Carla Gomes, Bart Selman",,,,,,,,,
 A Flexible Diffusion Model,"weitao du, He Zhang, Tao Yang, Yuanqi Du",http://arxiv.org/abs/2206.10365,,https://huggingface.co/papers/2206.10365,,,,2206.10365,4,0
 Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks,"Dominik Schnaus, Jongseok Lee, Daniel Cremers, Rudolph Triebel",,,,,,,,,
-Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs,"Michael Kirchhof, Enkelejda Kasneci, Seong Joon Oh",http://arxiv.org/abs/2302.02865,https://github.com/mkirchhof/Probabilistic_Contrastive_Learning,https://huggingface.co/papers/2302.02865,,,,2302.02865,3,
+Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs,"Michael Kirchhof, Enkelejda Kasneci, Seong Joon Oh",http://arxiv.org/abs/2302.02865,https://github.com/mkirchhof/Probabilistic_Contrastive_Learning,https://huggingface.co/papers/2302.02865,,,,2302.02865,3,1
 Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains,"Matthew Dowling, Yuan Zhao, Memming Park",http://arxiv.org/abs/2306.01802,,https://huggingface.co/papers/2306.01802,,,,2306.01802,3,0
 Learning Control by Iterative Inversion,"Gal Leibovich, Guy Jacob, Or Avner, Gal Novik, Aviv Tamar",http://arxiv.org/abs/2211.01724,,https://huggingface.co/papers/2211.01724,,,,2211.01724,5,2
 Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning,"Tongzhou Wang, Antonio Torralba, Phillip Isola, Amy Zhang",http://arxiv.org/abs/2304.01203,,https://huggingface.co/papers/2304.01203,,,,2304.01203,4,1
@@ -1490,7 +1490,7 @@ Infinite Action Contextual Bandits with Reusable Data Exhaust,"Mark Rucker, Ying
 Regret Minimization and Convergence to Equilibria in General-sum Markov Games,"Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour",http://arxiv.org/abs/2207.14211,,https://huggingface.co/papers/2207.14211,,,,2207.14211,5,0
 Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion,"Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Francesco Trovò, Nicola Gatti",http://arxiv.org/abs/2303.01296,,https://huggingface.co/papers/2303.01296,,,,2303.01296,6,0
 Distributed Linear Bandits under Communication Constraints,"Sudeep Salgia, Qing Zhao",http://arxiv.org/abs/2211.02212,,https://huggingface.co/papers/2211.02212,,,,2211.02212,2,0
-Online Mechanism Design for Information Acquisition,"Federico Cacciamani, Matteo Castiglioni, Nicola Gatti",http://arxiv.org/abs/2302.02873,,https://huggingface.co/papers/2302.02873,,,,2302.02873,3,
+Online Mechanism Design for Information Acquisition,"Federico Cacciamani, Matteo Castiglioni, Nicola Gatti",http://arxiv.org/abs/2302.02873,,https://huggingface.co/papers/2302.02873,,,,2302.02873,3,1
 Federated Online and Bandit Convex Optimization,"Kumar Kshitij Patel, Lingxiao Wang, Aadirupa Saha, Nati Srebro",,,,,,,,,
 Statistical Foundations of Prior-Data Fitted Networks,Thomas Nagler,http://arxiv.org/abs/2305.11097,,https://huggingface.co/papers/2305.11097,,,,2305.11097,1,0
 Who Needs to Know? Minimal Knowledge for Optimal Coordination,"Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell",http://arxiv.org/abs/2306.09309,,https://huggingface.co/papers/2306.09309,,,,2306.09309,5,1
@@ -1590,7 +1590,7 @@ Automatically Auditing Large Language Models via Discrete Optimization,"Erik Jon
 Data Structures for Density Estimation,"Anders Aamand, Alexandr Andoni, Justin Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal",,,,,,,,,
 Provably Invariant Learning without Domain Information,"Xiaoyu Tan, Yong LIN, Shengyu Zhu, Chao Qu, Xihe Qiu, Xu Yinghui, Peng Cui, Yuan Qi",,,,,,,,,
 Online Platt Scaling with Calibeating,"Chirag Gupta, Aaditya Ramdas",http://arxiv.org/abs/2305.00070,,https://huggingface.co/papers/2305.00070,,,,2305.00070,2,0
-An Effective Meaningful Way to Evaluate Survival Models,"Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner",http://arxiv.org/abs/2306.01196,,https://huggingface.co/papers/2306.01196,,,,2306.01196,8,
+An Effective Meaningful Way to Evaluate Survival Models,"Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner",http://arxiv.org/abs/2306.01196,,https://huggingface.co/papers/2306.01196,,,,2306.01196,8,1
 Evaluating Unsupervised Denoising Requires Unsupervised Metrics,"Adrià Marcos Morales, Matan Leibovich, Sreyas Mohan, Joshua Vincent, Piyush Haluai, Mai Tan, Peter Crozier, Carlos Fernandez-Granda",http://arxiv.org/abs/2210.05553,,https://huggingface.co/papers/2210.05553,,,,2210.05553,8,0
 Hidden symmetries of ReLU networks,"Elisenda Grigsby, Kathryn Lindsey, David Rolnick",http://arxiv.org/abs/2306.06179,,https://huggingface.co/papers/2306.06179,,,,2306.06179,3,0
 Modeling Dynamic Environments with Scene Graph Memory,"Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Emily Jin, Chengshu Li, Ruohan Zhang, Li Fei-Fei, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín",http://arxiv.org/abs/2305.17537,,https://huggingface.co/papers/2305.17537,,,,2305.17537,10,2
@@ -1620,7 +1620,7 @@ CRISP: Curriculum based Sequential neural decoders for Polar code family,"S Ashw
 On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits,"Weitong Zhang, Jiafan He, Jiafan He, Zhiyuan Fan, Quanquan Gu",http://arxiv.org/abs/2303.09390,,https://huggingface.co/papers/2303.09390,,,,2303.09390,4,1
 Brainformers: Trading Simplicity for Efficiency,"Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean",http://arxiv.org/abs/2306.00008,,https://huggingface.co/papers/2306.00008,,,,2306.00008,15,3
 On the Training Instability of Shuffling SGD with Batch Normalization,"David X. Wu, Chulhee Yun, Suvrit Sra",http://arxiv.org/abs/2302.12444,,https://huggingface.co/papers/2302.12444,,,,2302.12444,3,0
-Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,
+Dropout Reduces Underfitting,"Zhuang Liu, Zhiqiu (Oscar) Xu, Joseph Jin, Zhiqiang Shen, Trevor Darrell",http://arxiv.org/abs/2303.01500,https://github.com/facebookresearch/dropout,https://huggingface.co/papers/2303.01500,,,,2303.01500,5,1
 A modern look at the relationship between sharpness and generalization,"Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion",http://arxiv.org/abs/2302.07011,https://github.com/tml-epfl/sharpness-vs-generalization,https://huggingface.co/papers/2302.07011,,,,2302.07011,5,1
 Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes,"Zhaowei Zhu, Yuanshun Yao, Jiankai Sun, Hang Li, Yang Liu",http://arxiv.org/abs/2210.03175,,https://huggingface.co/papers/2210.03175,,,,2210.03175,5,0
 Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis,"Sanjay Kariyappa, Chuan Guo, Kiwan Maeng, Wenjie Xiong, G. Edward Suh, Moinuddin Qureshi, Hsien-Hsin Sean Lee",http://arxiv.org/abs/2209.05578,,https://huggingface.co/papers/2209.05578,,,,2209.05578,7,1