Spaces:
Sleeping
Sleeping
commit files to HF hub
Browse files- papers.csv +1 -1
papers.csv
CHANGED
@@ -611,7 +611,7 @@ Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Fre
|
|
611 |
SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
|
612 |
Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,,
|
613 |
Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1
|
614 |
-
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,
|
615 |
Trompt: Towards a Better Deep Neural Network for Tabular Data,"Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Ting-Wei Chen, Tien-Hao Chang",http://arxiv.org/abs/2305.18446,,https://huggingface.co/papers/2305.18446,,,,2305.18446,5,0
|
616 |
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits,"Yunlong Hou, Vincent Tan, Zixin Zhong",http://arxiv.org/abs/2301.13393,,https://huggingface.co/papers/2301.13393,,,,2301.13393,3,0
|
617 |
Online Restless Bandits with Unobserved States,"BOWEN JIANG, Bo Jiang, Jian Li, TAO LIN, Xinbing Wang, Chenghu Zhou",,,,,,,,,
|
|
|
611 |
SGD with large step sizes learns sparse features,"Maksym Andriushchenko, Aditya Vardhan Varre, Loucas Pillaud-Vivien, Nicolas Flammarion",http://arxiv.org/abs/2210.05337,https://github.com/tml-epfl/sgd-sparse-features,https://huggingface.co/papers/2210.05337,,,,2210.05337,4,1
|
612 |
Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network,"Marie Guyomard, Susana Barbosa, Lionel Fillatre",,,,,,,,,
|
613 |
Cramming: Training a Language Model on a single GPU in one day.,"Jonas Geiping, Tom Goldstein",https://arxiv.org/abs//2212.14034,https://github.com/JonasGeiping/cramming,https://huggingface.co/papers/2212.14034,,https://huggingface.co/JonasGeiping/crammed-bert,https://huggingface.co/datasets/JonasGeiping/the_pile_WordPiecex32768_2efdb9d060d1ae95faf952ec1a50f020,2212.14034,2,1
|
614 |
+
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models,"James Allingham, JIE REN, Michael Dusenberry, Jeremiah Liu, Xiuye Gu, Yin Cui, Dustin Tran, Balaji Lakshminarayanan",http://arxiv.org/abs/2302.06235,,https://huggingface.co/papers/2302.06235,,,,2302.06235,8,2
|
615 |
Trompt: Towards a Better Deep Neural Network for Tabular Data,"Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Ting-Wei Chen, Tien-Hao Chang",http://arxiv.org/abs/2305.18446,,https://huggingface.co/papers/2305.18446,,,,2305.18446,5,0
|
616 |
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits,"Yunlong Hou, Vincent Tan, Zixin Zhong",http://arxiv.org/abs/2301.13393,,https://huggingface.co/papers/2301.13393,,,,2301.13393,3,0
|
617 |
Online Restless Bandits with Unobserved States,"BOWEN JIANG, Bo Jiang, Jian Li, TAO LIN, Xinbing Wang, Chenghu Zhou",,,,,,,,,
|