LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published 15 days ago • 11
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices Paper • 2410.00531 • Published 17 days ago • 28
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21 • 53
Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM Paper • 2408.07246 • Published Aug 14 • 19
Advancing Molecular Machine (Learned) Representations with Stereoelectronics-Infused Molecular Graphs Paper • 2408.04520 • Published Aug 8 • 5
Better Alignment with Instruction Back-and-Forth Translation Paper • 2408.04614 • Published Aug 8 • 14
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Paper • 2408.03361 • Published Aug 6 • 85
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis Paper • 2407.13301 • Published Jul 18 • 54
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes Paper • 2407.10957 • Published Jul 15 • 23
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5 • 24
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Paper • 2406.07394 • Published Jun 11 • 21
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6 • 227
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6 • 63
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 596
The FinBen: An Holistic Financial Benchmark for Large Language Models Paper • 2402.12659 • Published Feb 20 • 16
RMT: Retentive Networks Meet Vision Transformers Paper • 2309.11523 • Published Sep 20, 2023 • 33