Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2411.08147

Large Language Models Can Self-Improve in Long-context Reasoning

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 4 days ago • 44
Siheng99/Llama-3.1-8B-Instruct-SEALONG

Text Generation • Updated 6 days ago • 22
Siheng99/Qwen2.5-7B-Instruct-SEALONG

Text Generation • Updated 6 days ago • 19 • 1
Siheng99/Qwen2.5-14B-Instruct-SEALONG

Text Generation • Updated 6 days ago • 13 • 1

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3 • 31
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17 • 25
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 121
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17 • 21

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 144
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20 • 12
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24 • 51
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24 • 45

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 4 days ago • 44

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 4 days ago • 44

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 4 days ago • 44

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 4 days ago • 44

Meta-Learning a Dynamical Language Model

Paper • 1803.10631 • Published Mar 28, 2018
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation

Paper • 2003.11963 • Published Mar 26, 2020
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model

Paper • 2212.04960 • Published Dec 9, 2022 • 1
Continuous Learning in a Hierarchical Multiscale Neural Network

Paper • 1805.05758 • Published May 15, 2018 • 1

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published 10 days ago • 23
Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 4 days ago • 44

about 17 hours ago

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12 • 14
From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

Paper • 2410.06456 • Published Oct 9 • 35
Emergent properties with repeated examples

Paper • 2410.07041 • Published Oct 9 • 8
Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9 • 69

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs