AI_MODELS
ggerganov/whisper.cpp Automatic Speech Recognition • Updated Oct 29, 2024 • 1.07k
nvidia/Hymba-1.5B-Base Text Generation • 2B • Updated Jan 2 • 1.11k • 146
nvidia/Hymba-1.5B-Instruct Text Generation • 2B • Updated Jan 2 • 235 • 233
deepseek-ai/DeepSeek-V3-Base 685B • Updated Mar 27 • 10.6k • 1.67k
LLM-NEW
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only Paper • 2306.01116 • Published Jun 1, 2023 • 38
Diffusion Model Alignment Using Direct Preference Optimization Paper • 2311.12908 • Published Nov 21, 2023 • 50