-
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 3.9M • • 2.69k -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 535k • • 4.35k -
mistralai/Mixtral-8x7B-v0.1
Text Generation • Updated • 44.7k • • 1.7k -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 59
Molone Laveh PRO
molonelaveh
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
liked
a Space
3 days ago
Pendrokar/TTS-Spaces-Arena
liked
a Space
3 days ago
sesame/csm-1b
liked
a model
3 days ago
canopylabs/orpheus-3b-0.1-ft
Organizations
Collections
2
models
None public yet
datasets
None public yet