CLAP: Contrastive Language-Audio Pretraining Collection CLAP is to audio what CLIP is to images. • 5 items • Updated Oct 31, 2023 • 8
Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 69
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 3 days ago • 204
📀 Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 34
My Best Models Collection These all mark personal achievements in my journey • 7 items • Updated Mar 31 • 4
Personal Favorites Collection Recommended models I use often or like for any reason. I recommend reading their cards for more details. • 10 items • Updated about 20 hours ago • 59
story writing favourites Collection Models I personally liked for generating stories in the past. Not a recommendation; many of these are outdated. • 17 items • Updated Nov 11 • 29
Quantized Models (GGUF, IQ, Imatrix) Collection Various quantizations of models in the GGUF format. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 90 items • Updated 13 days ago • 49
Utilities Collection No crazy stuff, but useful ones for in-between steps • 15 items • Updated Nov 6 • 5
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 506