Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 9 days ago • 12
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLate’s ColBERT architecture and tuned for seven European languages. • 7 items • Updated 15 days ago • 17
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 153
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 126 items • Updated 4 days ago • 111
Open LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard • 65 items • Updated Mar 20 • 630
Article 🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛ By anakin87 • Oct 21, 2024 • 19
Article Selective fine-tuning of Language Models with Spectrum By anakin87 • Sep 3, 2024 • 36
AnglE📐-based Embeddings Collection This collection consists of popular embeddings trained with AnglE: https://github.com/SeanLee97/AnglE • 9 items • Updated Aug 1, 2024 • 3
Article How we leveraged distilabel to create an Argilla 2.0 Chatbot By plaguss and 4 others • Jul 16, 2024 • 33
A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 10
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 298 items • Updated about 8 hours ago • 28
LLM ITA Collection Open-Source Language Models Finetuned for Italian • 4 items • Updated Oct 19, 2024 • 7