One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published 4 days ago • 39
OLMo 2 Collection Artifacts for the second set of OLMo models. • 27 items • Updated about 16 hours ago • 102
SynthDetoxM Collection Data and models from NAACL 2025 paper "SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators" by Moskovskiy et al. • 4 items • Updated 15 days ago • 2
Knowledge Packing Collection Models and datasets from the paper: "How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?" https://arxiv.org/abs/2502.14502 • 9 items • Updated 24 days ago • 2
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 29 days ago • 164
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 29 days ago • 85
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published about 1 month ago • 67
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements Paper • 2408.15666 • Published Aug 28, 2024 • 11
POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation Paper • 2407.14931 • Published Jul 20, 2024 • 22
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper • 2502.06394 • Published Feb 10 • 86
Methods for Detoxification of Texts for the Russian Language Paper • 2105.09052 • Published May 19, 2021 • 1
PseudoParaDetox Collection Models and datasets from the paper: "LLMs to Replace Crowdsourcing For Parallel Data Creation? The Case of Text Detoxification" by Moskovskiy et al. • 9 items • Updated 12 days ago • 1
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published Dec 19, 2024 • 54
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 48