The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 2 days ago • 93
DTF Collection Fine-tune of the Qwen-2.5-7B model on a dump of DTF posts and comments. • 2 items • Updated 2 days ago
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification Paper • 2311.13937 • Published Nov 23, 2023 • 1
Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management Paper • 2406.19543 • Published Jun 27, 2024 • 1
Multilingual and Explainable Text Detoxification with Parallel Corpora Paper • 2412.11691 • Published Dec 16, 2024 • 1
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding Paper • 2501.13200 • Published 14 days ago • 61
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 20 days ago • 63
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models Paper • 2501.00874 • Published Jan 1 • 13
Facilitating large language model Russian adaptation with Learned Embedding Propagation Paper • 2412.21140 • Published Dec 30, 2024 • 16
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71