Purging corrupted capabilities across language models Collection Collects backdoor datasets, language models and transfer mappings between these spaces. • 6 items • Updated 25 days ago • 3
Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation Paper • 2412.07334 • Published Dec 10, 2024 • 16
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 71
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 457