Alexey Dontsov's picture

4 11 2

Alexey Dontsov

therem

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

🧠 Reasoning datasets

upvoted a paper 4 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

upvoted a paper 5 days ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

View all activity

Organizations

None yet

therem's activity

upvoted a collection 1 day ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated 4 days ago • 62

upvoted a paper 4 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published 4 days ago • 83

upvoted a paper 5 days ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published 10 days ago • 53

upvoted a paper 10 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 11 days ago • 112

upvoted a paper 22 days ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 23 days ago • 63

upvoted a collection about 1 month ago

Purging corrupted capabilities across language models

Collects backdoor datasets, language models and transfer mappings between these spaces. • 6 items • Updated Dec 17, 2024 • 3

upvoted a paper 2 months ago

Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation

Paper • 2412.07334 • Published Dec 10, 2024 • 16

New activity in therem/CLEAR 2 months ago

Bug for idk data loader (code in github)

#4 opened 2 months ago by

upvoted a paper 2 months ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 71

upvoted a paper 3 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

upvoted a collection 3 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 517

New activity in therem/CLEAR 3 months ago

Will the weight of the model after unlearn be disclosed in the future?

#3 opened 3 months ago by

updated a collection 3 months ago

CLEAR

Our benchmark for multimodal unlearning methods • 9 items • Updated Nov 6, 2024