Rykov Elisei

lmeribal

lmeribal

AI & ML interests

NLP, Multimodality

Recent Activity

upvoted an article 7 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

upvoted a paper 9 days ago

Chain of Draft: Thinking Faster by Writing Less

upvoted a paper 19 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

View all activity

Organizations

lmeribal's activity

upvoted an article 7 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 66

upvoted a paper 9 days ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 15 days ago • 44

upvoted a paper 19 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 20 days ago • 84

liked a model 21 days ago

deepvk/RuModernBERT-base

Fill-Mask • Updated 21 days ago • 8.16k • 28

upvoted a paper 26 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published 30 days ago • 86

liked a model about 1 month ago

fava-uw/fava-model

Text Generation • Updated Dec 1, 2024 • 225 • 16

upvoted 2 papers about 1 month ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 58

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112

upvoted a collection about 2 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 128

liked a model about 2 months ago

MoritzLaurer/ModernBERT-large-zeroshot-v2.0

Text Classification • Updated Jan 16 • 114k • 41

liked a dataset about 2 months ago

MERA-evaluation/WEIRD

Viewer • Updated Dec 10, 2024 • 824 • 198 • 1

upvoted a paper about 2 months ago

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Paper • 2501.08292 • Published Jan 14 • 17

upvoted a paper 2 months ago

Fine-grained Hallucination Detection and Editing for Language Models

Paper • 2401.06855 • Published Jan 12, 2024 • 4

liked a model 2 months ago

microsoft/phi-4

Text Generation • Updated 16 days ago • 525k • • 1.89k

liked a dataset 2 months ago

fava-uw/fava-data

Viewer • Updated Dec 1, 2024 • 30.1k • 207 • 13

liked 2 datasets 3 months ago

microsoft/wiki_qa

Viewer • Updated Jan 4, 2024 • 29.3k • 3.78k • 54

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 101k • 758

liked a model 3 months ago

dslim/distilbert-NER

Token Classification • Updated Oct 8, 2024 • 41.5k • • 30

liked 2 datasets 3 months ago

potsawee/wiki_bio_gpt3_hallucination

Viewer • Updated May 29, 2023 • 238 • 558 • 25

ServiceNow/repliqa

Viewer • Updated 29 days ago • 53.9k • 1.29k • 8