28 74 113

Hyunwoo Ko

Cartinoe5930

https://cartinoe5930.tistory.com/

AI & ML interests

NLP(LLM)

Recent Activity

liked a dataset 2 days ago

HuggingFaceTB/finemath

upvoted an article 20 days ago

Releasing QwQ-LongCoT-130K

liked a dataset 20 days ago

amphora/QwQ-LongCoT-130K

View all activity

Organizations

Cartinoe5930's activity

upvoted an article 20 days ago

Article

Releasing QwQ-LongCoT-130K

•

20 days ago

• 8

upvoted 2 articles 2 months ago

Article

Navigating Korean LLM Research #2: Evaluation Tools

•

Oct 23

• 6

Article

Navigating Korean LLM Research #1: Models

•

Oct 22

• 20

upvoted a paper 3 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted an article 4 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19

• 74

upvoted a paper 4 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 117

upvoted 2 papers 5 months ago

EXAONE 3.0 7.8B Instruction Tuned Language Model

Paper • 2408.03541 • Published Aug 7 • 34

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23 • 21

upvoted an article 5 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 224

upvoted a paper 5 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3 • 49

upvoted an article 5 months ago

Article

The Rise of Agentic Data Generation

•

Jul 15

• 78

upvoted an article 6 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23

• 62

upvoted a collection 6 months ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Collection

Outstanding research in interpretability and evaluation of language models, summarized • 90 items • Updated 5 days ago • 93

upvoted 3 papers 6 months ago

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Paper • 2406.12624 • Published Jun 18 • 36

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published Jun 17 • 30

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

upvoted an article 7 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13

• 385

upvoted a paper 7 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31 • 63

upvoted a paper 8 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 66

upvoted an article 8 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7

• 42