Seungwoo Ryu PRO

tryumanshow

AI & ML interests

LLM, Agent

Recent Activity

updated a dataset 1 day ago

tryumanshow/RAG-Instruct-ko

updated a model 4 days ago

tryumanshow/Qwen2.5-3B-Instruct-ko

liked a Space 4 days ago

hf-accelerate/model-memory-usage

tryumanshow's activity

upvoted 2 collections 2 months ago

Korean Instruction Dataset

4 items • Updated Nov 2, 2024 • 4

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted a collection 3 months ago

Korean Reward Modeling

Korean Datasets, Reward Models for RLHF • 16 items • Updated Nov 19, 2024 • 3

upvoted a paper 3 months ago

DiaSynth -- Synthetic Dialogue Generation Framework

Paper • 2409.19020 • Published Sep 25, 2024 • 20

upvoted an article 4 months ago

The 5 Most Under-Rated Tools on Hugging Face

Article • Aug 22, 2024 • 86

upvoted a collection 5 months ago

LLMs

361 items • Updated 2 days ago • 25

upvoted 2 papers 7 months ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20, 2024 • 30

upvoted 4 collections 8 months ago

Function Calling v3

Models fine-tuned for function-calling • 14 items • Updated Apr 27, 2024 • 20

Agents

Collection of resources related to Agents. • 70 items • Updated about 11 hours ago • 5

Miqu-based Models

A collection of creative writing models based on the 'miqu-1-70b' model. • 9 items • Updated Dec 3, 2024 • 2

Agents

63 items • Updated 2 days ago • 5

upvoted a paper 8 months ago

Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3

Paper • 2405.00664 • Published May 1, 2024 • 18

upvoted an article 8 months ago

Mixture of Experts Explained

Article • Dec 11, 2023 • 245

upvoted a collection 9 months ago

Long context

94 items • Updated Sep 29, 2024 • 30

upvoted an article 9 months ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 230

upvoted a collection 9 months ago

Handbook v0.1 models and datasets

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24