Heegyu Kim's picture

Heegyu Kim PRO

heegyu

·

https://sites.google.com/view/heegyu-kim/

AI & ML interests

NLP

Recent Activity

liked a Space 7 days ago

xiaozaa/catvton-flux-try-on

liked a Space 8 days ago

franciszzj/Leffa

liked a Space 8 days ago

Djrango/qwen2vl-flux-mini-demo

View all activity

Organizations

heegyu's activity

upvoted a collection 16 days ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated 15 days ago • 79

upvoted 2 collections about 2 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 12 items • Updated 7 days ago • 26

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 6 days ago • 95

upvoted 4 collections 2 months ago

Arch-Function

6 items • Updated Oct 29 • 8

LLM Safety Datasets

Korean safety, ethics dataset • 9 items • Updated Nov 23 • 2

En Ko Translate

영어 데이터셋을 한글로 번역한 데이터셋입니다. • 4 items • Updated Nov 6 • 1

Magpie Conversation Ko

Magpie 데이터셋 한국어 번역본 (@nayohan님 번역 모델 사용) • 10 items • Updated Nov 6 • 1

upvoted a paper 2 months ago

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

Paper • 2406.06565 • Published Jun 3 • 9

upvoted 3 collections 4 months ago

3D

Stability AI's suite of models for 3D generation • 5 items • Updated Aug 9 • 32

4bit Instruct Models

18 items • Updated 18 days ago • 26

DeepSeek-Prover

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16 • 18

upvoted 2 collections 6 months ago

Magpie-Qwen2 Datasets

Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Sep 14 • 10

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 66

upvoted a collection 7 months ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 23

upvoted a paper 7 months ago

DoRA: Weight-Decomposed Low-Rank Adaptation

Paper • 2402.09353 • Published Feb 14 • 26

upvoted a collection 8 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22 • 24

upvoted 2 papers 9 months ago

Shepherd: A Critic for Language Model Generation

Paper • 2308.04592 • Published Aug 8, 2023 • 31

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15 • 57

upvoted a collection 10 months ago

zephyr-7b-sft-full-SPIN

Models fine-tuned with SPIN across iterations 0,1,2,3 • 4 items • Updated Feb 7 • 7

upvoted a paper 10 months ago

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9 • 54