Yohan Na PRO

nayohan

AI & ML interests

NLP, Dialogue systems

Recent Activity

liked a model about 5 hours ago

NovaSky-AI/Sky-T1-32B-Preview

upvoted a paper about 9 hours ago

HelpSteer2-Preference: Complementing Ratings with Preferences

liked a dataset about 9 hours ago

nvidia/HelpSteer2

nayohan's activity

upvoted 3 papers about 9 hours ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 22

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 9 days ago • 45

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published 27 days ago • 52

upvoted a paper about 13 hours ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 74

upvoted a paper 2 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 4 days ago • 60

upvoted a collection 9 days ago

Deepseek Papers

Deepseek papers collection • 14 items • Updated 13 days ago • 9

upvoted a paper 22 days ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 24 days ago • 48

upvoted 2 papers 23 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 23 days ago • 339

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Paper • 2412.13746 • Published 24 days ago • 9

upvoted 2 collections 29 days ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 35

🧪 FineWeb v1 data experiments

Ablation models trained for our data experiments. • 22 items • Updated Jun 12, 2024 • 4

upvoted a collection about 1 month ago

Open Preference Datasets

Please help compile good public datasets for alignment learning! (English or multilingual) (ultrafeedback-binarized format) • 2 items • Updated Dec 4, 2024 • 1

upvoted a paper about 2 months ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 18

upvoted 3 papers 3 months ago

Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations

Paper • 2310.13420 • Published Oct 20, 2023 • 2

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 70

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8, 2024 • 108

upvoted a collection 6 months ago

Korean-English Parallel Datasets (한국어-영어 병렬 데이터셋)

6 items • Updated Jul 17, 2024 • 3

upvoted 2 papers 6 months ago

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 34

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30, 2024 • 73

upvoted a collection 6 months ago

Text datasets with missing language information

120 items • Updated Jul 3, 2024 • 4