A Controlled Study on Long Context Extension and Generalization in LLMs Paper • 2409.12181 • Published Sep 18, 2024 • 44
Characterizing Prompt Compression Methods for Long Context Inference Paper • 2407.08892 • Published Jul 11, 2024 • 9
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 45
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Paper • 2407.02490 • Published Jul 2, 2024 • 23
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4, 2024 • 38
TransformerFAM: Feedback attention is working memory Paper • 2404.09173 • Published Apr 14, 2024 • 44
Meta Llama 3 Collection • This collection hosts the Transformers-format and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 708
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 115
Speculative Streaming: Fast LLM Inference without Auxiliary Models Paper • 2402.11131 • Published Feb 16, 2024 • 43
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper • 2309.14717 • Published Sep 26, 2023 • 44
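Several entries in this collection (LongRoPE in particular, and the controlled extension study at the top) revolve around rescaling RoPE position indices so that a model trained at one context length can attend over a longer one. As a point of reference, the following is a minimal NumPy sketch of plain linear position interpolation over standard RoPE. It illustrates only the shared rescaling idea, not LongRoPE's searched non-uniform per-dimension factors, and all names here (rope_freqs, apply_rope, interpolated_positions, train_len) are illustrative choices, not an API from any of these papers.

```python
import numpy as np

def rope_freqs(head_dim, base=10000.0):
    # Standard RoPE inverse frequencies: theta_i = base^(-2i / head_dim).
    return base ** (-np.arange(0, head_dim, 2) / head_dim)

def apply_rope(x, positions, freqs):
    # x: (seq, head_dim). Rotate each (even, odd) feature pair
    # by the angle position * theta_i.
    angles = np.outer(positions, freqs)          # (seq, head_dim / 2)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, 0::2], x[:, 1::2]
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin
    out[:, 1::2] = x1 * sin + x2 * cos
    return out

def interpolated_positions(seq_len, train_len):
    # Position interpolation: squeeze inference-time positions back into
    # the range seen during training, so rotation angles stay in-distribution.
    scale = min(1.0, train_len / seq_len)
    return np.arange(seq_len) * scale

# Example: run 8K positions through RoPE for a model trained at 4K context.
x = np.random.randn(8192, 64)                    # (seq, head_dim) queries/keys
pos = interpolated_positions(seq_len=8192, train_len=4096)
x_rot = apply_rope(x, pos, rope_freqs(head_dim=64))
```

Extension methods such as LongRoPE replace the single uniform scale above with non-uniform, per-frequency rescaling factors found by search, which is what lets them push far beyond the trained window; the sketch shows only the uniform baseline such methods improve on.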