Jinyeong Kim's picture

435 9

Jinyeong Kim

rubatoyeong

·

rubato-yeong

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

Judge Anything: MLLM as a Judge Across Any Modality

upvoted a paper about 6 hours ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

upvoted a paper about 7 hours ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

View all activity

Organizations

None yet

rubatoyeong's activity

upvoted 2 papers about 6 hours ago

Judge Anything: MLLM as a Judge Across Any Modality

Paper • 2503.17489 • Published 4 days ago • 18

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 1 day ago • 90

upvoted a paper about 7 hours ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published 5 days ago • 64

upvoted 2 papers 2 days ago

Where do Large Vision-Language Models Look at when Answering Questions?

Paper • 2503.13891 • Published 8 days ago • 6

CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners

Paper • 2503.16356 • Published 6 days ago • 14

upvoted 2 papers 5 days ago

M3: 3D-Spatial MultiModal Memory

Paper • 2503.16413 • Published 6 days ago • 14

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published 7 days ago • 43

upvoted 2 papers 7 days ago

Aligning Multimodal LLM with Human Preference: A Survey

Paper • 2503.14504 • Published 8 days ago • 20

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Paper • 2503.06269 • Published 18 days ago • 4

upvoted 2 papers 8 days ago

DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models

Paper • 2503.12885 • Published 9 days ago • 41

On the Limitations of Vision-Language Models in Understanding Image Transforms

Paper • 2503.09837 • Published 13 days ago • 10

upvoted a paper 11 days ago

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Paper • 2503.10602 • Published 13 days ago • 4

upvoted 4 papers 12 days ago

Discovering Influential Neuron Path in Vision Transformers

Paper • 2503.09046 • Published 14 days ago • 6

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published 13 days ago • 72

Transformers without Normalization

Paper • 2503.10622 • Published 13 days ago • 135

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 14 days ago • 62

upvoted 4 papers 13 days ago

Mixture of Experts Made Intrinsically Interpretable

Paper • 2503.07639 • Published 21 days ago • 8

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published 15 days ago • 33

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Paper • 2503.06698 • Published 17 days ago • 4

Words or Vision: Do Vision-Language Models Have Blind Faith in Text?

Paper • 2503.02199 • Published 22 days ago • 8