4 21 8

Hyogun Lee

Haawron

AI & ML interests

Video understanding, multi-modal LLMs

Recent Activity

authored a paper 3 months ago

Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection

upvoted a paper 3 months ago

Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection

upvoted a paper 4 months ago

COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning

View all activity

Organizations

None yet

authored a paper 3 months ago

Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection

Paper • 2505.15205 • Published May 21 • 2

upvoted a paper 3 months ago

Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection

Paper • 2505.15205 • Published May 21 • 2

upvoted a paper 4 months ago

COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning

Paper • 2504.21850 • Published Apr 30 • 27

liked a model 4 months ago

facebook/PE-Core-L14-336

Zero-Shot Image Classification • Updated Apr 30 • 23.1k • 42

upvoted a collection 4 months ago

InternVideo2

Collection

InternVideo2 • 21 items • Updated Jun 9 • 22

upvoted a paper 5 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

liked 2 models 5 months ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 418k • • 1.56k

lmms-lab/llava-onevision-qwen2-7b-ov-chat

Text Generation • 8B • Updated Oct 23, 2024 • 4.34k • 23

upvoted a paper 6 months ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 35

upvoted 4 papers 8 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 64

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding

Paper • 2412.00493 • Published Nov 30, 2024 • 17

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 109

commented a paper 8 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147 •

upvoted 6 papers 8 months ago

Hyogun Lee

AI & ML interests

Recent Activity

Organizations

Haawron's activity