9 21 3

Xiang Liu

Dominic789654

https://dominic789654.github.io/

Dominic789654

AI & ML interests

None yet

Recent Activity

authored a paper about 13 hours ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

upvoted a paper 1 day ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

commented on a paper 1 day ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

View all activity

Organizations

None yet

Dominic789654's activity

upvoted a paper 1 day ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

Paper • 2502.17535 • Published 3 days ago • 6

upvoted a paper 9 days ago

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Paper • 2502.12669 • Published 10 days ago • 2

upvoted a paper 15 days ago

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published 21 days ago • 4

upvoted a paper 23 days ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 24 days ago • 14

upvoted a paper 24 days ago

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

Paper • 2502.00299 • Published 27 days ago • 3

upvoted 3 papers about 1 month ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 83

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 99

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 333

upvoted a paper about 2 months ago

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published Jan 2 • 17

upvoted 3 papers 2 months ago

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Paper • 2412.17483 • Published Dec 23, 2024 • 31

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

Paper • 2412.13649 • Published Dec 18, 2024 • 20

upvoted 2 papers 4 months ago

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

Paper • 2410.18785 • Published Oct 24, 2024 • 6

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 22

upvoted 3 papers 5 months ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 50

LPZero: Language Model Zero-cost Proxy Search from Zero

Paper • 2410.04808 • Published Oct 7, 2024 • 2

LongGenBench: Long-context Generation Benchmark

Paper • 2410.04199 • Published Oct 5, 2024 • 20

upvoted a paper 7 months ago

3D Question Answering for City Scene Understanding

Paper • 2407.17398 • Published Jul 24, 2024 • 22

upvoted a paper 9 months ago

Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models

Paper • 2406.02924 • Published Jun 5, 2024 • 2

upvoted a paper 11 months ago

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26, 2024 • 16