Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Gausson Tschen's picture
4 3 2

Gausson Tschen

Gausson
Reybabylon's profile picture zhg2025's profile picture
·
https://www.xiaohongshu.com/user/profile/615bd9080000000002018213
  • GaussonTschen
  • GaussonTschen

AI & ML interests

LLM Architecture, Pre-training, Deep Neural Network Optimization, Sparsity

Recent Activity

new activity 12 days ago
nvidia/kvpress-leaderboard:Upload the results of the training-free version of the method [SepLLM - ICML 2025 Paper](https://arxiv.org/abs/2412.12094) based on "meta-llama/Meta-Llama-3.1-8B-Instruct"
updated a model 14 days ago
Gausson/sep_cache
updated a model 14 days ago
transformers-community/sep_cache
View all activity

Organizations

Data Intelligence Lab@HKU's profile picture Transformers Community's profile picture

upvoted 2 collections 19 days ago

Custom generation methods - Community

Collection
Custom generation methods created and maintained by the community, and highlighted by our team • 1 item • Updated 19 days ago • 3

SepLLM - ICML 2025

Collection
The related code & checkpoints for [SepLLM - ICML 2025](https://arxiv.org/abs/2412.12094) paper. • 8 items • Updated 19 days ago • 1
upvoted a paper 27 days ago

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published Dec 16, 2024 • 11
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs