Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ceshine Lee's picture
2 7 47

Ceshine Lee

ceshine
Youzhuzhu's profile picture Mi6paulino's profile picture Gargaz's profile picture
·
https://blog.ceshine.net
  • ceshine_en
  • ceshine

AI & ML interests

None yet

Organizations

Gradio-Blocks-Party's profile picture AI Starter Pack's profile picture

ceshine 's collections 1

Reading List
  • Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Paper • 2404.02258 • Published Apr 2, 2024 • 106
  • Jamba: A Hybrid Transformer-Mamba Language Model

    Paper • 2403.19887 • Published Mar 28, 2024 • 111
  • EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba

    Paper • 2403.09977 • Published Mar 15, 2024 • 11
  • SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series

    Paper • 2403.15360 • Published Mar 22, 2024 • 13
Reading List
  • Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Paper • 2404.02258 • Published Apr 2, 2024 • 106
  • Jamba: A Hybrid Transformer-Mamba Language Model

    Paper • 2403.19887 • Published Mar 28, 2024 • 111
  • EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba

    Paper • 2403.09977 • Published Mar 15, 2024 • 11
  • SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series

    Paper • 2403.15360 • Published Mar 22, 2024 • 13
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs