realbenpope's Collections
Recurrent architecture

Updated Aug 16, 2024

  • Layerwise Recurrent Router for Mixture-of-Experts

    Paper • 2408.06793 • Published Aug 13, 2024 • 33