Collections
Discover the best community collections!
Collections trending this week
-
togethercomputer/StripedHyena-Hessian-7B
Text Generation • Updated • 147 • 65 -
Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Paper • 2312.08618 • Published • 15 -
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Paper • 2312.07987 • Published • 41 -
LLM360: Towards Fully Transparent Open-Source LLMs
Paper • 2312.06550 • Published • 57