Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Chenyang Song's picture
12 20 7

Chenyang Song

Raincleared
BryantMcGill's profile picture 21world's profile picture ZSKHGA's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
μ-Parametrization for Mixture of Experts
authored a paper about 1 month ago
ConPET: Continual Parameter-Efficient Tuning for Large Language Models
authored a paper about 1 month ago
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
View all activity

Organizations

OpenBMB's profile picture SparseLLMs's profile picture PowerInfer's profile picture

liked 5 models about 1 year ago

mistralai/Mistral-Large-Instruct-2407

123B • Updated 21 days ago • 10.8k • 837

openbmb/MiniCPM-S-1B-sft-gguf

Updated Jul 4, 2024 • 6 • 6

openbmb/MiniCPM-S-1B-sft-llama-format

Text Generation • Updated Sep 7, 2024 • 8 • 4

openbmb/MiniCPM-S-1B-sft

Text Generation • 1B • Updated Nov 6, 2024 • 868 • 11

SparseLLM/ProSparse-MiniCPM-1B-sft

Text Generation • Updated Jun 3, 2024 • 6 • 3
liked 2 models over 1 year ago

databricks/dbrx-base

Text Generation • 132B • Updated Apr 19, 2024 • 6 • 560

deepseek-ai/deepseek-moe-16b-chat

Text Generation • 16B • Updated Feb 5, 2024 • 7.25k • 146
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs