Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZhenyuanGuo's picture
2 4 51

ZhenyuanGuo

YuanZ77
Xdotnet's profile picture
·

AI & ML interests

LLM

Organizations

Zhejiang University's profile picture Zhejiang University's profile picture

Collections 1

Papers
  • Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

    Paper • 2304.01373 • Published Apr 3, 2023 • 9
  • Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

    Paper • 2502.11089 • Published Feb 16 • 165
Papers
  • Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

    Paper • 2304.01373 • Published Apr 3, 2023 • 9
  • Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

    Paper • 2502.11089 • Published Feb 16 • 165

models 5

YuanZ77/zephyr-7b-sft-qlora

Updated Oct 22, 2024 • 2

YuanZ77/try

Updated Sep 17, 2024

YuanZ77/gemma2b-b-dei-easy

Updated Sep 16, 2024

YuanZ77/gemma2b-b

Text Generation • 3B • Updated Sep 16, 2024 • 2

YuanZ77/gemma2b-a

Text Generation • 3B • Updated Sep 16, 2024 • 3

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs