Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yifan Peng's picture
9 10 18

Yifan Peng

pyf98
michaljunczyk's profile picture Soroor's profile picture ak4off's profile picture
·
https://pyf98.github.io
  • pyf98

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Recent Activity

new activity 2 days ago
nvidia/Nemotron-H-8B-Reasoning-128K:Errors in HybridMambaAttentionDynamicCache
upvoted an article about 1 month ago
Gotchas in Tokenizer Behavior Every Developer Should Know
liked a model about 1 month ago
google/gemma-3-1b-pt
View all activity

Organizations

ESPnet's profile picture Blog-explorers's profile picture YODAS Sharing inc's profile picture Nvidia Data&Tools team's profile picture

pyf98 's collections 1

Open Whisper-style Speech Models (OWSM)
Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
  • Sleeping
    55
    55

    OWSM Demo

    🔊

  • espnet/yodas_owsmv4

    Updated Jun 4 • 105 • 10
  • espnet/owsm_ctc_v4_1B

    Automatic Speech Recognition • Updated Jun 8 • 1.13k • 4
  • espnet/owsm_v4_medium_1B

    Automatic Speech Recognition • Updated Jun 4 • 16 • 2
Open Whisper-style Speech Models (OWSM)
Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
  • Sleeping
    55
    55

    OWSM Demo

    🔊

  • espnet/yodas_owsmv4

    Updated Jun 4 • 105 • 10
  • espnet/owsm_ctc_v4_1B

    Automatic Speech Recognition • Updated Jun 8 • 1.13k • 4
  • espnet/owsm_v4_medium_1B

    Automatic Speech Recognition • Updated Jun 4 • 16 • 2
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs