Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
48
9
819
nothing
trysem
Follow
VishnuPJ's profile picture
Mirou2n's profile picture
asigalov61's profile picture
9 followers
Β·
28 following
AI & ML interests
None yet
Recent Activity
liked
a dataset
7 days ago
opendiffusionai/laion2b-23ish-woman-solo
reacted
to
hexgrad
's
post
with π₯
8 days ago
Wanted: Peak Data. I'm collecting audio data to train another TTS model: + AVM data: ChatGPT Advanced Voice Mode audio & text from source + Professional audio: Permissive (CC0, Apache, MIT, CC-BY) This audio should *impress* most native speakers, not just barely pass their audio Turing tests. Professional-caliber means S or A-tier, not your average bloke off the street. Traditional TTS may not make the cut. Absolutely no low-fi microphone recordings like Common Voice. The bar is much higher than last time, so there are no timelines yet and I expect it may take longer to collect such mythical data. Raising the bar means evicting quite a bit of old data, and voice/language availability may decrease. The theme is *quality* over quantity. I would rather have 1 hour of A/S-tier than 100 hours of mid data. I have nothing to offer but the north star of a future Apache 2.0 TTS model, so prefer data that you *already have* and costs you *nothing extra* to send. Additionally, *all* the new data may be used to construct public, Apache 2.0 voicepacks, and if that arrangement doesn't work for you, no need to send any audio. Last time I asked for horses; now I'm asking for unicorns. As of writing this post, I've currently got a few English & Chinese unicorns, but there is plenty of room in the stable. Find me over on Discord at `rzvzn`: https://discord.gg/QuGxSWBfQy
liked
a Space
8 days ago
Steveeeeeeen/Zonos
View all activity
Organizations
None yet
spaces
55
Sort:Β Recently updated
pinned
Runtime error
2
AnimeGANv3
π
pinned
Runtime error
2
Test Vin
π
Sleeping
1
test nemo
π
Sleeping
test nemo
π
Runtime error
LLaMAX3-Translator
π
LLaMAX3 Translator
Sleeping
SeemoRe
π»
Expand 55 spaces
models
10
Sort:Β Recently updated
trysem/conformer-ml
Automatic Speech Recognition
β’
Updated
Sep 25, 2024
β’
7
β’
1
trysem/W2V2-BERT-withLM-Malayalam
Automatic Speech Recognition
β’
Updated
Sep 25, 2024
β’
9
β’
1
trysem/IndicTTS-Malayalam
Text-to-Speech
β’
Updated
Apr 5, 2024
β’
8
trysem/resemble-enhance
Audio-to-Audio
β’
Updated
Mar 11, 2024
β’
1
trysem/DualStyleGANplus
Updated
Mar 19, 2023
β’
1
trysem/restyle-e4e-ffhq
Updated
Feb 23, 2023
trysem/pixel2style2pixel
Updated
Feb 23, 2023
β’
1
trysem/DreamShaper-3.3
Text-to-Image
β’
Updated
Jan 19, 2023
β’
75
β’
β’
2
trysem/Time-Travel-Rephotography_Spectral_sensitivy
Updated
Jul 2, 2022
trysem/rfag
Updated
Jun 15, 2022
datasets
None public yet