Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
11
Daniel Jones
Markon
Follow
0 followers
Ā·
3 following
https://www.danieljosephjones.com
AI & ML interests
None yet
Recent Activity
reacted
to
KaiChen1998
's
post
with š„
about 7 hours ago
š¢ Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)! š¤ EMOVA is a novel end-to-end omni-modal LLM that can see, hear and speak. Given omni-modal (i.e., textual, visual and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional controls by utilizing the speech decoder and a style controller. āØ EMOVA Highlights ā State-of-the-art omni-modality: EMOVA achieves SoTA comparable results on both vision-language and speech benchmarks simultaneously. ā Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)! ā Modular design: we integrate multiple implementations of vision encoder, vision projector, and language model, even including the most recent DeepSeekMoE-tiny! š„ You are all welcome to try and star! - Project page: https://emova-ollm.github.io/ - Github: https://github.com/emova-ollm/EMOVA - Demo: https://huggingface.co/spaces/Emova-ollm/EMOVA-demo
liked
a Space
3 months ago
JeffreyXiang/TRELLIS
liked
a model
4 months ago
stabilityai/stable-diffusion-3.5-medium
View all activity
Organizations
None yet
Markon
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a Space
3 months ago
Running
on
Zero
4.21k
4.21k
TRELLIS
š¢
Scalable and Versatile 3D Generation from images
liked
3 models
4 months ago
stabilityai/stable-diffusion-3.5-medium
Text-to-Image
ā¢
Updated
Oct 31, 2024
ā¢
147k
ā¢
ā¢
646
genmo/mochi-1-preview
Text-to-Video
ā¢
Updated
Dec 18, 2024
ā¢
21.5k
ā¢
ā¢
1.19k
gpt-omni/mini-omni2
Any-to-Any
ā¢
Updated
Oct 24, 2024
ā¢
473
ā¢
264
liked
a Space
7 months ago
Running
on
Zero
4.43k
4.43k
FLUX.1 [Schnell]
š
Generate images from text prompts
liked
a model
7 months ago
xinsir/controlnet-union-sdxl-1.0
Text-to-Image
ā¢
Updated
Jul 30, 2024
ā¢
117k
ā¢
1.35k
liked
a Space
9 months ago
Build error
443
443
Open Sora
ā”
liked
a model
about 1 year ago
segmind/SegMoE-4x2-v0
Text-to-Image
ā¢
Updated
Feb 8, 2024
ā¢
573
ā¢
25
liked
3 models
over 1 year ago
rozek/StableLM-3B-4E1T_GGUF
Updated
Nov 22, 2023
ā¢
8
ā¢
3
TheBloke/Thespis-13B-v0.4-AWQ
Text Generation
ā¢
Updated
Nov 9, 2023
ā¢
86
ā¢
2
TheBloke/JanniesBasedLigma-L2-13B-GGUF
Updated
Sep 27, 2023
ā¢
357
ā¢
3