Daniel Jones's picture

11

Daniel Jones

Markon

·

https://www.danieljosephjones.com

AI & ML interests

None yet

Recent Activity

reacted to KaiChen1998's post with 🔥 about 7 hours ago

📢 Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)! 🤗 EMOVA is a novel end-to-end omni-modal LLM that can see, hear and speak. Given omni-modal (i.e., textual, visual and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional controls by utilizing the speech decoder and a style controller. ✨ EMOVA Highlights ✅ State-of-the-art omni-modality: EMOVA achieves SoTA comparable results on both vision-language and speech benchmarks simultaneously. ✅ Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)! ✅ Modular design: we integrate multiple implementations of vision encoder, vision projector, and language model, even including the most recent DeepSeekMoE-tiny! 🔥 You are all welcome to try and star! - Project page: https://emova-ollm.github.io/ - Github: https://github.com/emova-ollm/EMOVA - Demo: https://huggingface.co/spaces/Emova-ollm/EMOVA-demo

liked a Space 3 months ago

JeffreyXiang/TRELLIS

liked a model 4 months ago

stabilityai/stable-diffusion-3.5-medium

View all activity

Organizations

None yet

Markon's activity

liked a Space 3 months ago

TRELLIS

Scalable and Versatile 3D Generation from images

liked 3 models 4 months ago

stabilityai/stable-diffusion-3.5-medium

Text-to-Image • Updated Oct 31, 2024 • 147k • • 646

genmo/mochi-1-preview

Text-to-Video • Updated Dec 18, 2024 • 21.5k • • 1.19k

gpt-omni/mini-omni2

Any-to-Any • Updated Oct 24, 2024 • 473 • 264

liked a Space 7 months ago

FLUX.1 [Schnell]

Generate images from text prompts

liked a model 7 months ago

xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 117k • 1.35k

liked a Space 9 months ago

Open Sora

liked a model about 1 year ago

segmind/SegMoE-4x2-v0

Text-to-Image • Updated Feb 8, 2024 • 573 • 25

liked 3 models over 1 year ago

rozek/StableLM-3B-4E1T_GGUF

Updated Nov 22, 2023 • 8 • 3

TheBloke/Thespis-13B-v0.4-AWQ

Text Generation • Updated Nov 9, 2023 • 86 • 2

TheBloke/JanniesBasedLigma-L2-13B-GGUF

Updated Sep 27, 2023 • 357 • 3