lf's picture

9 10

lf

lfnothing

·

AI & ML interests

None yet

Recent Activity

reacted to KaiChen1998's post with 👍 16 days ago

📢 Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)! 🤗 EMOVA is a novel end-to-end omni-modal LLM that can see, hear and speak. Given omni-modal (i.e., textual, visual and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional controls by utilizing the speech decoder and a style controller. ✨ EMOVA Highlights ✅ State-of-the-art omni-modality: EMOVA achieves SoTA comparable results on both vision-language and speech benchmarks simultaneously. ✅ Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)! ✅ Modular design: we integrate multiple implementations of vision encoder, vision projector, and language model, even including the most recent DeepSeekMoE-tiny! 🔥 You are all welcome to try and star! - Project page: https://emova-ollm.github.io/ - Github: https://github.com/emova-ollm/EMOVA - Demo: https://huggingface.co/spaces/Emova-ollm/EMOVA-demo

liked a dataset 5 months ago

Skylion007/openwebtext

liked a model 5 months ago

Salesforce/blip2-opt-2.7b

View all activity

Organizations

None yet

Collections 1

models 7

lfnothing/kapai-man

Text-to-Image • Updated Jul 17, 2024 • 4

lfnothing/audio-diffusion-electronic

Updated Jul 17, 2024 • 3

lfnothing/sd-class-butterflies-32

Unconditional Image Generation • Updated Jul 15, 2024 • 5

lfnothing/whisper-small-dv

Automatic Speech Recognition • Updated Jul 12, 2024 • 12

lfnothing/opt-125m-gptq

Text Generation • Updated Jul 2, 2024 • 5

lfnothing/distilbert-base-uncased-finetuned-imdb

Updated Jun 27, 2024

lfnothing/code-search-net-tokenizer

Updated Jun 27, 2024

datasets 3

lfnothing/agents_small_benchmark

Viewer • Updated Aug 8, 2024 • 100 • 30

lfnothing/dreambooth-hackathon-images

Viewer • Updated Jul 17, 2024 • 15 • 30

lfnothing/github-issues

Preview • Updated Jun 26, 2024 • 4