
boqsc

AI & ML interests

None yet

Organizations

None yet

boqsc's activity

liked a model 22 days ago

arcee-ai/Arcee-Maestro-7B-Preview

Text Generation • Updated 21 days ago • 4.62k • 36

liked a model 4 months ago

PramaLLC/BEN

Image Segmentation • Updated Jan 26 • 253 • 85

liked a Space 4 months ago

1.42k

Qwen2.5 Coder Artifacts

🐢

Generate code from a description

liked a model 6 months ago

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Jan 15 • 25.1k • 1.39k

New activity in mattshumer/Reflection-Llama-3.1-70B 6 months ago

Is this really Llama 3.1-70B? the config.json say: Meta-Llama-3-70B-Instruct

#35 opened 6 months ago by

boqsc

I created the Llama-3.1-8B Version

#38 opened 6 months ago by

gr0010

liked 2 models 6 months ago

mradermacher/Reflection-Llama-3.1-70B-bf16-GGUF

Updated Sep 6, 2024 • 383 • 7

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 698 • 1.72k

liked a model 7 months ago

multimodalart/flux-tarot-v1

Text-to-Image • Updated Aug 16, 2024 • 11.5k • 209

liked a model 9 months ago

gradientai/Llama-3-70B-Instruct-Gradient-1048k

Text Generation • Updated Oct 28, 2024 • 161 • 121

liked a model 11 months ago

gradientai/Llama-3-8B-Instruct-Gradient-1048k

Text Generation • Updated Oct 29, 2024 • 5.3k • 682

liked 3 models about 1 year ago

liked a Space about 1 year ago

382

WhisperSpeech

🌬

reacted to fffiloni's post with ❤️ about 1 year ago

Post

I'm happy to announce that ✨ Image to Music v2 ✨ is ready for you to try, and I hope you'll like it too! 😌

This new version has been crafted with transparency in mind,
so you can understand the process of translating an image to a musical equivalent.

How does it work under the hood? 🤔

First, we get a very literal caption from microsoft/kosmos-2-patch14-224; this caption is then given to an LLM agent (currently HuggingFaceH4/zephyr-7b-beta), whose task is to translate the image caption into a musical and inspirational prompt for the next step.

Once we have a nice musical text from the LLM, we can send it to the text-to-music model of your choice:
MAGNeT, MusicGen, AudioLDM-2, Riffusion or Mustango
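
The steps above can be sketched as a small Python pipeline. This is only an illustration, not the Space's actual code: `image_to_music`, `MusicResult`, and the three stand-in stage functions are hypothetical names, and the real captioning, LLM, and text-to-music calls are replaced by placeholders so the sketch runs without network access.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class MusicResult:
    """Keeps every intermediate step so the process stays inspectable."""
    caption: str   # literal description of the image
    prompt: str    # musical prompt derived from the caption
    audio: bytes   # output of the chosen text-to-music stage


def image_to_music(
    image_path: str,
    caption_image: Callable[[str], str],
    write_prompt: Callable[[str], str],
    generate_music: Callable[[str], bytes],
    prompt_override: Optional[str] = None,
) -> MusicResult:
    """Chain the three stages: image -> caption -> musical prompt -> audio.

    Each stage is passed in as a callable (an assumption of this sketch),
    so any captioner, LLM, or text-to-music model could be plugged in.
    """
    caption = caption_image(image_path)
    # The user may adjust the prompt by hand before the music step.
    prompt = prompt_override if prompt_override is not None else write_prompt(caption)
    audio = generate_music(prompt)
    return MusicResult(caption=caption, prompt=prompt, audio=audio)


# Placeholder stages (hypothetical): they only echo their input.
def fake_caption(image_path: str) -> str:
    return f"a literal caption of {image_path}"


def fake_prompt(caption: str) -> str:
    return f"uplifting music inspired by: {caption}"


def fake_music(prompt: str) -> bytes:
    return prompt.encode("utf-8")


if __name__ == "__main__":
    result = image_to_music("beach.png", fake_caption, fake_prompt, fake_music)
    print(result.caption)
    print(result.prompt)
```

Passing `prompt_override` mirrors the option of editing the inspirational prompt by hand before choosing a music model; the caption is still computed and returned so it can be compared with the edited prompt.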

Unlike the previous version of Image to Music, which used the Mubert API and could output curious and obscure combinations, this version only provides open-source models available on the Hub, called via the Gradio API.

Also, I guess the music should match the atmosphere of the input image more closely, thanks to the LLM agent step.

Pro tip: you can adjust the inspirational prompt to match your expectations, according to the chosen model and its specific behavior 👌

Try it, explore different models and tell me which one is your favorite 🤗
→ fffiloni/image-to-music-v2