
Syahmi Azhar

prsyahmi

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago
Etched/oasis-500m

Organizations

None yet

prsyahmi's activity

reacted to ginipick's post with 👍 3 days ago
🌟 Digital Odyssey: AI Image & Video Generation Platform 🎨
Welcome to our all-in-one AI platform for image and video generation! 🚀
✨ Key Features

🎨 High-quality image generation from text
🎥 Video creation from still images
🌐 Multi-language support with automatic translation
🛠️ Advanced customization options

💫 Unique Advantages

⚡ Fast and accurate results using FLUX.1-dev and Hyper-SD models
🔒 Robust content safety filtering system
🎯 Intuitive user interface
🛠️ Extended toolkit including image upscaling and logo generation

🎮 How to Use

Enter your image or video description
Adjust settings as needed
Click generate
Save and share your results automatically

🔧 Tech Stack

FluxPipeline
Gradio
PyTorch
OpenCV

link: ginigen/Dokdo

Turn your imagination into reality with AI! ✨
#AI #ImageGeneration #VideoGeneration #MachineLearning #CreativeTech
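
The Space's source isn't shown, but text-to-image with the listed stack typically goes through diffusers' FluxPipeline. A minimal sketch of that path, where the model id, resolution, and step count are assumptions (and running it requires a GPU plus access to the gated FLUX.1-dev weights):

```python
import torch
from diffusers import FluxPipeline

# Assumed model id for the "FLUX.1-dev" model mentioned in the post.
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # trades speed for lower VRAM use

image = pipe(
    prompt="a lighthouse on a rocky island at dawn",  # 1. describe the image
    height=1024, width=1024,                          # 2. adjust settings
    guidance_scale=3.5,
    num_inference_steps=28,
).images[0]                                           # 3. generate
image.save("result.png")                              # 4. save the result
```

A real app would wrap this call in a Gradio interface and add the translation and safety-filter steps the post mentions.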
reacted to MoritzLaurer's post with 👍 3 days ago
Quite excited by the ModernBERT release! Small at 0.15B/0.4B parameters, 2T tokens of modern pre-training data (including code) with an updated tokenizer, and an 8k context window: a great, efficient model for embeddings & classification!

This will probably be the basis for many future SOTA encoders! And I can finally stop using DeBERTaV3 from 2021 :D

Congrats @answerdotai, @LightOnIO and collaborators like @tomaarsen!

Paper and models here 👇 https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb
liked a Space about 2 months ago
reacted to singhsidhukuldeep's post with 🤗 7 months ago
You are all happy 😊 that @meta-llama released Llama 3.

Then you are sad 😔 that it only has a context length of 8k.

Then you are happy 😄 that you can scale Llama-3 to 96k with PoSE without training, only needing to modify max_position_embeddings and rope_theta.
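
To see why raising rope_theta stretches the usable context, here is a small self-contained sketch of rotary-embedding frequencies. The head dimension and theta values are illustrative assumptions (128 and 500000 match Llama-3's published config; the larger theta is hypothetical):

```python
import math

def rope_inv_freq(head_dim: int, theta: float) -> list[float]:
    # Rotary embeddings rotate each channel pair i at angular
    # frequency inv_freq[i] = theta ** (-2*i / head_dim).
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

# Illustrative values: Llama-3-style head_dim=128, base rope_theta=500000,
# and a hypothetical larger theta for longer contexts.
base = rope_inv_freq(128, 500_000.0)
scaled = rope_inv_freq(128, 4_000_000.0)

# Wavelength (in tokens) of the slowest-rotating channel pair: raising
# theta stretches it, so positions tens of thousands of tokens apart
# still map to distinct rotation angles.
longest = lambda inv: 2.0 * math.pi / inv[-1]
print(f"base: {longest(base):.0f} tokens, scaled: {longest(scaled):.0f} tokens")
```

Every non-constant frequency shrinks as theta grows, which is the whole effect behind theta-scaling tricks like the one described above.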

But then you are sad 😢 that it only improves the model's long-context retrieval performance (i.e., finding needles) while hardly improving its long-context utilization capability (doing QA and summarization).

But then you are happy that the @GradientsTechnologies community has released the long-context Llama-3-8B-Instruct-262K (262k-1M+).

Now we have another paper, "Extending Llama-3's Context Ten-Fold Overnight" 📜.

The context length of Llama-3-8B-Instruct is extended from 8K to 80K using QLoRA fine-tuning ⚙️.

The training cycle is highly efficient, taking "only" 😂 8 hours on a single 8xA800 (80G) GPU machine.

The model also preserves its original capability over short contexts.

The dramatic context extension is mainly attributed to merely 3.5K synthetic training samples generated by GPT-4. 📊

The paper suggests that the context length could be extended far beyond 80K with more computational resources (😅 GPU-poor).

The team plans to publicly release all resources, including data, the model, the data-generation pipeline, and training code, to facilitate future research from the ❤️ community.

Paper: https://arxiv.org/abs/2404.19553

This is where we are... until next time... 🌟

Extending Llama-3's Context Ten-Fold Overnight (2404.19553)