Ahmed El-Rufaei's picture
3 32

Ahmed El-Rufaei

ahmed-ai

AI & ML interests

Computer vision, LLMs, NLP, Deep learning, Healthcare, Genetics, Neuroscience

Recent Activity

updated a model about 1 month ago
ahmed-ai/galen
liked a model 6 months ago
meta-llama/Llama-3.1-8B-Instruct
View all activity

Organizations

AIModels.org's profile picture DevAgent's profile picture

ahmed-ai's activity

upvoted an article 4 months ago
view article
Article

Vision Language Models Explained

245
updated a Space 8 months ago
reacted to akhaliq's post with 😎👀❤️🚀 10 months ago
view post
Post
2254
Mora

Enabling Generalist Video Generation via A Multi-Agent Framework

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework (2403.13248)

Sora is the first large-scale generalist video generation model that garnered significant attention across society. Since its launch by OpenAI in February 2024, no other video generation models have paralleled {Sora}'s performance or its capacity to support a broad spectrum of video generation tasks. Additionally, there are only a few fully published video generation models, with the majority being closed-source. To address this gap, this paper proposes a new multi-agent framework Mora, which incorporates several advanced visual AI agents to replicate generalist video generation demonstrated by Sora. In particular, Mora can utilize multiple visual agents and successfully mimic Sora's video generation capabilities in various tasks, such as (1) text-to-video generation, (2) text-conditional image-to-video generation, (3) extend generated videos, (4) video-to-video editing, (5) connect videos and (6) simulate digital worlds. Our extensive experimental results show that Mora achieves performance that is proximate to that of Sora in various tasks. However, there exists an obvious performance gap between our work and Sora when assessed holistically. In summary, we hope this project can guide the future trajectory of video generation through collaborative AI agents.