Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Paper • 2412.18279 • Published Dec 24, 2024
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs Paper • 2410.18451 • Published Oct 24, 2024 • 18
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published Dec 5, 2024 • 10
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist Paper • 2402.18485 • Published Feb 28, 2024
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models Paper • 2406.06563 • Published Jun 3, 2024 • 20
Ingredients: Blending Custom Photos with Video Diffusion Transformers Paper • 2501.01790 • Published Jan 3, 2025 • 8
MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation Paper • 2411.18281 • Published Nov 27, 2024
LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video Translation Paper • 2311.00353 • Published Nov 1, 2023
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing Paper • 2412.04280 • Published Dec 5, 2024 • 14
RelationBooth: Towards Relation-Aware Customized Object Generation Paper • 2410.23280 • Published Oct 30, 2024
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models Paper • 2410.13370 • Published Oct 17, 2024 • 37
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval Paper • 2207.04858 • Published Jul 11, 2022
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Paper • 2410.08261 • Published Oct 10, 2024 • 50