Mike Staub

mikestaub

https://michaelstaub.com

AI & ML interests

robot perception, 3d graphics

Recent Activity

liked a model 18 days ago

meta-llama/Llama-3.3-70B-Instruct

liked a model 26 days ago

Nexusflow/Athene-V2-Chat

liked a model 26 days ago

AIDC-AI/Marco-o1

Organizations

None yet

mikestaub's activity

upvoted a paper 3 months ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published Sep 20 • 68

upvoted 2 collections 4 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated 20 days ago • 180

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14 • 536

upvoted an article 5 months ago

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Article • Jul 31 • 59

upvoted a paper 5 months ago

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 75

upvoted a paper 6 months ago

Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 44

upvoted 2 collections 6 months ago

Phi 3 - Smashed

Many variations of Phi 3 with many compression techniques. • 8 items • Updated Apr 30 • 1

Gemma 2 Release

15 items • Updated 12 days ago • 206

upvoted an article 8 months ago

Fine-tune Llama 3 with ORPO

Article • Apr 22 • 228

upvoted a paper 10 months ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 77

upvoted 5 papers about 1 year ago

Context Tuning for Retrieval Augmented Generation

Paper • 2312.05708 • Published Dec 9, 2023 • 17

Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

Paper • 2312.03029 • Published Dec 5, 2023 • 23

GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis

Paper • 2312.02155 • Published Dec 4, 2023 • 12

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 80

LRM: Large Reconstruction Model for Single Image to 3D

Paper • 2311.04400 • Published Nov 8, 2023 • 47