SHI Labs

university

https://www.humphreyshi.com/

humphrey_shi

shi-labs

Activity Feed Request to join this org

AI & ML interests

Computer Vision, AI, Machine Learning

Recent Activity

praeclarumjj3 updated a Space about 15 hours ago

shi-labs/VCoder

Humphrey authored a paper 9 days ago

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

praeclarumjj3 updated a Space 11 days ago

shi-labs/OLA-VLM

View all activity

shi-labs's activity

praeclarumjj3

updated a Space about 15 hours ago

Runtime error

✌️

VCoder

Humphrey

authored a paper 9 days ago

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published 13 days ago • 10

praeclarumjj3

updated a Space 11 days ago

Running on Zero

🔍

OLA-VLM

praeclarumjj3

authored 2 papers 12 days ago

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Paper • 2405.05949 • Published May 9 • 2

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published 13 days ago • 10

praeclarumjj3

updated a collection 14 days ago

Multimodal AI

Collection

Large multimodal models • 18 items • Updated 14 days ago • 2

praeclarumjj3

in shi-labs/OLA-VLM 15 days ago

Apply for community grant: Academic project (gpu)

#1 opened 17 days ago by

praeclarumjj3

Humphrey

authored a paper 4 months ago

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28 • 84

Humphrey

authored a paper 9 months ago

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Paper • 2403.14773 • Published Mar 21 • 10

praeclarumjj3

authored a paper 12 months ago

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Paper • 2312.14233 • Published Dec 21, 2023 • 16

Humphrey

authored 3 papers about 1 year ago

HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

Paper • 2312.14091 • Published Dec 21, 2023 • 15

Neighborhood Attention Transformer

Paper • 2204.07143 • Published Apr 14, 2022

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Paper • 2312.04410 • Published Dec 7, 2023 • 14

JamesXu

authored a paper about 1 year ago

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Paper • 2312.04410 • Published Dec 7, 2023 • 14

JiayiGuo821

authored a paper about 1 year ago

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Paper • 2312.04410 • Published Dec 7, 2023 • 14

Humphrey

authored 2 papers about 1 year ago

HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models

Paper • 2312.00079 • Published Nov 30, 2023 • 14

Video Instance Matting

Paper • 2311.04212 • Published Nov 7, 2023 • 7

Humphrey

authored 3 papers over 1 year ago

AI & ML interests

Recent Activity

Team members 11

shi-labs's activity

VCoder

OLA-VLM

Apply for community grant: Academic project (gpu)