PommesPeter's picture

PommesPeter

PommesPeter

·

PommesPeter

AI & ML interests

MM-LLM

Recent Activity

liked a dataset 25 days ago

agentica-org/DeepScaleR-Preview-Dataset

liked a dataset 25 days ago

open-r1/OpenR1-Math-220k

liked a Space 3 months ago

huggingface/open-source-ai-year-in-review-2024

View all activity

Organizations

PommesPeter's activity

upvoted a collection 5 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 574

upvoted 3 papers 6 months ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

Paper • 2409.15278 • Published Sep 23, 2024 • 25

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?

Paper • 2409.15277 • Published Sep 23, 2024 • 36

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 137

upvoted a paper 8 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131

upvoted an article 8 months ago

Article

We are hiring interns!

Nov 29, 2022

• 11

upvoted a collection 9 months ago

Lumina Family

Lumina-T2X is a unified framework for Text to Any Modality Generation • 8 items • Updated Jul 30, 2024 • 6

upvoted 2 collections 10 months ago

SPHINX Family

2 items • Updated May 18, 2024 • 1

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 235

upvoted an article 11 months ago

Article

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Jan 19, 2021

• 4

upvoted a collection 11 months ago

WizardLM

0 items • Updated Jan 8 • 106

upvoted 2 papers 12 months ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 115

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

Paper • 2403.09347 • Published Mar 14, 2024 • 21

upvoted a paper about 1 year ago

Meta-Transformer: A Unified Framework for Multimodal Learning

Paper • 2307.10802 • Published Jul 20, 2023 • 44

upvoted a paper over 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123