100 19 20

Michael Goin

mgoin

mgoin_
mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

updated a collection 17 days ago

vLLM Kernels

updated a collection 17 days ago

vLLM Kernels

updated a dataset about 1 month ago

mgoin/mlperf-inference-llama3.1-8b-data

View all activity

Organizations

Collections 1

Browse and filter text models by RedHatAI

Sleeping

Convert Fp8

💬

Paused

Hermes Mistral 7b Vllm

🚀

Paused

Sparse Llama Gsm8k

📚

Running

TinyStories DeepSparse

🏢

models 101

mgoin/mlperf-inference-llama3.1-8b-data

Updated Jul 15

mgoin/Llama-3.1-8B-Instruct-FP8-BLOCK

8B • Updated Jul 1 • 7

mgoin/Qwen3-0.6B-FP8-BLOCK

0.6B • Updated Jul 1 • 62

mgoin/Qwen3-30B-A3B-FP8-BLOCK

31B • Updated Jul 1 • 27

mgoin/SEMIKONG-70B-W4A16-G128

11B • Updated Jun 16 • 4

mgoin/llama-4-tiny-random

Text Generation • 0.0B • Updated May 14 • 3

mgoin/Qwen1.5-14B-Chat-GPTQ

Text Generation • Updated Mar 5 • 3

mgoin/pixtral-12b

Image-Text-to-Text • 13B • Updated Feb 7 • 959 • 1

mgoin/Llama-3.2-1B-Instruct-FP8-ATTN

1B • Updated Dec 23, 2024 • 3

mgoin/Llama-3.2-1B-Instruct-FP8-dynamic-ATTN

1B • Updated Dec 23, 2024 • 3

View 101 models

datasets 4

mgoin/mlperf-inference-llama3.1-8b-data

Viewer • Updated Jul 15 • 13.4k • 131

mgoin/mlperf-inference-llama2-data

Viewer • Updated May 22 • 24.6k • 98

mgoin/mlperf-inference-llama3.1-405b-data

Viewer • Updated May 22 • 8.31k • 23

mgoin/ultrachat_2k

Viewer • Updated May 24, 2024 • 2.05k • 242

Michael Goin

AI & ML interests

Recent Activity

Organizations

Collections 1

mgoin/Nemotron-4-340B-Instruct-hf-FP8

mgoin/Nemotron-4-340B-Base-hf-FP8

mgoin/Nemotron-4-340B-Instruct-hf

mgoin/Nemotron-4-340B-Base-hf

mgoin/Nemotron-4-340B-Instruct-hf-FP8

mgoin/Nemotron-4-340B-Base-hf-FP8

mgoin/Nemotron-4-340B-Instruct-hf

mgoin/Nemotron-4-340B-Base-hf

Papers 4

spaces 5

redhatai-model-explorer

Convert Fp8

Hermes Mistral 7b Vllm

Sparse Llama Gsm8k

TinyStories DeepSparse

models 101

mgoin/mlperf-inference-llama3.1-8b-data

mgoin/Llama-3.1-8B-Instruct-FP8-BLOCK

mgoin/Qwen3-0.6B-FP8-BLOCK

mgoin/Qwen3-30B-A3B-FP8-BLOCK

mgoin/SEMIKONG-70B-W4A16-G128

mgoin/llama-4-tiny-random

mgoin/Qwen1.5-14B-Chat-GPTQ

mgoin/pixtral-12b

mgoin/Llama-3.2-1B-Instruct-FP8-ATTN

mgoin/Llama-3.2-1B-Instruct-FP8-dynamic-ATTN

datasets 4

mgoin/mlperf-inference-llama3.1-8b-data

mgoin/mlperf-inference-llama2-data

mgoin/mlperf-inference-llama3.1-405b-data

mgoin/ultrachat_2k

Michael Goin

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 4

spaces 5 Sort: Recently updated

redhatai-model-explorer

Convert Fp8

Hermes Mistral 7b Vllm

Sparse Llama Gsm8k

TinyStories DeepSparse

models 101 Sort: Recently updated

datasets 4 Sort: Recently updated

spaces 5

models 101

datasets 4