2 3 1

Ofir Zafrir

ofirzaf

AI & ML interests

Sparsity, Qunatization, Model Compression

Recent Activity

authored a paper about 1 month ago

Q8BERT: Quantized 8Bit BERT

authored a paper about 1 month ago

FastDraft: How to Train Your Draft

upvoted a paper about 1 month ago

FastDraft: How to Train Your Draft

View all activity

Articles

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

Mar 20

• 5

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Jan 30

• 9

Organizations

ofirzaf's activity

authored 2 papers about 1 month ago

Q8BERT: Quantized 8Bit BERT

Paper • 1910.06188 • Published Oct 14, 2019 • 2

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17 • 9

upvoted a paper about 1 month ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17 • 9

upvoted a paper 5 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5 • 35

New activity in microsoft/Phi-3-mini-4k-instruct 8 months ago

Changed instruction/chat template

#54 opened 8 months ago by

ofirzaf

authored a paper over 1 year ago

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

Paper • 2306.16601 • Published Jun 28, 2023 • 4

liked a Space over 1 year ago

Running on CPU Upgrade

12.1k

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

updated 7 models over 2 years ago

updated a model about 3 years ago

Intel/bert-large-uncased-squadv1.1-sparse-90-unstructured

Question Answering • Updated Dec 5, 2021 • 168

updated 4 models over 3 years ago

Intel/bert-base-uncased-mnli-sparse-70-unstructured-no-classifier

Fill-Mask • Updated Jun 29, 2021 • 9

Intel/bert-base-uncased-sparse-1_2

Updated Jun 24, 2021 • 12

Intel/bert-base-uncased-mnli-sparse-70-unstructured

Text Classification • Updated May 24, 2021 • 28

Intel/bert-base-uncased-sparse-70-unstructured

Fill-Mask • Updated May 24, 2021 • 17