Maziyar Panahi's picture

Maziyar Panahi PRO

MaziyarPanahi

·

AI & ML interests

Fine-Tuning, RLHF, Merging, Quantizations, Leaderboards

Recent Activity

updated a collection about 22 hours ago

updated a model about 22 hours ago

MaziyarPanahi/relay-v0.1-Mistral-Nemo-2407-GGUF

new activity about 22 hours ago

MaziyarPanahi/relay-v0.1-Mistral-Nemo-2407-GGUF:Upload folder using huggingface_hub

View all activity

Organizations

MaziyarPanahi's activity

upvoted a collection about 24 hours ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated about 12 hours ago • 20

upvoted a collection 1 day ago

medical_ner

Medical Entity Recognition • 16 items • Updated 1 day ago • 1

upvoted an article 2 days ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 23

upvoted a paper 5 days ago

NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Paper • 2402.15343 • Published Feb 23 • 13

upvoted a collection 5 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 5 days ago • 88

upvoted a collection 12 days ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 9 items • Updated 27 days ago • 58

upvoted a collection 14 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 17 items • Updated 27 days ago • 57

upvoted a collection 19 days ago

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated 19 days ago • 27

upvoted an article 19 days ago

Article

They Said It Couldn’t Be Done

By

•

19 days ago

• 75

upvoted a paper 20 days ago

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21 • 28

upvoted 2 collections 25 days ago

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8 • 21

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 27 days ago • 62

upvoted a paper about 1 month ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 36

upvoted 3 collections about 1 month ago

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 123

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22 • 30

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 27 days ago • 29

upvoted 2 articles about 1 month ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 385

Article

Halo: Open Source Health Tracking with Wearables

By

•

Nov 19

• 96

upvoted 2 papers about 1 month ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14 • 17

TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees

Paper • 2410.12854 • Published Oct 10 • 1