Phil's picture

Phil

phil111

·

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

mistralai/Mistral-Small-24B-Instruct-2501:This Mistral Small has FAR less knowledge than the last.

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1

new activity about 2 months ago

internlm/internlm3-8b-instruct:English tests and tasks are absurdly overfit.

View all activity

Organizations

None yet

phil111's activity

New activity in mistralai/Mistral-Small-24B-Instruct-2501 about 1 month ago

This Mistral Small has FAR less knowledge than the last.

#5 opened about 1 month ago by

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 17 days ago • 2.75M • • 11.3k

New activity in internlm/internlm3-8b-instruct about 2 months ago

English tests and tasks are absurdly overfit.

#8 opened about 2 months ago by

New activity in microsoft/phi-4 2 months ago

A heavily filtered corpus simply doesn't work.

#19 opened 2 months ago by

I Don't Understand This Model

#9 opened 2 months ago by

New activity in matteogeniaccio/phi-4 3 months ago

Notably better than Phi3.5 in many ways, but something is wrong.

#5 opened 3 months ago by

liked a model 3 months ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 17 days ago • 3.12M • • 3.63k

New activity in deepseek-ai/DeepSeek-V3-Base 3 months ago

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

#27 opened 3 months ago by

liked a model 3 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 17 days ago • 762k • 1.59k

New activity in NyxKrage/Microsoft_Phi-4 3 months ago

SimpleQA score

#1 opened 3 months ago by

New activity in ibm-granite/granite-3.1-8b-instruct 3 months ago

Exceptional creative writer

#1 opened 3 months ago by

liked 2 models 3 months ago

ibm-granite/granite-3.1-8b-instruct

Text Generation • Updated 15 days ago • 94.7k • 153

QuantFactory/granite-3.1-8b-instruct-GGUF

Text Generation • Updated Dec 19, 2024 • 684 • 7

New activity in tiiuae/Falcon3-7B-Instruct 3 months ago

Very High English MMLU scores, Yet Extremely Low Broad English Knowledge

#8 opened 3 months ago by

New activity in CohereForAI/c4ai-command-r7b-12-2024 3 months ago

How was r7b?

#3 opened 3 months ago by

Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks

#1 opened 3 months ago by

New activity in meta-llama/Llama-3.3-70B-Instruct 3 months ago

local Llama + GPU(cuda)

#34 opened 3 months ago by

Base Model?

#32 opened 3 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 3 months ago

Add Hymba-1.5B to the leaderboard

#1030 opened 3 months ago by

liked a model 3 months ago

lmstudio-community/Llama-3.3-70B-Instruct-GGUF

Text Generation • Updated Dec 6, 2024 • 23.8k • 48