Phil
phil111
Recent Activity
new activity
about 1 month ago
mistralai/Mistral-Small-24B-Instruct-2501:This Mistral Small has FAR less knowledge than the last.
liked
a model
about 2 months ago
deepseek-ai/DeepSeek-R1
new activity
about 2 months ago
internlm/internlm3-8b-instruct:English tests and tasks are absurdly overfit.
phil111's activity
This Mistral Small has FAR less knowledge than the last.
20
#5 opened about 1 month ago
by
phil111
English tests and tasks are absurdly overfit.
21
#8 opened about 2 months ago
by
phil111
A heavily filtered corpus simply doesn't work.
4
#19 opened about 2 months ago
by
phil111
I Don't Understand This Model
16
#9 opened 2 months ago
by
phil111
Notably better than Phi3.5 in many ways, but something is wrong.
8
#5 opened 3 months ago
by
phil111
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2
#27 opened 2 months ago
by
phil111
SimpleQA score
2
#1 opened 3 months ago
by
frappuccino
Exceptional creative writer
5
#1 opened 3 months ago
by
SubtleOne
Very High English MMLU scores, Yet Extremely Low Broad English Knowledge
2
#8 opened 3 months ago
by
phil111
How was r7b?
6
#3 opened 3 months ago
by
MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12
#1 opened 3 months ago
by
Fizzarolli
local Llama + GPU(cuda)
7
#34 opened 3 months ago
by
Luciolla
Base Model?
3
#32 opened 3 months ago
by
User8213
Add Hymba-1.5B to the leaderboard
3
#1030 opened 3 months ago
by
pmolchanov
Hallucinates more than Mistral 7b
#13 opened 4 months ago
by
phil111
Looks like not as good as Qwen2.5 7B
9
#5 opened 5 months ago
by
MonolithFoundation
This LLM is hallucinating like crazy. Can someone verify these prompts?
28
#3 opened 5 months ago
by
phil111