Phil
phil111
13 followers · 15 following
AI & ML interests
None yet
Organizations
None yet
phil111's activity
New activity in mistralai/Mistral-Small-24B-Instruct-2501, about 1 month ago
This Mistral Small has FAR less knowledge than the last.
20 • #5 opened about 1 month ago by phil111
Liked a model about 2 months ago
deepseek-ai/DeepSeek-R1
Text Generation • Updated 14 days ago • 3.64M • 11.1k
New activity in internlm/internlm3-8b-instruct, about 2 months ago
English tests and tasks are absurdly overfit.
21 • #8 opened about 2 months ago by phil111
New activity in microsoft/phi-4, about 2 months ago
A heavily filtered corpus simply doesn't work.
4 • #19 opened about 2 months ago by phil111
I Don't Understand This Model
16 • #9 opened 2 months ago by phil111
New activity in matteogeniaccio/phi-4, 2 months ago
Notably better than Phi3.5 in many ways, but something is wrong.
8 • #5 opened 3 months ago by phil111
Liked a model 2 months ago
deepseek-ai/DeepSeek-V3
Text Generation • Updated 14 days ago • 3.15M • 3.62k
New activity in deepseek-ai/DeepSeek-V3-Base, 2 months ago
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2 • #27 opened 2 months ago by phil111
Liked a model 2 months ago
deepseek-ai/DeepSeek-V3-Base
Updated 14 days ago • 765k • 1.59k
New activity in NyxKrage/Microsoft_Phi-4, 3 months ago
SimpleQA score
2 • #1 opened 3 months ago by frappuccino
New activity in ibm-granite/granite-3.1-8b-instruct, 3 months ago
Exceptional creative writer
5 • #1 opened 3 months ago by SubtleOne
Liked 2 models 3 months ago
ibm-granite/granite-3.1-8b-instruct
Text Generation • Updated 11 days ago • 91.4k • 153
QuantFactory/granite-3.1-8b-instruct-GGUF
Text Generation • Updated Dec 19, 2024 • 669 • 7
New activity in tiiuae/Falcon3-7B-Instruct, 3 months ago
Very High English MMLU scores, Yet Extremely Low Broad English Knowledge
2 • #8 opened 3 months ago by phil111
New activity in CohereForAI/c4ai-command-r7b-12-2024, 3 months ago
How was r7b?
6 • #3 opened 3 months ago by MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12 • #1 opened 3 months ago by Fizzarolli
New activity in meta-llama/Llama-3.3-70B-Instruct, 3 months ago
local Llama + GPU(cuda)
7 • #34 opened 3 months ago by Luciolla
Base Model?
3 • #32 opened 3 months ago by User8213
New activity in open-llm-leaderboard/open_llm_leaderboard, 3 months ago
Add Hymba-1.5B to the leaderboard
3 • #1030 opened 3 months ago by pmolchanov
Liked a model 3 months ago
lmstudio-community/Llama-3.3-70B-Instruct-GGUF
Text Generation • Updated Dec 6, 2024 • 25.4k • 47