Does this model support text insertion (fill in middle)?
2
#70 opened 11 days ago
by
AayushShah
Thoughts on deepseek-r1. Correct me if I'm wrong
1
#69 opened 11 days ago
by
pkms
ImportError: cannot import name 'is_torch_greater_or_equal_than_1_13' from 'transformers.pytorch_utils'
10
#67 opened 11 days ago
by
bashir-abubakar
e-currency
3
#63 opened 12 days ago
by
Zhendaxie
Meet PEEPSEEK, the first meme made by DeepSeek r1
1
#61 opened 12 days ago
by
deepseeker3b56
Whale logo, transparent
#60 opened 12 days ago
by
DorianDarko2525
Meet Finley, the Whale of DeepSeek!
#59 opened 12 days ago
by
deepseekjanus
The recent hype and coins
#58 opened 12 days ago
by
Chester1111
Official DeepThink Crypto Currency
1
#56 opened 12 days ago
by
qwen-llm
Congrats, this is by far the best open source model! Just a few steps until complete domination (feedback)
1
#54 opened 12 days ago
by
Dampfinchen
deepseek
#53 opened 12 days ago
by
denizkaya2022
Modify abbreviations in benchmark images into full name to avoid confusion
#52 opened 12 days ago
by
karminski
How to deploy DeepSeek-R1 with LMDeploy?
#48 opened 12 days ago
by
vansin
Cannot generate properly after fine-tuning on a dataset without thinking
1
#46 opened 13 days ago
by
HuanLin
Use memory to store inactive experts
#45 opened 13 days ago
by
xm10086
Qwen 32B distilled model: when length > 8k, a proportion of predictions are garbled, producing <think><think><think><think><think><think>
5
#44 opened 14 days ago
by
daniellibin
Update LICENSE
#43 opened 14 days ago
by
town24
edit paper link to hf for easier conversations
#41 opened 15 days ago
by
clem
Upload 80b78bb2-3b7e-4a0c-a76c-93e1503c7b30.jpeg
#40 opened 15 days ago
by
Uman1
The LICENSE-MODEL file is missing?
#39 opened 15 days ago
by
spanspek
New permissions gate doesn't look valid
3
#38 opened 15 days ago
by
AdjectiveAllison
Amazing Release! Can we also have DeepSeek-R1-Zero-Qwen-32B
#37 opened 15 days ago
by
cfpark00
Question about possible R1 - lite versions 70b / 32b
#36 opened 15 days ago
by
smokestudio
Update README.md
1
#35 opened 15 days ago
by
sloshywings
Add pipeline tag
#34 opened 15 days ago
by
nielsr
Deploying production ready Deepseek R1 on your AWS with vLLM
6
#32 opened 16 days ago
by
samagra14
Create Stephy
#31 opened 16 days ago
by
Kouadio12
comfyui-deepseek-r1
#30 opened 16 days ago
by
zwpython
I can't use your model in Hugging Face Spaces
2
#29 opened 16 days ago
by
MrEscorpion
Upload IMG_2394.jpeg
#28 opened 16 days ago
by
Itsvijay12
Upload IMG_2394.jpeg
#27 opened 16 days ago
by
Itsvijay12
It's an amazing model; I found a free one to experience R1
#26 opened 16 days ago
by
LLMhacker
Suggestion for censorship disclosure - odd responses from R1
7
#25 opened 16 days ago
by
vmajor
Transformer version required?
#24 opened 16 days ago
by
Pradeep1995
How to install and use on a local Windows machine
9
#23 opened 17 days ago
by
Merk0701234
Questions about using DeepThink and web search
#22 opened 17 days ago
by
hentaisenpai
Hardware requirements?
27
#19 opened 17 days ago
by
JohnnieB
Congratulating DeepSeek-R1 and Inviting Review of Our Team’s Early Research last year on Similar Ideas
#17 opened 17 days ago
by
zhengchenphd
BF16 model from open source community
#15 opened 18 days ago
by
OpenSourceRonin
Inference still requires 16 GPUs, right?
2
#14 opened 18 days ago
by
qqianxiao
add library name & auto-tag
#13 opened 19 days ago
by
reach-vb
Is this the same as DeepSeek-R1 (Preview) mentioned on LiveCodeBench?
2
#10 opened 19 days ago
by
KrishnaKaasyap
chore: update configuration_deepseek.py
#9 opened 19 days ago
by
eltociear
Wen R2D2?
1
#8 opened 19 days ago
by
TiFoil
Where is R1-Lite?
2
#5 opened 19 days ago
by
aryadytm
The ASI-Singularity(Godsend) is the only Global Solution, people.
#4 opened 19 days ago
by
AntDX316