Benhao Tang PRO

benhaotang

AI & ML interests

Physics Master student in theoretical particle physics at Universität Heidelberg, actively looking into the possibilities of integrating AI into future physics research.

Recent Activity

liked a model 1 day ago

deepseek-ai/DeepSeek-V3-0324

liked a model 7 days ago

ds4sd/SmolDocling-256M-preview

liked a model 9 days ago

KandirResearch/CiSiMi-v0.1

View all activity

Organizations

None yet

benhaotang's activity

liked a model 1 day ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated about 16 hours ago • 6.67k • • 1.43k

liked a model 7 days ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • Updated 3 days ago • 32.9k • 920

liked 2 models 9 days ago

KandirResearch/CiSiMi-v0.1

Text-to-Audio • Updated 8 days ago • 561 • 7

hanzla/Falcon3-Mamba-R1-v0

Text Generation • Updated 3 days ago • 1.47k • 9

liked a Space 11 days ago

Sesame CSM

🌱

Conversational speech generation

liked a model 11 days ago

mradermacher/OLMoE-1B-7B-0125-DPO-i1-GGUF

Updated Feb 20 • 828 • 1

liked a Space 12 days ago

118

Graph Mind

👀

Extract and visualize knowledge graphs from any text

liked a model 12 days ago

sesame/csm-1b

Text-to-Speech • Updated 9 days ago • 37.7k • 1.63k

liked a Space 16 days ago

Talk to OpenAI

🗣

Talk to OpenAI using their multimodal API

reacted to albertvillanova's post with 👍 17 days ago

Post

3681

🚀 New smolagents update: Safer Local Python Execution! 🦾🐍

With the latest release, we've added security checks to the local Python interpreter: every evaluation is now analyzed for dangerous builtins, modules, and functions. 🔒

Here's why this matters & what you need to know! 🧵👇

1️⃣ Why is local execution risky? ⚠️
AI agents that run arbitrary Python code can unintentionally (or maliciously) access system files, run unsafe commands, or exfiltrate data.

2️⃣ New Safety Layer in smolagents 🛡️
We now inspect every return value during execution:
✅ Allowed: Safe built-in types (e.g., numbers, strings, lists)
⛔ Blocked: Dangerous functions/modules (e.g., os.system, subprocess, exec, shutil)

3️⃣ Immediate Benefits 💡
- Prevent agents from accessing unsafe builtins
- Block unauthorized file or network access
- Reduce accidental security vulnerabilities

4️⃣ Security Disclaimer ⚠️
🚨 Despite these improvements, local Python execution is NEVER 100% safe. 🚨
If you need true isolation, use a remote sandboxed executor like Docker or E2B.

5️⃣ The Best Practice: Use Sandboxed Execution 🔐
For production-grade AI agents, we strongly recommend running code in a Docker or E2B sandbox to ensure complete isolation.

6️⃣ Upgrade Now & Stay Safe! 🚀
Check out the latest smolagents release and start building safer AI agents today.

🔗 https://github.com/huggingface/smolagents

What security measures do you take when running AI-generated code? Let’s discuss! 👇

#AI #smolagents #Python #Security

2 replies

liked a model 19 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 30 days ago • 1.5M • • 11.6k

liked a model 20 days ago

Qwen/QwQ-32B

Text Generation • Updated 14 days ago • 616k • • 2.52k

liked a model 21 days ago

CohereForAI/aya-vision-8b

Image-Text-to-Text • Updated 21 days ago • 150k • 267

updated a model 27 days ago

benhaotang/llama3.2-1B-physics-finetuned

Text Generation • Updated 27 days ago • 34

liked a dataset about 1 month ago

driaforall/verifiable-pythonic-function-calling-lite

Viewer • Updated Feb 7 • 16.4k • 276 • 6

liked 2 models about 1 month ago

driaforall/Tiny-Agent-a-1.5B

Text Generation • Updated Feb 17 • 245 • 4

homebrewltd/AlphaMaze-v0.2-1.5B

Text Generation • Updated 30 days ago • 2.65k • • 91

liked a Space about 1 month ago

2.34k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

replied to their post about 1 month ago

OK, grok 3 deep research also failed on my benchmark...

And this is the final solution it gives me:

Use wsl --shutdown before hibernating; if it fails, try net stop LxssManager.

What? How about just tell me if WSL have problem, just do not using WSL... How can this be a solution when there is even an official troubleshooting guide that provide more solutions. This is the even worst than gemini and perplexity, at least they read the official guide, just got lost in github issue threads... Now I really want to know how OpenAI's compares to mine, if I have 200 dollars.