19 3 20

Mariusz Kurman PRO

mkurman

AI & ML interests

AI Tech Lead | MD

Recent Activity

updated a collection 2 days ago

MedIT One

updated a collection 2 days ago

MedIT One

reacted to albertvillanova's post with 👍 2 days ago

🚀 New smolagents update: Safer Local Python Execution! 🦾🐍 With the latest release, we've added security checks to the local Python interpreter: every evaluation is now analyzed for dangerous builtins, modules, and functions. 🔒 Here's why this matters & what you need to know! 🧵👇 1️⃣ Why is local execution risky? ⚠️ AI agents that run arbitrary Python code can unintentionally (or maliciously) access system files, run unsafe commands, or exfiltrate data. 2️⃣ New Safety Layer in smolagents 🛡️ We now inspect every return value during execution: ✅ Allowed: Safe built-in types (e.g., numbers, strings, lists) ⛔ Blocked: Dangerous functions/modules (e.g., os.system, subprocess, exec, shutil) 3️⃣ Immediate Benefits 💡 - Prevent agents from accessing unsafe builtins - Block unauthorized file or network access - Reduce accidental security vulnerabilities 4️⃣ Security Disclaimer ⚠️ 🚨 Despite these improvements, local Python execution is NEVER 100% safe. 🚨 If you need true isolation, use a remote sandboxed executor like Docker or E2B. 5️⃣ The Best Practice: Use Sandboxed Execution 🔐 For production-grade AI agents, we strongly recommend running code in a Docker or E2B sandbox to ensure complete isolation. 6️⃣ Upgrade Now & Stay Safe! 🚀 Check out the latest smolagents release and start building safer AI agents today. 🔗 https://github.com/huggingface/smolagents What security measures do you take when running AI-generated code? Let’s discuss! 👇 #AI #smolagents #Python #Security

View all activity

Organizations

mkurman's activity

updated a collection 2 days ago

MedIT One

Collection

A compilation of MedIT One checkpoints • 2 items • Updated 2 days ago

reacted to albertvillanova's post with 👍 2 days ago

Post

3319

🚀 New smolagents update: Safer Local Python Execution! 🦾🐍

With the latest release, we've added security checks to the local Python interpreter: every evaluation is now analyzed for dangerous builtins, modules, and functions. 🔒

Here's why this matters & what you need to know! 🧵👇

1️⃣ Why is local execution risky? ⚠️
AI agents that run arbitrary Python code can unintentionally (or maliciously) access system files, run unsafe commands, or exfiltrate data.

2️⃣ New Safety Layer in smolagents 🛡️
We now inspect every return value during execution:
✅ Allowed: Safe built-in types (e.g., numbers, strings, lists)
⛔ Blocked: Dangerous functions/modules (e.g., os.system, subprocess, exec, shutil)

3️⃣ Immediate Benefits 💡
- Prevent agents from accessing unsafe builtins
- Block unauthorized file or network access
- Reduce accidental security vulnerabilities

4️⃣ Security Disclaimer ⚠️
🚨 Despite these improvements, local Python execution is NEVER 100% safe. 🚨
If you need true isolation, use a remote sandboxed executor like Docker or E2B.

5️⃣ The Best Practice: Use Sandboxed Execution 🔐
For production-grade AI agents, we strongly recommend running code in a Docker or E2B sandbox to ensure complete isolation.

6️⃣ Upgrade Now & Stay Safe! 🚀
Check out the latest smolagents release and start building safer AI agents today.

🔗 https://github.com/huggingface/smolagents

What security measures do you take when running AI-generated code? Let’s discuss! 👇

#AI #smolagents #Python #Security

2 replies

New activity in meditsolutions/medit-one-140M-9B-tokens-checkpoint 3 days ago

Question on meaning of parameter of this model

#2 opened 3 days ago by

JLouisBiz

posted an update 3 days ago

Post

712

Just released NVAMP Loss!

✔️ modification of the cross-entropy loss function designed specifically for training LLMs.
✔️ twist on the standard cross-entropy loss by emphasizing the importance of outlier prediction errors and dynamically normalizing token-level variance.
✔️ more stable and efficient training, leading to models that generalize better.

Check it out, give it a spin, and let me know what you think!

Licensed under the Apache 2.0 license and ready to use. Happy training! 🔥🤖

https://github.com/mkurman/nvamp-loss

New activity in meditsolutions/medit-one-140M-9B-tokens-checkpoint 3 days ago

Can't install

#1 opened 3 days ago by

JLouisBiz

posted an update 4 days ago

Post

2339

MedIT One 140M Fifth checkpoint after 9B tokens
meditsolutions/medit-one-140M-9B-tokens-checkpoint

updated a model 4 days ago

meditsolutions/medit-one-140M-9B-tokens-checkpoint

Updated 4 days ago • 10

published a model 4 days ago

meditsolutions/medit-one-140M-9B-tokens-checkpoint

Updated 4 days ago • 10

liked a model 5 days ago

Qwen/QwQ-32B

Text Generation • Updated 3 days ago • 132k • • 1.79k

posted an update 6 days ago

Post

403

Test-time compute (TTC) scaling’s dope. Here’s my spin: Adaptive train-time compute scaling.

https://open.substack.com/pub/mkurman/p/adaptive-train-time-compute-scaling?r=7bzqr

What’s your take? Hit me!

posted an update 7 days ago

Post

545

I have uploaded the third pre-training checkpoint after 6 billion tokens to demonstrate that the MedIT One architecture is trainable.

Give it some noise plz! Love u all :D

meditsolutions/medit-one-140M-6B-tokens-checkpoint

updated a model 7 days ago

meditsolutions/medit-one-140M-6B-tokens-checkpoint

Updated 7 days ago • 39 • 1

published a model 7 days ago

meditsolutions/medit-one-140M-6B-tokens-checkpoint

Updated 7 days ago • 39 • 1

reacted to Jaward's post with ❤️ 8 days ago

Post

4922

made a few improvements on custom grpo trainer:
- added sequence similarity reward (seems to work)
- improved vllm support (5x inference speed)
- adjusted reward scores (this helped with format/accuracy)
- can now push to hf hub (already pushed mine lol: Jaward/smollm2_360m_grpo_gsm8k_reasoner)

Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

posted an update 9 days ago

Post

3638

Introducing a new architecture, MedIT One – a single-token transformer with LSTM-like recurrence.

It is extremely fast in training and inference, but we lack funding for large-scale training. Enjoy 🍓

https://github.com/MedITSolutionsKurman/medit-one

reacted to JingzeShi's post with 🚀 16 days ago

Post

2928

🤗Welcome to the Doge Edge Device Small language Model.

SmallDoge/Doge-160M-Instruct

updated a model 16 days ago

mkurman/llama-3.2-MEDIT-3B-o1-GRPO-LLM-Eval

Text Generation • Updated 16 days ago • 44

published a model 16 days ago

mkurman/llama-3.2-MEDIT-3B-o1-GRPO-LLM-Eval

Text Generation • Updated 16 days ago • 44

updated a model 24 days ago

mkurman/Llama-3.2-MedIT-3B-R1

Updated 24 days ago • 132 • 1