Jason Stillerman's picture
7 66

Jason Stillerman

stillerman

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago
teapotai/teapotllm
reacted to merve's post with πŸ‘ 1 day ago
So many open releases at Hugging Face past week 🀯 recapping all here ‡️ https://huggingface.co/collections/merve/march-21-releases-67dbe10e185f199e656140ae πŸ‘€ Multimodal > Mistral AI released a 24B vision LM, both base and instruction FT versions, sota πŸ”₯ (OS) > with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS) > SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants > SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS) πŸ’¬ LLMs > NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset > LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B > Dataset: Glaive AI released a new reasoning dataset of 22M+ examples > Dataset: NVIDIA released new helpfulness dataset HelpSteer3 > Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS) > Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B > Dataset: GeneralThought-430K is a new reasoning dataset (OS) πŸ–ΌοΈ Image Generation/Computer Vision > Roboflow released RF-DETR, new real-time sota object detector (OS) πŸ”₯ > YOLOE is a new real-time zero-shot object detector with text and visual prompts πŸ₯Ή > Stability AI released Stable Virtual Camera, a new novel view synthesis model > Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model > ByteDance released InfiniteYou, new realistic photo generation model > StarVector is a new 8B model that generates svg from images > FlexWorld is a new model that expands 3D views (OS) 🎀 Audio > Sesame released CSM-1B new speech generation model (OS) πŸ€– Robotics > NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset *OS ones have Apache 2.0 or MIT license
reacted to merve's post with πŸ€— 1 day ago
So many open releases at Hugging Face past week 🀯 recapping all here ‡️ https://huggingface.co/collections/merve/march-21-releases-67dbe10e185f199e656140ae πŸ‘€ Multimodal > Mistral AI released a 24B vision LM, both base and instruction FT versions, sota πŸ”₯ (OS) > with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS) > SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants > SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS) πŸ’¬ LLMs > NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset > LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B > Dataset: Glaive AI released a new reasoning dataset of 22M+ examples > Dataset: NVIDIA released new helpfulness dataset HelpSteer3 > Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS) > Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B > Dataset: GeneralThought-430K is a new reasoning dataset (OS) πŸ–ΌοΈ Image Generation/Computer Vision > Roboflow released RF-DETR, new real-time sota object detector (OS) πŸ”₯ > YOLOE is a new real-time zero-shot object detector with text and visual prompts πŸ₯Ή > Stability AI released Stable Virtual Camera, a new novel view synthesis model > Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model > ByteDance released InfiniteYou, new realistic photo generation model > StarVector is a new 8B model that generates svg from images > FlexWorld is a new model that expands 3D views (OS) 🎀 Audio > Sesame released CSM-1B new speech generation model (OS) πŸ€– Robotics > NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset *OS ones have Apache 2.0 or MIT license
View all activity

Organizations

Hugging Face's profile picture BigCode's profile picture Ontocord's M*DEL's profile picture Hugging Face Smol Models Research's profile picture MechaCroc Data Science & ML's profile picture Aurora-M's profile picture Hugging Face SMOL's profile picture

stillerman's activity

reacted to merve's post with πŸ‘πŸ€— 1 day ago
view post
Post
2925
So many open releases at Hugging Face past week 🀯 recapping all here ‡️ merve/march-21-releases-67dbe10e185f199e656140ae

πŸ‘€ Multimodal
> Mistral AI released a 24B vision LM, both base and instruction FT versions, sota πŸ”₯ (OS)
> with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS)
> SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants
> SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS)

πŸ’¬ LLMs
> NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset
> LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B
> Dataset: Glaive AI released a new reasoning dataset of 22M+ examples
> Dataset: NVIDIA released new helpfulness dataset HelpSteer3
> Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS)
> Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B
> Dataset: GeneralThought-430K is a new reasoning dataset (OS)

πŸ–ΌοΈ Image Generation/Computer Vision
> Roboflow released RF-DETR, new real-time sota object detector (OS) πŸ”₯
> YOLOE is a new real-time zero-shot object detector with text and visual prompts πŸ₯Ή
> Stability AI released Stable Virtual Camera, a new novel view synthesis model
> Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model
> ByteDance released InfiniteYou, new realistic photo generation model
> StarVector is a new 8B model that generates svg from images
> FlexWorld is a new model that expands 3D views (OS)

🎀 Audio
> Sesame released CSM-1B new speech generation model (OS)

πŸ€– Robotics
> NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset

*OS ones have Apache 2.0 or MIT license
reacted to davanstrien's post with ❀️ about 1 month ago
view post
Post
1938
How do you make 1M+ Hugging Face models & datasets more discoverable?

davanstrien/Smol-Hub-tldr!

I fine-tuned HuggingFaceTB/SmolLM2-360M to generate one-line summaries from a model or dataset README.

Its own self-description?
"A model for generating concise summaries of model & dataset cards from the Hugging Face Hub"

The goal? Make it easier to find the right models and datasets for your specific needs. It's already powering a semantic search for datasets Space.

It's still a WIP but thanks to @loubnabnl , @anton-l , @eliebak et al, for cooking such a nice base model for fine-tuning small, efficient models for specific domains and tasks. πŸ™