Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
Maarten Grootendorst
MaartenGr
Follow
pierrci's profile picture
Kathakali's profile picture
itisdom's profile picture
23 followers
·
0 following
https://newsletter.maartengrootendorst.com
MaartenGr
MaartenGr
mgrootendorst
AI & ML interests
None yet
Recent Activity
reacted
to
asoria
's
post
with ❤️
about 2 months ago
🚀 Exploring Topic Modeling with BERTopic 🤖 When you come across an interesting dataset, you often wonder: Which topics frequently appear in these documents? 🤔 What is this data really about? 📊 Topic modeling helps answer these questions by identifying recurring themes within a collection of documents. This process enables quick and efficient exploratory data analysis. I’ve been working on an app that leverages BERTopic, a flexible framework designed for topic modeling. Its modularity makes BERTopic powerful, allowing you to switch components with your preferred algorithms. It also supports handling large datasets efficiently by merging models using the BERTopic.merge_models approach. 🔗 🔍 How do we make this work? Here’s the stack we’re using: 📂 Data Source ➡️ Hugging Face datasets with DuckDB for retrieval 🧠 Text Embeddings ➡️ Sentence Transformers (all-MiniLM-L6-v2) ⚡ Dimensionality Reduction ➡️ RAPIDS cuML UMAP for GPU-accelerated performance 🔍 Clustering ➡️ RAPIDS cuML HDBSCAN for fast clustering ✂️ Tokenization ➡️ CountVectorizer 🔧 Representation Tuning ➡️ KeyBERTInspired + Hugging Face Inference Client with Meta-Llama-3-8B-Instruct 🌍 Visualization ➡️ Datamapplot library Check out the space and see how you can quickly generate topics from your dataset: https://huggingface.co/spaces/datasets-topics/topics-generator Powered by @MaartenGr - BERTopic
View all activity
Articles
Introducing BERTopic Integration with Hugging Face Hub
May 31, 2023
•
7
Organizations
MaartenGr
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
MaartenGr/BERTopic_Wikipedia
9 months ago
What is the training benchmark for model `BERTopic_Wikipedia`
1
#3 opened 9 months ago by
benjaminliupenrose
How to merge topics for model `BERTopic_Wikipedia`
1
#2 opened 9 months ago by
benjaminliupenrose
New activity in
MaartenGr/BERTopic_Wikipedia
over 1 year ago
Inference API err: HfApiJson Deserialize Error
2
#1 opened over 1 year ago by
ongkn