---
title: README
emoji: π
colorFrom: yellow
colorTo: yellow
sdk: static
pinned: false
---

<p align="center">
<img src="bunka_logo.png" alt="Bunka Logo" width="600"/>
</p>

<h1 align="center">Welcome to Bunka</h1>

<p align="center">
Bunka provides visual investigation of textual datasets, using Topic Modeling and Frame Analysis.
</p>

<p align="center">
Whether you want to understand your training dataset, its content, or what it is missing before fine-tuning your model, you're in the right place!
</p>

<p align="center">
<a href="https://www.bunka.ai/"><img src="https://img.shields.io/badge/Website-FF7139?style=for-the-badge&logo=firefox-browser&logoColor=white" alt="Website"></a>
<a href="https://github.com/charlesdedampierre/BunkaTopics"><img src="https://img.shields.io/badge/GitHub-100000?style=for-the-badge&logo=github&logoColor=white" alt="GitHub"></a>
<a href="https://www.linkedin.com/company/85881815"><img src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white" alt="LinkedIn"></a>
<a href="https://discord.gg/3YRPVqXabQ"><img src="https://img.shields.io/badge/Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white" alt="Discord"></a>
</p>

## Explore Our Platform

<p align="center">
<a href="https://beta.bunkasearch.com/">
<img src="platform_hub.png" alt="Bunka Platform Overview" width="800"/>
</a>
</p>

Our platform, based on Bunka's open-source backend technology, can be used to visualize, explore, and refine unstructured information. Public datasets are already available for exploration.

<p align="center">
<a href="https://beta.bunkasearch.com/"><img src="https://img.shields.io/badge/Try%20Bunka%20Platform-4285F4?style=for-the-badge&logo=google-maps&logoColor=white" alt="Try Bunka Platform"></a>
</p>

## 🧠 Topic Modeling

We summarize information using Topic Modeling and Generative AI for RAG.
This gives you an overview of your dataset's contents in the blink of an eye!

<p align="center">
<img src="newsmap.png" alt="Topic Modeling Example" width="600"/>
</p>
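
To make the idea concrete, here is a minimal, self-contained sketch of topic modeling on four toy sentences. It is not Bunka's implementation: it assumes plain word counts as document vectors and a small k-means-style clustering with cosine similarity, and every name in it (`STOPWORDS`, `vectorize`, `cosine`, `cluster`, `top_terms`) is hypothetical. A real pipeline would typically rely on learned text embeddings instead of word counts.

```python
import math
from collections import Counter

# Hypothetical, deliberately tiny stopword list for this toy example.
STOPWORDS = {"the", "in", "and"}


def vectorize(text):
    """Bag-of-words vector: lowercase whitespace tokens minus stopwords."""
    return Counter(t for t in text.lower().split() if t not in STOPWORDS)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(c * c for c in a.values())) * math.sqrt(
        sum(c * c for c in b.values())
    )
    return dot / norm if norm else 0.0


def cluster(vectors, k, max_iterations=10):
    """K-means-style clustering using cosine similarity."""
    # Deterministic farthest-point initialisation: start from the first
    # document, then add the document least similar to the chosen centroids.
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            min(vectors, key=lambda v: max(cosine(v, c) for c in centroids))
        )
    labels = []
    for _ in range(max_iterations):
        # Assign each document to its most similar centroid.
        new_labels = [
            max(range(k), key=lambda i: cosine(v, centroids[i])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        # Recompute each centroid as the summed counts of its members
        # (cosine similarity ignores scale, so a sum works like a mean).
        centroids = []
        for i in range(k):
            total = Counter()
            for v, label in zip(vectors, labels):
                if label == i:
                    total.update(v)
            centroids.append(total)
    return labels, centroids


def top_terms(centroid, n):
    """Most frequent terms of a cluster, ties broken alphabetically."""
    ranked = sorted(centroid.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:n]]


docs = [
    "the team won the football match",
    "football players scored in the match",
    "the central bank raised interest rates",
    "interest rates and inflation worry the bank",
]
vectors = [vectorize(d) for d in docs]
labels, centroids = cluster(vectors, k=2)
for i, centroid in enumerate(centroids):
    size = labels.count(i)
    print(f"Topic {i}: {top_terms(centroid, 2)} ({size} documents)")
```

Each cluster is labeled with its most frequent words, which is the kind of at-a-glance summary a topic map aims to give.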

## 🖼️ Frame Analysis

We project information onto a supervised axis to explore textual data in a completely new way.
This lets you investigate potential biases or filter your dataset's content to clean it faster!

<p align="center">
<img src="bourdieu.png" alt="Frame Analysis Example" width="600"/>
</p>
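
One way to picture a supervised axis is as a line between two hand-picked poles. The sketch below is an illustration under stated assumptions, not Bunka's API: it represents texts as word-count vectors and scores each document by its cosine similarity to a positive pole minus its similarity to a negative pole. The pole word lists, the sample reviews, and the helper names (`vectorize`, `cosine`, `axis_score`) are all hypothetical.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words vector: lowercase whitespace tokens with their counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(c * c for c in a.values())) * math.sqrt(
        sum(c * c for c in b.values())
    )
    return dot / norm if norm else 0.0


def axis_score(text, positive_pole, negative_pole):
    """Position on the axis: above 0 leans positive, below 0 leans negative."""
    doc = vectorize(text)
    return cosine(doc, positive_pole) - cosine(doc, negative_pole)


# Hypothetical poles defining a "praise vs. criticism" axis.
positive_pole = vectorize("great wonderful brilliant")
negative_pole = vectorize("boring terrible awful")

reviews = [
    "brilliant wonderful film",
    "boring plot terrible acting",
    "great cast but boring story",
]
scores = {r: axis_score(r, positive_pole, negative_pole) for r in reviews}

# Sort from the negative end of the axis to the positive end.
ranked = sorted(reviews, key=scores.get)
for review in ranked:
    print(f"{scores[review]:+.2f}  {review}")
```

Sorting or thresholding on such a score is one simple way to surface one-sided content or to filter a dataset along the chosen axis.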

## 🎬 Example: IMDB Dataset Visualization

Explore our visualization of the IMDB dataset:

<p align="center">
<a href="https://beta.bunkasearch.com/map/206">
<img src="imdb_dataset.png" alt="IMDB Dataset Visualization" width="800"/>
</a>
</p>

## 🤝 Join Our Community

We're excited to have you join our community! Feel free to reach out on any of our platforms for questions, suggestions, or collaborations.