File size: 2,843 Bytes
82f4818
 
1f99a45
82f4818
 
 
 
 
 
d6eeae4
 
 
a56c929
d6eeae4
a56c929
d6eeae4
 
 
a6571ed
d6eeae4
 
 
a56c929
d6eeae4
e5462e9
d6eeae4
 
 
 
 
 
 
 
 
 
 
e5462e9
 
 
 
 
88d2a89
d6eeae4
 
 
a56c929
c6d1961
d6eeae4
1f99a45
d6eeae4
 
 
a56c929
d6eeae4
1f99a45
c6d1961
d6eeae4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1f99a45
d6eeae4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
---
title: README
emoji: 🌍
colorFrom: yellow
colorTo: yellow
sdk: static
pinned: false
---

<p align="center">
  <img src="bunka_logo.png" alt="Bunka Logo" width="600"/>
</p>

<h1 align="center">Welcome to Bunka</h1>

<p align="center">
  Bunka provides visual investigation of textual datasets, using Topic Modeling and Frame Analysis.
</p>

<p align="center">
  Whether you want to understand your training datasets, its content or what it's missing before fine-tuning your model, you're at the right place!
</p>

<p align="center">
  <a href="https://www.bunka.ai/"><img src="https://img.shields.io/badge/Website-FF7139?style=for-the-badge&logo=firefox-browser&logoColor=white" alt="Website"></a>
  <a href="https://github.com/charlesdedampierre/BunkaTopics"><img src="https://img.shields.io/badge/GitHub-100000?style=for-the-badge&logo=github&logoColor=white" alt="GitHub"></a>
  <a href="https://www.linkedin.com/company/85881815"><img src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white" alt="LinkedIn"></a>
  <a href="https://discord.gg/3YRPVqXabQ"><img src="https://img.shields.io/badge/Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white" alt="Discord"></a>
</p>

## πŸš€ Explore Our Platform

<p align="center">
  <a href="https://beta.bunkasearch.com/">
    <img src="platform_hub.png" alt="Bunka Platform Overview" width="800"/>
  </a>
</p>

Our platform, based on Bunka's open-source backend technology, can be used to visualize, explore, and refine unstructured information. Public datasets are already available for exploration.

<p align="center">
  <a href="https://beta.bunkasearch.com/"><img src="https://img.shields.io/badge/Try%20Bunka%20Platform-4285F4?style=for-the-badge&logo=google-maps&logoColor=white" alt="Try Bunka Platform"></a>
</p>

## 🧠 Topic Modeling

We summarize information with Topic Modeling & Generative AI for RAG.
This provides an overview of your dataset contents in the blink of an eye!

<p align="center">
  <img src="newsmap.png" alt="Topic Modeling Example" width="600"/>
</p>

## πŸ–ΌοΈ Frame Analysis

We project information on a supervised Axis to Explore textual data in a completely new way.
This allows you to investigate potential biases, or filter the content of your dataset in order to clean it faster!

<p align="center">
  <img src="bourdieu.png" alt="Frame Analysis Example" width="600"/>
</p>

## 🎬 Example: IMDB Dataset Visualization

Explore our visualization of the IMDB dataset:

<p align="center">
  <a href="https://beta.bunkasearch.com/map/206">
    <img src="imdb_dataset.png" alt="IMDB Dataset Visualization" width="800"/>
  </a>
</p>

## 🀝 Join Our Community

We're excited to have you join our community! Feel free to reach out on any of our platforms for questions, suggestions, or collaborations.