---
title: README
emoji: 🌍
colorFrom: yellow
colorTo: yellow
sdk: static
pinned: false
---

<p align="center">
  <img src="bunka_logo.png" alt="Bunka Logo" width="600"/>
</p>

<h1 align="center">Welcome to Bunka</h1>

<p align="center">
  Bunka provides visual investigation of textual datasets, using Topic Modeling and Frame Analysis.
</p>

<p align="center">
  Whether you want to understand your training dataset, explore its content, or find out what it's missing before fine-tuning your model, you're in the right place!
</p>

<p align="center">
  <a href="https://www.bunka.ai/">🌐 Website</a> |
  <a href="https://github.com/charlesdedampierre/BunkaTopics">📚 GitHub</a> |
  <a href="https://beta.bunkasearch.com/">🚀 Bunka Platform</a>
</p>

<p align="center">
  <a href="https://github.com/charlesdedampierre/BunkaTopics"><img src="https://img.shields.io/badge/GitHub-100000?style=for-the-badge&logo=github&logoColor=white" alt="GitHub"></a>
  <a href="https://www.linkedin.com/company/85881815"><img src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white" alt="LinkedIn"></a>
  <a href="https://discord.gg/3YRPVqXabQ"><img src="https://img.shields.io/badge/Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white" alt="Discord"></a>
  <a href="https://huggingface.co/bunkalab"><img src="https://img.shields.io/badge/🤗%20Hugging%20Face-ffce6b?style=for-the-badge" alt="Hugging Face"></a>
</p>

## 🚀 Explore Our Platform

<p align="center">
  <a href="https://beta.bunkasearch.com/">
    <img src="platform_hub.png" alt="Bunka Platform Overview" width="800"/>
  </a>
</p>

## 🧠 Topic Modeling

We summarize information with Topic Modeling & Generative AI for RAG.
This gives you an overview of your dataset's contents in the blink of an eye!

<p align="center">
  <img src="newsmap.png" alt="Topic Modeling Example" width="600"/>
</p>

## 🖼️ Frame Analysis

We project information onto a supervised axis to explore textual data in a completely new way.
This lets you investigate potential biases or filter your dataset's content to clean it faster!

<p align="center">
  <img src="bourdieu.png" alt="Frame Analysis Example" width="600"/>
</p>
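
To make the idea of projecting documents onto a supervised axis concrete, here is a minimal sketch in plain Python. It is not Bunka's implementation: the function names (`axis_direction`, `project_on_axis`) are hypothetical, and the two-dimensional vectors are hand-written stand-ins for the embeddings a real sentence-embedding model would produce. It assumes an axis is defined by two pole embeddings (for example, one for "negative" and one for "positive") and scores each document by its signed position along the line joining them.

```python
import math


def axis_direction(neg_pole, pos_pole):
    """Unit vector pointing from the negative pole to the positive pole."""
    diff = [p - n for n, p in zip(neg_pole, pos_pole)]
    norm = math.sqrt(sum(x * x for x in diff))
    if norm == 0:
        raise ValueError("The two poles must be distinct vectors.")
    return [x / norm for x in diff]


def project_on_axis(embedding, neg_pole, pos_pole):
    """Signed distance of `embedding` from the midpoint of the two poles,
    measured along the axis. Positive values lean toward `pos_pole`,
    negative values toward `neg_pole`."""
    direction = axis_direction(neg_pole, pos_pole)
    midpoint = [(n + p) / 2 for n, p in zip(neg_pole, pos_pole)]
    return sum((e - m) * d for e, m, d in zip(embedding, midpoint, direction))


# Toy example: hand-written 2-D vectors standing in for real embeddings.
neg_pole = [-1.0, 0.0]  # hypothetical embedding of the "negative" pole
pos_pole = [1.0, 0.0]   # hypothetical embedding of the "positive" pole

documents = {
    "review_a": [0.8, 0.3],
    "review_b": [-0.5, 0.9],
    "review_c": [0.1, -0.4],
}

# Rank documents from the negative end of the axis to the positive end.
scores = {name: project_on_axis(vec, neg_pole, pos_pole) for name, vec in documents.items()}
for name, score in sorted(scores.items(), key=lambda item: item[1]):
    print(f"{name}: {score:+.2f}")
```

Sorting or thresholding on these scores is one simple way to surface documents that sit at either extreme of an axis, which is the kind of filtering described above.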

## 🎬 Example: IMDB Dataset Visualization

Explore our visualization of the IMDB dataset:

<p align="center">
  <a href="https://beta.bunkasearch.com/map/206">
    <img src="imdb_dataset.png" alt="IMDB Dataset Visualization" width="800"/>
  </a>
</p>

## 🀝 Join Our Community

We're excited to have you join our community! Feel free to reach out on any of our platforms for questions, suggestions, or collaborations.