Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -1,10 +1,58 @@
|
|
1 |
---
|
2 |
title: README
|
3 |
-
emoji:
|
4 |
colorFrom: yellow
|
5 |
colorTo: green
|
6 |
sdk: static
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
title: README
|
3 |
+
emoji: π«
|
4 |
colorFrom: yellow
|
5 |
colorTo: green
|
6 |
sdk: static
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
+
<p align="center">
|
11 |
+
<img src="https://raw.githubusercontent.com/ml6team/fondant/main/docs/art/fondant_banner.svg" height="250px"/>
|
12 |
+
</p>
|
13 |
+
<p align="center">
|
14 |
+
<i>Sweet data-centric foundation model fine-tuning</i>
|
15 |
+
<br>
|
16 |
+
<a href="https://fondant.readthedocs.io/en/stable/"><strong>Explore the docs Β»</strong></a>
|
17 |
+
<br>
|
18 |
+
<br>
|
19 |
+
<a href="https://discord.gg/HnTdWhydGp"><img alt="Discord" src="https://dcbadge.vercel.app/api/server/HnTdWhydGp?style=flat-square"></a>
|
20 |
+
<a href="https://pypi.org/project/fondant/"><img alt="PyPI version" src="https://img.shields.io/pypi/v/fondant?color=brightgreen&style=flat-square"></a>
|
21 |
+
<a href="https://fondant.readthedocs.io/en/latest/license/"><img alt="License" src="https://img.shields.io/github/license/ml6team/fondant?style=flat-square&color=brightgreen"></a>
|
22 |
+
<a href="https://github.com/ml6team/fondant/actions/workflows/pipeline.yaml"><img alt="GitHub Workflow Status" src="https://img.shields.io/github/actions/workflow/status/ml6team/fondant/pipeline.yaml?style=flat-square"></a>
|
23 |
+
<a href="https://coveralls.io/github/ml6team/fondant?branch=main"><img alt="Coveralls" src="https://img.shields.io/coverallsCoverage/github/ml6team/fondant?style=flat-square"></a>
|
24 |
+
</p>
|
25 |
+
|
26 |
+
---
|
27 |
+
**Fondant helps you create high quality datasets to train or fine-tune foundation models such as:**
|
28 |
+
|
29 |
+
- π¨ Stable Diffusion
|
30 |
+
- π GPT-like Large Language Models (LLMs)
|
31 |
+
- π CLIP
|
32 |
+
- βοΈ Segment Anything (SAM)
|
33 |
+
- β And many more
|
34 |
+
|
35 |
+
## πͺ€ Why Fondant?
|
36 |
+
|
37 |
+
Foundation models simplify inference by solving multiple tasks across modalities with a simple
|
38 |
+
prompt-based interface. But what they've gained in the front, they've lost in the back.
|
39 |
+
**These models require enormous amounts of data, moving complexity towards data preparation**, and
|
40 |
+
leaving few parties able to train their own models.
|
41 |
+
|
42 |
+
We believe that **innovation is a group effort**, requiring collaboration. While the community has
|
43 |
+
been building and sharing models, everyone is still building their data preparation from scratch.
|
44 |
+
**Fondant is the platform where we meet to build and share data preparation workflows.**
|
45 |
+
|
46 |
+
Fondant offers a framework to build **composable data preparation pipelines, with reusable
|
47 |
+
components, optimized to handle massive datasets**. Stop building from scratch, and start
|
48 |
+
reusing components to:
|
49 |
+
|
50 |
+
- Extend your data with public datasets
|
51 |
+
- Generate new modalities using captioning, segmentation, translation, image generation, ...
|
52 |
+
- Distill knowledge from existing foundation models
|
53 |
+
- Filter out low quality data
|
54 |
+
- Deduplicate data
|
55 |
+
|
56 |
+
And create high quality datasets to fine-tune your own foundation models.
|
57 |
+
|
58 |
+
<p align="right">(<a href="#chocolate_bar-fondant">back to top</a>)</p>
|