fondant-ai/README

mrchtr committed on Sep 18, 2023

Commit b28ea04

1 Parent(s): e685483

Update README.md

Files changed (1)

README.md +50 -2

@@ -1,10 +1,58 @@
 ---
 title: README
-emoji: 🐨
+emoji: 🍫
 colorFrom: yellow
 colorTo: green
 sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.
+<p align="center">
+    <img src="https://raw.githubusercontent.com/ml6team/fondant/main/docs/art/fondant_banner.svg" height="250px"/>
+</p>
+<p align="center">
+    <i>Sweet data-centric foundation model fine-tuning</i>
+    <br>
+    <a href="https://fondant.readthedocs.io/en/stable/"><strong>Explore the docs »</strong></a>
+    <br>
+    <br>
+    <a href="https://discord.gg/HnTdWhydGp"><img alt="Discord" src="https://dcbadge.vercel.app/api/server/HnTdWhydGp?style=flat-square"></a>
+    <a href="https://pypi.org/project/fondant/"><img alt="PyPI version" src="https://img.shields.io/pypi/v/fondant?color=brightgreen&style=flat-square"></a>
+    <a href="https://fondant.readthedocs.io/en/latest/license/"><img alt="License" src="https://img.shields.io/github/license/ml6team/fondant?style=flat-square&color=brightgreen"></a>
+    <a href="https://github.com/ml6team/fondant/actions/workflows/pipeline.yaml"><img alt="GitHub Workflow Status" src="https://img.shields.io/github/actions/workflow/status/ml6team/fondant/pipeline.yaml?style=flat-square"></a>
+    <a href="https://coveralls.io/github/ml6team/fondant?branch=main"><img alt="Coveralls" src="https://img.shields.io/coverallsCoverage/github/ml6team/fondant?style=flat-square"></a>
+</p>
+---
+**Fondant helps you create high-quality datasets to train or fine-tune foundation models such as:**
+- 🎨 Stable Diffusion
+- 📄 GPT-like Large Language Models (LLMs)
+- 🔎 CLIP
+- ✂️ Segment Anything (SAM)
+- ➕ And many more
+## 🪤 Why Fondant?
+Foundation models simplify inference by solving multiple tasks across modalities with a simple
+prompt-based interface. But what they've gained in the front, they've lost in the back.
+**These models require enormous amounts of data, moving complexity towards data preparation**, and
+leaving few parties able to train their own models.
+We believe that **innovation is a group effort**, requiring collaboration. While the community has
+been building and sharing models, everyone is still building their data preparation from scratch.
+**Fondant is the platform where we meet to build and share data preparation workflows.**
+Fondant offers a framework to build **composable data preparation pipelines, with reusable
+components, optimized to handle massive datasets**. Stop building from scratch, and start
+reusing components to:
+- Extend your data with public datasets
+- Generate new modalities using captioning, segmentation, translation, image generation, ...
+- Distill knowledge from existing foundation models
+- Filter out low-quality data
+- Deduplicate data
+And create high-quality datasets to fine-tune your own foundation models.
+<p align="right">(<a href="#chocolate_bar-fondant">back to top</a>)</p>