Update README.md
Browse files
README.md
CHANGED
@@ -7,4 +7,36 @@ sdk: static
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
+
# EasyDeL 🔮
|
11 |
+
|
12 |
+
[**Key Features**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#key-features)
|
13 |
+
| [**Latest Updates**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#latest-updates-)
|
14 |
+
| [**Vision**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#future-updates-and-vision-)
|
15 |
+
| [**Quick Start**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#quick-start)
|
16 |
+
| [**Reference docs**](https://easydel.readthedocs.io/en/latest/)
|
17 |
+
| [**License**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#license-)
|
18 |
+
|
19 |
+
EasyDeL is an open-source framework designed to enhance and streamline the training process of machine learning models, with a primary focus on Jax/Flax. It provides convenient and effective solutions for training and serving Flax/Jax models on TPU/GPU at scale.
|
20 |
+
|
21 |
+
## Key Features
|
22 |
+
|
23 |
+
- **Diverse Architecture Support**: Seamlessly work with various model architectures including Transformers, Mamba, RWKV, and more.
|
24 |
+
- **Custom Kernels**: EasyDeL supports custom kernels and operation for both GPU (via mosaic and triton) and TPU (via pallas).
|
25 |
+
- **Diverse Model Support**: Implements a wide range of models in JAX, including Falcon, Qwen2, Phi2, Mixtral, Qwen2Moe, Cohere, Dbrx, Phi3, and MPT.
|
26 |
+
- **Advanced Trainers**: Offers specialized trainers like DPOTrainer, ORPOTrainer, SFTTrainer, and VideoCLM Trainer.
|
27 |
+
- **Serving and API Engines**: Provides engines for efficiently serving large language models (LLMs) in JAX.
|
28 |
+
- **Quantization and Bit Operations**: Supports various quantization methods and 8, 6, and 4-bit operations for optimized inference and training.
|
29 |
+
- **Performance Optimization**: Integrates FlashAttention, RingAttention, and other performance-enhancing features.
|
30 |
+
- **Model Conversion**: Supports automatic conversion between JAX-EasyDeL and PyTorch-HF models.
|
31 |
+
|
32 |
+
### Fully Customizable and Hackable 🛠️
|
33 |
+
|
34 |
+
EasyDeL stands out by providing unparalleled flexibility and transparency:
|
35 |
+
|
36 |
+
- **Open Architecture**: Every single component of EasyDeL is open for inspection, modification, and customization. There are no black boxes here.
|
37 |
+
- **Hackability at Its Core**: We believe in giving you full control. Whether you want to tweak a small function or completely overhaul a training loop, EasyDeL lets you do it.
|
38 |
+
- **Custom Code Access**: All custom implementations are readily available and well-documented, allowing you to understand, learn from, and modify the internals as needed.
|
39 |
+
- **Encourage Experimentation**: We actively encourage users to experiment, extend, and improve upon the existing codebase. Your innovations could become the next big feature!
|
40 |
+
- **Community-Driven Development**: Share your custom implementations and improvements with the community, fostering a collaborative environment for advancing ML research and development.
|
41 |
+
|
42 |
+
With EasyDeL, you're not constrained by rigid frameworks. Instead, you have a flexible, powerful toolkit that adapts to your needs, no matter how unique or specialized they may be. Whether you're conducting cutting-edge research or building production-ready ML systems, EasyDeL provides the freedom to innovate without limitations.
|