erfanzar commited on
Commit
6be7070
·
verified ·
1 Parent(s): ae61e07

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -1
README.md CHANGED
@@ -7,4 +7,36 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  pinned: false
8
  ---
9
 
10
+ # EasyDeL 🔮
11
+
12
+ [**Key Features**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#key-features)
13
+ | [**Latest Updates**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#latest-updates-)
14
+ | [**Vision**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#future-updates-and-vision-)
15
+ | [**Quick Start**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#quick-start)
16
+ | [**Reference docs**](https://easydel.readthedocs.io/en/latest/)
17
+ | [**License**](https://github.com/erfanzar/EasyDeL?tab=readme-ov-file#license-)
18
+
19
+ EasyDeL is an open-source framework designed to enhance and streamline the training process of machine learning models, with a primary focus on Jax/Flax. It provides convenient and effective solutions for training and serving Flax/Jax models on TPU/GPU at scale.
20
+
21
+ ## Key Features
22
+
23
+ - **Diverse Architecture Support**: Seamlessly work with various model architectures including Transformers, Mamba, RWKV, and more.
24
+ - **Custom Kernels**: EasyDeL supports custom kernels and operation for both GPU (via mosaic and triton) and TPU (via pallas).
25
+ - **Diverse Model Support**: Implements a wide range of models in JAX, including Falcon, Qwen2, Phi2, Mixtral, Qwen2Moe, Cohere, Dbrx, Phi3, and MPT.
26
+ - **Advanced Trainers**: Offers specialized trainers like DPOTrainer, ORPOTrainer, SFTTrainer, and VideoCLM Trainer.
27
+ - **Serving and API Engines**: Provides engines for efficiently serving large language models (LLMs) in JAX.
28
+ - **Quantization and Bit Operations**: Supports various quantization methods and 8, 6, and 4-bit operations for optimized inference and training.
29
+ - **Performance Optimization**: Integrates FlashAttention, RingAttention, and other performance-enhancing features.
30
+ - **Model Conversion**: Supports automatic conversion between JAX-EasyDeL and PyTorch-HF models.
31
+
32
+ ### Fully Customizable and Hackable 🛠️
33
+
34
+ EasyDeL stands out by providing unparalleled flexibility and transparency:
35
+
36
+ - **Open Architecture**: Every single component of EasyDeL is open for inspection, modification, and customization. There are no black boxes here.
37
+ - **Hackability at Its Core**: We believe in giving you full control. Whether you want to tweak a small function or completely overhaul a training loop, EasyDeL lets you do it.
38
+ - **Custom Code Access**: All custom implementations are readily available and well-documented, allowing you to understand, learn from, and modify the internals as needed.
39
+ - **Encourage Experimentation**: We actively encourage users to experiment, extend, and improve upon the existing codebase. Your innovations could become the next big feature!
40
+ - **Community-Driven Development**: Share your custom implementations and improvements with the community, fostering a collaborative environment for advancing ML research and development.
41
+
42
+ With EasyDeL, you're not constrained by rigid frameworks. Instead, you have a flexible, powerful toolkit that adapts to your needs, no matter how unique or specialized they may be. Whether you're conducting cutting-edge research or building production-ready ML systems, EasyDeL provides the freedom to innovate without limitations.