K-intelligence
/

Midm-2.0-Base-Instruct

Text Generation

text-generation-inference

Model card Files Files and versions Community

Midm-LLM commited on 28 days ago

Commit

7565af1

·

verified ·

1 Parent(s): 0f431e8

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ library_name: transformers
 <p align="center">
 <br>
-    <span style="font-size: 60px; font-weight: bold;">Mi:dm 2.0-Base</span>
 </br>
 </p>
@@ -62,11 +62,11 @@ library_name: transformers
 Mi:dm 2.0 is released in two versions:
-- **Mi:dm 2.0-Base**
   An 11.5B parameter dense model designed to balance model size and performance.
   It extends an 8B-scale model by applying the Depth-up Scaling (DuS) method, making it suitable for real-world applications that require both performance and versatility.
-- **Mi:dm 2.0-Mini**
   A lightweight 2.3B parameter dense model optimized for on-device environments and systems with limited GPU resources.
   It was derived from the Base model through pruning and distillation to enable compact deployment.

 <p align="center">
 <br>
+    <span style="font-size: 60px; font-weight: bold;">Mi:dm 2.0 Base</span>
 </br>
 </p>
 Mi:dm 2.0 is released in two versions:
+- **Mi:dm 2.0 Base**
   An 11.5B parameter dense model designed to balance model size and performance.
   It extends an 8B-scale model by applying the Depth-up Scaling (DuS) method, making it suitable for real-world applications that require both performance and versatility.
+- **Mi:dm 2.0 Mini**
   A lightweight 2.3B parameter dense model optimized for on-device environments and systems with limited GPU resources.
   It was derived from the Base model through pruning and distillation to enable compact deployment.