ED-Zephyria-48b [EXPERIMENTAL]

Model Information

Base Model: unsloth/Mistral-Small-Instruct-2409

Strategy: Early Duplication

Total Layers: 55

Duplication Start: Layer 14 (25.5% of model)

Duplicated Layers: 35 (63.6% of model)

Unique Final Layers: 6 (10.9% of model)

Model Characteristics

  • The model's down_proj and o_proj layers have been nulled and will require healing
  • Focuses on refining early features
  • Largest duplicated section among all strategies
  • Suitable for tasks requiring intensive low-level feature processing
  • May excel in tasks that benefit from extensive refinement of basic patterns
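As a rough illustration of the nulling mentioned in the first bullet: in Hugging Face Mistral checkpoints, o_proj and down_proj are the output projections of the attention and MLP sub-blocks, so zeroing them makes a layer add nothing to the residual stream until it is healed by further training. The sketch below only selects the parameter names that would be zeroed. The card does not say which layers were nulled, so the layer indices, the helper name keys_to_null, and the three-layer example checkpoint are all hypothetical.

```python
import re

# Output projections of the attention and MLP sub-blocks, using the
# parameter naming found in Hugging Face Mistral checkpoints.
NULLED_SUFFIXES = ("self_attn.o_proj.weight", "mlp.down_proj.weight")
LAYER_KEY = re.compile(r"^model\.layers\.(\d+)\.(.+)$")


def keys_to_null(state_keys, layer_indices):
    """Return the parameter names whose tensors would be zeroed.

    Hypothetical helper: the model card does not describe its tooling.
    """
    wanted = set(layer_indices)
    selected = []
    for key in state_keys:
        match = LAYER_KEY.match(key)
        if not match:
            continue
        layer, suffix = int(match.group(1)), match.group(2)
        if layer in wanted and suffix in NULLED_SUFFIXES:
            selected.append(key)
    return selected


# Hypothetical three-layer checkpoint in which layer 1 is a duplicated copy.
example_keys = [
    f"model.layers.{i}.{name}"
    for i in range(3)
    for name in (
        "self_attn.q_proj.weight",
        "self_attn.o_proj.weight",
        "mlp.up_proj.weight",
        "mlp.down_proj.weight",
    )
]
print(keys_to_null(example_keys, layer_indices=[1]))
```

With real weights, each selected tensor would then be filled with zeros before saving; that step is omitted here because it depends on the tensor library in use.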

Configuration Visualization


[   Unique   ][        Duplicated        ][Unique]
0 --------- 13 14 ------------------- 48 49 --- 54
    25.5%              63.6%            10.9%
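
The percentages in the diagram follow directly from the inclusive layer index ranges and the total of 55 layers. A minimal sketch of that arithmetic, where the segment labels and the helper name segment_shares are illustrative and not taken from the card:

```python
def segment_shares(segments, total_layers):
    """Map each (label, first, last) inclusive index range to (count, percent)."""
    shares = {}
    for label, first, last in segments:
        count = last - first + 1  # both endpoints are included
        shares[label] = (count, round(100 * count / total_layers, 1))
    return shares


# Index ranges read off the diagram above.
segments = [
    ("unique_early", 0, 13),
    ("duplicated", 14, 48),
    ("unique_final", 49, 54),
]
shares = segment_shares(segments, total_layers=55)
for label, (count, percent) in shares.items():
    print(f"{label}: {count} layers ({percent}%)")
```

The three counts (14, 35, and 6) sum to the 55 layers shown in the diagram.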
      
Format: Safetensors

Model size: 48.4B params

Tensor type: BF16