arco+

This is an untrained passthrough model based on arco and danube as a first effort to train a small enough reasoning language model that generalizes across all kind of reasoning tasks.

Benchmarks

Parameters Model MMLU ARC HellaSwag PIQA Winogrande Average
488m arco-lite 23.22 33.45 56.55 69.70 59.19 48.46
773m arco-plus 23.06 36.43 60.09 72.36 60.46 50.48

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
    - model: appvoid/arco
      layer_range: [0, 14]
  - sources:
    - model: h2oai/h2o-danube3-500m-base
      layer_range: [4, 16]

merge_method: passthrough
dtype: float16
Downloads last month
85
Safetensors
Model size
773M params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for appvoid/arco-plus

Merge model
this model
Finetunes
3 models
Merges
1 model
Quantizations
1 model

Collection including appvoid/arco-plus