This Jamba model has been pruned down to roughly 1B parameters (1.02B, F32). It was then fine-tuned for instruction following on the first 50k examples of the UltraInteract_pair dataset.

Initial tests work, but results may be inconsistent. More details and examples will be posted later.
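Example Usage

In the meantime, here is a minimal inference sketch, assuming the model loads through the standard Hugging Face transformers causal-LM interface (Jamba support requires a recent transformers release). The prompt format is only an illustration, since no chat template is defined on this card yet.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Severian/Jamba-UltraInteract-Instruct-1B"

# Load tokenizer and model; Jamba requires a recent transformers version.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Plain-text prompt as an illustration; the exact instruction format may differ.
prompt = "Explain the difference between a list and a tuple in Python."
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```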

Training

  • 50k examples (see the data-selection sketch below)
  • ~6 hours on a single A100
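
The sketch below shows one way the training subset could be selected with the datasets library. The dataset id and split handling are assumptions for illustration, not the actual training script.

```python
from datasets import load_dataset

# Assumed dataset id; the card only names "the UltraInteract pair dataset".
dataset = load_dataset("openbmb/UltraInteract_pair", split="train")

# Take the first 50k examples, matching the subset described above.
train_subset = dataset.select(range(50_000))
print(train_subset)
```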
