nanoT5-base-65kBPE-v2

This is a "raw" pretrained model intended to be fine-tuned on downstream tasks

training code: https://github.com/pszemraj/nanoT5/tree/any-tokenizer

plots

more details are under checkpoints/

loss

image/png

gradients

image/png

weights

image/png

Downloads last month
5
Safetensors
Model size
298M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Dataset used to train pszemraj/nanoT5-base-65kBPE-v2