Duplicate from microsoft/rho-math-1b-v0.1

---
license: mit
tags:
  - nlp
  - math
language:
  - en
pipeline_tag: text-generation
---

# Rho-1: Not All Tokens Are What You Need

The Rho-1 series is a family of pretrained language models trained with a Selective Language Modeling (SLM) objective. In math reasoning pretraining, SLM improves average few-shot accuracy on GSM8k and MATH by over 16% and reaches the baseline's performance 5-10x faster.
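
This card does not spell out how SLM decides which tokens to train on. The following is a rough sketch, assuming the scheme described in the Rho-1 paper: a reference model scores every token, and the training loss is averaged only over the tokens with the highest excess loss (training-model loss minus reference-model loss). The function names, the plain-list inputs, and the `keep_ratio` value are hypothetical choices for illustration, not the authors' implementation.

```python
import math


def select_tokens(train_losses, ref_losses, keep_ratio=0.6):
    """Return the sorted indices of the tokens kept for training.

    train_losses / ref_losses: per-token losses from the model being
    trained and from a reference model (hypothetical plain-list inputs).
    keep_ratio: fraction of tokens to keep (arbitrary illustrative value).
    """
    if len(train_losses) != len(ref_losses) or not train_losses:
        raise ValueError("loss lists must be non-empty and of equal length")
    if not 0 < keep_ratio <= 1:
        raise ValueError("keep_ratio must be in (0, 1]")

    # Excess loss: how much worse the training model is than the reference.
    excess = [t - r for t, r in zip(train_losses, ref_losses)]

    # Keep the top keep_ratio fraction of tokens, ranked by excess loss.
    k = max(1, math.ceil(keep_ratio * len(excess)))
    ranked = sorted(range(len(excess)), key=lambda i: excess[i], reverse=True)
    return sorted(ranked[:k])


def selective_loss(train_losses, ref_losses, keep_ratio=0.6):
    """Average the training loss over the selected tokens only."""
    kept = select_tokens(train_losses, ref_losses, keep_ratio)
    return sum(train_losses[i] for i in kept) / len(kept)
```

In a real training loop the same masking would be applied to a tensor of per-token cross-entropy values before averaging. The list version only illustrates the selection logic.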

For more details, please see our GitHub repository and paper.
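
The metadata tags this model for `text-generation`, so a minimal loading sketch may be useful. It assumes the Hugging Face `transformers` library with a PyTorch backend, that the checkpoint loads through the generic `AutoTokenizer` and `AutoModelForCausalLM` classes, and that the upstream repository ID `microsoft/rho-math-1b-v0.1` is the one to pull. `build_prompt` and `generate` are hypothetical helper names, and the question/answer prompt layout is only an illustration, not a format prescribed by the authors.

```python
MODEL_ID = "microsoft/rho-math-1b-v0.1"  # upstream repo ID; adjust for a duplicate


def build_prompt(question, examples=()):
    """Format optional (question, answer) demonstrations plus the new question.

    The "Question:/Answer:" layout is an assumed few-shot format.
    """
    parts = [f"Question: {q}\nAnswer: {a}" for q, a in examples]
    parts.append(f"Question: {question}\nAnswer:")
    return "\n\n".join(parts)


def generate(prompt, model_id=MODEL_ID, max_new_tokens=128):
    """Greedy-decode a continuation of the prompt with the pretrained model."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )

    # Drop the prompt tokens and decode only the newly generated ones.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate(build_prompt("What is 12 * 7?"))` downloads the weights on first use. Because this is a base pretrained model rather than an instruction-tuned one, a few worked demonstrations passed through `examples` will likely steer it better than a bare question.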