Update README.md
README.md CHANGED
@@ -16,32 +16,6 @@ You can use the raw model for either masked language modeling or next sentence prediction
Note that this model is primarily aimed at being fine-tuned on math-related tasks that use the whole sentence (potentially masked) to make decisions, such as sequence classification, token classification or question answering. For tasks such as math text generation, you should look at a model like GPT2.
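As a brief illustration of that fine-tuning setup (not part of the original card), the sketch below loads the checkpoint for sequence classification; the Hub id `tbs17/MathBERT` and the example sentences are assumptions, so substitute the actual checkpoint name.

```python
# Hypothetical fine-tuning starting point; the Hub id below is an assumption,
# not something stated in this diff.
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_id = "tbs17/MathBERT"  # assumed checkpoint name; replace as needed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=2)

# Tokenize a small batch of math sentences and run a forward pass.
inputs = tokenizer(
    ["A fraction compares a part to a whole.", "Solve for x: 2x + 3 = 7."],
    padding=True,
    truncation=True,
    return_tensors="pt",
)
logits = model(**inputs).logits  # shape: (batch_size, num_labels)
```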
#### How to use

-<!---You can use this model directly with a pipeline for masked language modeling:
-
->>> from transformers import pipeline
->>> unmasker = pipeline('fill-mask', model='bert-base-uncased')
->>> unmasker("Hello I'm a [MASK] model.")
-
-[{'sequence': "[CLS] hello i'm a fashion model. [SEP]",
-  'score': 0.1073106899857521,
-  'token': 4827,
-  'token_str': 'fashion'},
- {'sequence': "[CLS] hello i'm a role model. [SEP]",
-  'score': 0.08774490654468536,
-  'token': 2535,
-  'token_str': 'role'},
- {'sequence': "[CLS] hello i'm a new model. [SEP]",
-  'score': 0.05338378623127937,
-  'token': 2047,
-  'token_str': 'new'},
- {'sequence': "[CLS] hello i'm a super model. [SEP]",
-  'score': 0.04667217284440994,
-  'token': 3565,
-  'token_str': 'super'},
- {'sequence': "[CLS] hello i'm a fine model. [SEP]",
-  'score': 0.027095865458250046,
-  'token': 2986,
-  'token_str': 'fine'}]--->

Here is how to use this model to get the features of a given text in PyTorch:
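The PyTorch snippet itself lies outside this hunk; as a rough sketch of what that section typically looks like (the Hub id `tbs17/MathBERT` is an assumption, not shown in this diff):

```python
# Sketch of extracting hidden-state features; the checkpoint id is an assumption.
from transformers import AutoTokenizer, AutoModel

model_id = "tbs17/MathBERT"  # assumed checkpoint name; replace as needed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

text = "Students compare fractions with unlike denominators."
encoded_input = tokenizer(text, return_tensors="pt")
output = model(**encoded_input)
features = output.last_hidden_state  # shape: (1, sequence_length, hidden_size)
```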
@@ -191,6 +165,12 @@ From above, one can tell that MathBERT is specifically designed for mathematics
'token': 3182,
'token_str': 'places'}]
```
+Therefore, to try the 'fill-mask' hosted API widget on the right side of the page, please use sentences similar to the one below:
+
+```
+1 tenth times any [MASK] on the place value chart moves it one place value to the right. #from https://www.engageny.org/resource/grade-5-mathematics-module-1
+```
+
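The same check can also be scripted instead of using the hosted widget; a minimal sketch with the `transformers` fill-mask pipeline, again assuming a Hub id such as `tbs17/MathBERT`:

```python
# Sketch of querying the model through the fill-mask pipeline;
# the checkpoint id is an assumption, not stated in this diff.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="tbs17/MathBERT")
predictions = unmasker(
    "1 tenth times any [MASK] on the place value chart moves it one place value to the right."
)
for p in predictions:
    print(p["token_str"], round(p["score"], 4))
```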

#### Training data
The MathBERT model was pretrained on a pre-k to HS math curriculum (engageNY, Utah Math, Illustrative Math), college math books from openculture.com, as well as graduate-level math from arXiv math paper abstracts. In total, about 100M tokens were used for pretraining.