blimp

Running

yuvalw commited on Mar 10

Commit

c58b005

verified ·

1 Parent(s): 6aa3612

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-title: Blimp
 emoji: 🎈
 colorFrom: blue
 colorTo: red
@@ -18,7 +18,7 @@ description: >-
   For more information on perplexity, see the [dataset card](https://huggingface.co/datasets/nyu-mll/blimp).
 ---
-# Metric Card for Perplexity
 ## Metric Description
 BLiMP is a challenge set for evaluating what language models (LMs) know about major grammatical phenomena in English. BLiMP consists of 67 sub-datasets,
@@ -39,12 +39,12 @@ results = blimp.compute(model_id='pico-lm/pico-decoder')
 ```
 ### Inputs
-- **model_id** (str): model used for calculating blimp.
 - **batch_size** (int): the batch size to run texts through the model. Defaults to 16.
 - **device** (str): device to run on, defaults to `cuda` when available
 ### Output Values
-This metric outputs a dictionary with the blimp scores for each subdataset.
 If one of the input texts is longer than the max input length of the model, then it is truncated to the max length for the perplexity computation.
 ```

 ---
+title: BLiMP
 emoji: 🎈
 colorFrom: blue
 colorTo: red
   For more information on perplexity, see the [dataset card](https://huggingface.co/datasets/nyu-mll/blimp).
 ---
+# Metric Card for BLiMP
 ## Metric Description
 BLiMP is a challenge set for evaluating what language models (LMs) know about major grammatical phenomena in English. BLiMP consists of 67 sub-datasets,
 ```
 ### Inputs
+- **model_id** (str): model used for calculating BLiMP.
 - **batch_size** (int): the batch size to run texts through the model. Defaults to 16.
 - **device** (str): device to run on, defaults to `cuda` when available
 ### Output Values
+This metric outputs a dictionary with the BLiMP scores for each subdataset.
 If one of the input texts is longer than the max input length of the model, then it is truncated to the max length for the perplexity computation.
 ```