LingoIITGN/ganga-1b


Lingo-IITGN committed on Jul 2

Commit 8fc0b91 • 1 Parent(s): 6dd3352

Update README.md

Files changed (1)

README.md +36 -14


@@ -81,16 +81,38 @@ This model described is a research preview and is under ongoing iterative updati
 ### Results
-|    Model    | Fertility |   PPL  |
 |:-----------:|:---------:|:------:|
-|   ganga-1b  |    1.12   |  34.85 |
-|  pragna-1b  |    1.58   |  12.74 |
-|  bloom-1b1  |    1.27   |  33.39 |
-|  bloom-1b7  |    1.27   |  26.63 |
-|   gemma-2b  |    1.89   |  41.67 |
-|   bloom-3b  |    1.27   |  23.77 |
-| airavata-7b |    1.69   |  46.24 |
 #### Summary
@@ -104,15 +126,15 @@ This model described is a research preview and is under ongoing iterative updati
 Ganga-1b is a decoder-only transformer model, featuring the following specifications:
-* #Layers: 16
-* #Attention heads: 32
-* Embedding dimension: 2048
-* Vocabulary size: 30000
 * Sliding window: 512
 * Intermediate dimension: 716
 ## Model Card Contact
-[Lingo Research Labs at IIT Gandhinagar, India](https://labs.iitgn.ac.in/lingo/)
 Mail at: [[email protected]]([email protected])

 ### Results
+<details open>
+<summary>Tokenizers Results</summary>
+<br>
+|    Model    | Fertility |
+|:-----------:|:---------:|
+|   ***ganga-1b***  |    ***1.12***   |
+|  pragna-1b  |    1.58   |
+|  bloom-1b1  |    1.27   |
+|  bloom-1b7  |    1.27   |
+|   gemma-2b  |    1.89   |
+|   bloom-3b  |    1.27   |
+| airavata-7b |    1.69   |
+</details>
+<details open>
+<summary>Metrics</summary>
+<br>
+|    Model    | PPL_{Ours} |   PPL_{Airawat}  |
 |:-----------:|:---------:|:------:|
+|   ganga-1b  |       |  34.85 |
+|  pragna-1b  |       |  12.74 |
+|  bloom-1b1  |       |  33.39 |
+|  bloom-1b7  |       |  26.63 |
+|   gemma-2b  |       |  41.67 |
+|   bloom-3b  |       |  23.77 |
+| airavata-7b |       |  46.24 |
+</details>
 #### Summary
 Ganga-1b is a decoder-only transformer model, featuring the following specifications:
+* Layers: 16
+* Attention heads: 32
+* Embedding dimension: 2,048
+* Vocabulary size: 30,000
 * Sliding window: 512
 * Intermediate dimension: 716
 ## Model Card Contact
+[Lingo Research Labs at IIT Gandhinagar, India](https://labs.iitgn.ac.in/lingo/) <br>
 Mail at: [[email protected]]([email protected])
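
Note on the two metrics in the Results tables: the README does not define them in this diff, so the sketch below assumes the common definitions, where fertility is the average number of tokens a tokenizer produces per whitespace-separated word, and perplexity (PPL) is the exponential of the mean negative log-likelihood per token. `toy_tokenize` is a hypothetical stand-in tokenizer used only to keep the example self-contained; it is not the ganga-1b tokenizer, and the numbers it yields are unrelated to the reported results.

```python
import math
from typing import Callable, Iterable, List


def fertility(texts: Iterable[str], tokenize: Callable[[str], List[str]]) -> float:
    """Average tokens per whitespace-separated word (assumed definition)."""
    total_tokens = 0
    total_words = 0
    for text in texts:
        words = text.split()
        total_words += len(words)
        # Tokenize word by word so the count is independent of spacing rules.
        total_tokens += sum(len(tokenize(word)) for word in words)
    if total_words == 0:
        raise ValueError("no words in input")
    return total_tokens / total_words


def perplexity(token_log_probs: List[float]) -> float:
    """exp of the mean negative natural-log probability per token (assumed definition)."""
    if not token_log_probs:
        raise ValueError("no token log-probabilities given")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


def toy_tokenize(word: str, max_len: int = 4) -> List[str]:
    """Hypothetical tokenizer: split a word into chunks of at most max_len characters."""
    return [word[i:i + max_len] for i in range(0, len(word), max_len)]
```

Under these assumptions, a lower fertility means fewer tokens are needed per word, and a lower PPL means the model assigns higher probability to the evaluation text. To reproduce the tables, `tokenize` would be replaced by the real tokenizer of each model and `token_log_probs` by the log-probabilities each model assigns to the evaluation tokens.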