We use the binary formulation of this task (positive vs. negative).

<summary>Method</summary>

* Evaluation setting: zero-shot and few-shot perplexity-based evaluation.
* Prompt: ```"Tekst: {text}\nSentiment:{label}"```, where the ```label``` is either "positiv" or "negativ".
* Few-shot results show the average scores across 5 repetitions.
* Evaluation script: https://github.com/ltgoslo/norallm/blob/main/initial_evaluation/sentiment_analysis.py
* Performance metric: macro-averaged F1-score.
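Perplexity-based evaluation as described above amounts to filling the prompt with each candidate label and keeping the label the model scores as most likely. A minimal sketch of that selection logic — with `dummy_nll` as an invented stand-in for a real language model's negative log-likelihood, purely so the example runs:

```python
def pick_label(text, score_nll, labels=("positiv", "negativ")):
    # Fill the evaluation prompt with each candidate label and keep the
    # label whose completed prompt receives the lowest total negative
    # log-likelihood (i.e. the lowest perplexity) under the model.
    prompt = "Tekst: {text}\nSentiment:{label}"
    return min(labels, key=lambda lab: score_nll(prompt.format(text=text, label=lab)))


# Hypothetical scorer, NOT a real model: it simply prefers "positiv"
# whenever the input contains the word "fantastisk".
def dummy_nll(s):
    return 1.0 if "fantastisk" in s and s.endswith("positiv") else 2.0


print(pick_label("Filmen var fantastisk!", dummy_nll))  # → positiv
```

In practice `score_nll` would sum the model's token-level log-probabilities over the filled-in prompt; the linked evaluation script is the authoritative implementation.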

### Reading comprehension

[NorQuAD](https://huggingface.co/datasets/ltg/norquad) ([Ivanova et al., 2023](https://aclanthology.org/2023.nodalida-1.17/)) is a dataset for extractive question answering in Norwegian, designed similarly to [SQuAD (Rajpurkar et al., 2016)](https://aclanthology.org/D16-1264/).

<details>
<summary>Method</summary>

* Evaluation setting: zero-shot and few-shot settings via natural language generation using the greedy decoding strategy.
* Prompt: ```"Tittel: {title}\n\nTekst: {text}\n\nSpørsmål: {question}\n\nSvar:{answer}"```, based on [Brown et al. (2020)](https://arxiv.org/abs/2005.14165).
* Few-shot results show the average scores across 5 repetitions.
* Evaluation script: https://github.com/ltgoslo/norallm/blob/main/initial_evaluation/norquad.py
* Performance metrics: macro-averaged F1-score and exact match (EM).
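For extractive QA, exact match and token-level F1 are conventionally computed over normalized answer strings, in the style of the original SQuAD evaluation. The sketch below shows that common formulation — it is an illustration, not the repository's exact script; the linked `norquad.py` is authoritative:

```python
import re
import string
from collections import Counter

def normalize(s: str) -> str:
    # Lowercase, strip punctuation, and collapse whitespace before comparing.
    s = "".join(ch for ch in s.lower() if ch not in set(string.punctuation))
    return re.sub(r"\s+", " ", s).strip()

def exact_match(prediction: str, gold: str) -> float:
    # EM: 1.0 only if the normalized strings are identical.
    return float(normalize(prediction) == normalize(gold))

def token_f1(prediction: str, gold: str) -> float:
    # F1 over the multiset of tokens shared by prediction and gold answer.
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("Oslo!", "oslo"))                       # → 1.0
print(round(token_f1("i Oslo sentrum", "Oslo sentrum"), 3))  # → 0.8
```

Note that the English SQuAD script additionally strips articles ("a", "an", "the") during normalization, a step that does not carry over to Norwegian as-is.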