moshew
/

distilbilstm-finetuned-sst-2-english

Model card Files Files and versions Community

moshew commited on Dec 22, 2022

Commit

cd4126b

·

1 Parent(s): 6448dc5

Update README.md

Files changed (1) hide show

README.md +46 -5

README.md CHANGED Viewed

@@ -2,17 +2,58 @@
 library_name: keras
 ---
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 library_name: keras
 ---
+x100 smaller with less than 0.5 accuracy drop vs. distilbert-base-uncased-finetuned-sst-2-english
 ## Model description
+2 Layers Bilstm model finetuned on SST-2 and distlled from RoBERTa teacher
+distilbert-base-uncased-finetuned-sst-2-english:    92.2 accuracy, 67M parameters
+moshew/distilbilstm-finetuned-sst-2-english:        91.9 accuracy, 66K parameters
+## How to get started with the model
+Example on SST-2 test dataset classification:
+```python
+from datasets import load_dataset
+import numpy as np
+from sklearn.metrics import accuracy_score
+from keras.preprocessing.text import Tokenizer
+from keras.utils import pad_sequences
+import tensorflow as tf
+from huggingface_hub import from_pretrained_keras
+from datasets import load_dataset
+sst2 = load_dataset("SetFit/sst2")
+augmented_sst2_dataset = load_dataset("jmamou/augmented-glue-sst2")
+oov_token = '<UNK>' # Required only if test is not given
+pad_type = 'post'
+trunc_type = 'post'
+# Tokenize our training data
+tokenizer = Tokenizer(num_words=10000)
+tokenizer.fit_on_texts(augmented_sst2_dataset['train']['sentence'])
+# Encode training data sentences into sequences
+test_sequences = tokenizer.texts_to_sequences(sst2['test']['text'])
+# Pad the training sequences
+test_padded = pad_sequences(test_sequences, padding=pad_type, truncating=trunc_type, maxlen=64)
+reloaded_model = from_pretrained_keras('moshew/distilbilstm-finetuned-sst-2-english')
+pred=reloaded_model.predict(test_padded)
+pred_bin = np.argmax(pred,1)
+accuracy_score(pred_bin, sst2['test']['label'])
+```
+0.9187259747391543
 ## Training procedure