Reorganizing demo
app.py
CHANGED
@@ -52,17 +52,27 @@ st.set_page_config(page_title="BERTIN Demo", page_icon=LOGO)
 st.title("BERTIN")

 #Sidebar
-st.sidebar.
+st.sidebar.markdown(f"""
+<div align=center>
+<img src="{LOGO}" width=200/>
+
+# BERTIN
+
+</div>
+
+BERTIN is a series of BERT-based models for Spanish.
+
+The models are trained with Flax and using TPUs sponsored by Google since this is part of the
+[Flax/Jax Community Week](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104)
+organised by HuggingFace.
+
+Please read our [full report](https://huggingface.co/bertin-project/bertin-roberta-base-spanish) for more details on the methodology and metrics on downstream tasks.
+
+""", unsafe_allow_html=True)

 # Body
 st.markdown(
 """
-BERTIN is a series of BERT-based models for Spanish.
-
-The models are trained with Flax and using TPUs sponsored by Google since this is part of the
-[Flax/Jax Community Week](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104)
-organised by HuggingFace.
-
 All models are variations of **RoBERTa-base** trained from scratch in **Spanish** using a sample from the **mc4 dataset**.
 We reduced the dataset size to 50 million documents to keep training times shorter, and also to be able to bias training examples based on their perplexity.

@@ -72,15 +82,21 @@ st.markdown(
 * **Stepwise** applies four different sampling probabilities to each of the four quartiles of the perplexity distribution.

 The first models have been trained (250.000 steps) on sequence length 128, and then training for Gaussian changed to sequence length 512 for the last 25.000 training steps to yield another version.
-
+
 Please read our [full report](https://huggingface.co/bertin-project/bertin-roberta-base-spanish) for more details on the methodology and metrics on downstream tasks.
 """
 )

-
-
+col1, col2, col3 = st.beta_columns(3)
+strategy = col1.selectbox("Sampling strategy", ["Gaussian", "Stepwise", "Random"])
+seq_len = col2.selectbox("Sequence length", [128, 512])
+
+if seq_len == 128:
+    model_url = f"bertin-project/bertin-base-{str(strategy).lower()}"
+else:
+    model_url = f"bertin-project/bertin-base-{str(strategy).lower()}-exp-512seqlen"

-prompt =
+prompt = col3.selectbox("Prompt", ["Random", "Custom"])
 if prompt == "Custom":
     prompt_box = "Enter your masked text here..."
 else:
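The diff cuts off before showing how `model_url` and `prompt_box` are consumed further down in `app.py`. Below is a minimal sketch of how the selected checkpoint could back a fill-mask demo, assuming the standard `transformers` pipeline API; the hard-coded checkpoint name and the example sentence are illustrative, not taken from the app.

```python
from transformers import pipeline

# Example checkpoint name as the selectboxes above would build it
# (strategy "Gaussian", sequence length 128).
model_url = "bertin-project/bertin-base-gaussian"
fill_mask = pipeline("fill-mask", model=model_url)

# In the app the masked text would come from prompt_box; RoBERTa-style
# models such as BERTIN use "<mask>" as the mask token.
for prediction in fill_mask("Fui a la librería a comprar un <mask>."):
    print(prediction["token_str"], round(prediction["score"], 3))
```

As a side note, `st.beta_columns` used in the new code was the pre-1.0 Streamlit name for this layout helper; later releases expose it as `st.columns`.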
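The body text describes biasing the mC4 training sample by perplexity, with **Stepwise** assigning four different sampling probabilities, one per perplexity quartile. A rough sketch of that idea follows, using made-up perplexities and placeholder probabilities rather than the values actually used for BERTIN.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical per-document perplexities and illustrative quartile probabilities;
# the real values come from the mC4 sample and the BERTIN report.
perplexities = rng.lognormal(mean=5.0, sigma=1.0, size=100_000)
quartile_probs = np.array([0.20, 0.35, 0.35, 0.10])  # placeholder, one per quartile

# Assign each document to its perplexity quartile (0..3).
boundaries = np.quantile(perplexities, [0.25, 0.50, 0.75])
quartile = np.searchsorted(boundaries, perplexities)

# Stepwise sampling: keep a document with the probability of its quartile.
keep = rng.random(perplexities.size) < quartile_probs[quartile]
print(f"kept {keep.sum()} of {perplexities.size} documents")
```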