Update app.py
Browse files
app.py
CHANGED
@@ -39,20 +39,13 @@ def main():
|
|
39 |
|
40 |
st.title("Semantic Search for Questions on Reddit.")
|
41 |
|
42 |
-
st.write("""This application lets you perform a semantic search through questions in the r/ELI5 [dataset](https://huggingface.co/datasets/eli5).
|
43 |
-
|
44 |
-
|
45 |
-
|
46 |
-
|
47 |
-
|
48 |
-
|
49 |
-
|
50 |
-
The score corresponds to the rating each answer recieved when posted on Reddit.
|
51 |
-
|
52 |
-
User the slider on the left to change the number of results shown.
|
53 |
-
|
54 |
-
We unfortunately cannot verify the veracity of any of the answers posted!
|
55 |
-
""")
|
56 |
|
57 |
|
58 |
# User search
|
|
|
39 |
|
40 |
st.title("Semantic Search for Questions on Reddit.")
|
41 |
|
42 |
+
st.write("""This application lets you perform a semantic search through questions in the r/ELI5 [dataset](https://huggingface.co/datasets/eli5). \
|
43 |
+
The questions and user input are encoded into a high-dimensional vectors space using a Sentence-Transformer model, and in particular the checkpoint [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2").
|
44 |
+
To perform the search we use FAISS, which performs an efficient similarity search through the (vectorized) questions.
|
45 |
+
The ELI5 dataset contains posts from three subreddits, AskScience (asks), AskHistorians (askh), and ExplainLikeImFive (eli5).
|
46 |
+
The score corresponds to the rating each answer recieved when posted on Reddit.
|
47 |
+
Use the slider on the left to change the number of results shown.
|
48 |
+
We unfortunately cannot verify the veracity of any of the answers posted!""")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
49 |
|
50 |
|
51 |
# User search
|