dhmeltzer commited on
Commit
368d588
·
1 Parent(s): 669cc9e

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +10 -5
app.py CHANGED
@@ -37,18 +37,23 @@ def main():
37
  return response.json()
38
 
39
 
40
- st.title("Vector-based of the r/ELI5 dataset with Sentence Transformers and Faiss")
41
 
42
- st.markdown("""This application lets you perform a semantic search through questions in the r/ELI5 <a href="https://huggingface.co/datasets/eli5">dataset</a>.
43
- The questions and user input are encoded into a high-dimensional vectors space using a Sentence-Transformer model, and in particular the checkpoint <a href="https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2">sentence-transformers/all-MiniLM-L6-v2</a>.
 
 
44
  To perform the search we use FAISS, which performs an efficient similarity search through the (vectorized) questions.
 
45
  The ELI5 dataset contains posts from three subreddits, AskScience (asks), AskHistorians (askh), and ExplainLikeImFive (eli5).
 
46
  The score corresponds to the rating each answer recieved when posted on Reddit.
 
 
 
47
  We unfortunately cannot verify the veracity of any of the answers posted!
48
  """)
49
 
50
- st.markdown(""" To change the number of search results listed, simply move the slider located in the sidebar.
51
- """)
52
 
53
  # User search
54
  user_input = st.text_area("Search box", "What is spacetime made out of?")
 
37
  return response.json()
38
 
39
 
40
+ st.title("Semantic Search for Questions on Reddit.")
41
 
42
+ st.write("""This application lets you perform a semantic search through questions in the r/ELI5 [dataset](https://huggingface.co/datasets/eli5).
43
+
44
+ The questions and user input are encoded into a high-dimensional vectors space using a Sentence-Transformer model, and in particular the checkpoint [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2").
45
+
46
  To perform the search we use FAISS, which performs an efficient similarity search through the (vectorized) questions.
47
+
48
  The ELI5 dataset contains posts from three subreddits, AskScience (asks), AskHistorians (askh), and ExplainLikeImFive (eli5).
49
+
50
  The score corresponds to the rating each answer recieved when posted on Reddit.
51
+
52
+ User the slider on the left to change the number of results shown.
53
+
54
  We unfortunately cannot verify the veracity of any of the answers posted!
55
  """)
56
 
 
 
57
 
58
  # User search
59
  user_input = st.text_area("Search box", "What is spacetime made out of?")