Lazyhope commited on
Commit
09b90bc
·
1 Parent(s): cd74778

Add README and change the prompt in the search bar to numpy/numpy

Browse files
Files changed (2) hide show
  1. README.md +37 -1
  2. app.py +1 -1
README.md CHANGED
@@ -1,2 +1,38 @@
1
  # RepoSnipy
2
- Neural search engine for discovering semantically similar Python repositories on GitHub
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # RepoSnipy
2
+
3
+ [![Open in Spaces](https://huggingface.co/datasets/huggingface/badges/raw/main/open-in-hf-spaces-md-dark.svg)](https://huggingface.co/spaces/Lazyhope/RepoSnipy)
4
+
5
+ Neural search engine for discovering semantically similar Python repositories on GitHub.
6
+
7
+ ## About
8
+
9
+ RepoSnipy is a neural search engine built with [streamlit](https://github.com/streamlit/streamlit) and [docarray](https://github.com/docarray/docarray). You can query a public Python repository hosted on GitHub and find popular repositories that are semantically similar to it.
10
+
11
+ It uses the [RepoSim](https://github.com/RepoAnalysis/RepoSim/) pipeline to create embeddings for Python repositories. We have created a [vector dataset](data/index.bin) (stored as docarray index) of over 9700 GitHub Python repositories that has license and over 300 stars by the time of 20th May, 2023.
12
+
13
+ ## Running Locally
14
+
15
+ Download the repository and install the required packages:
16
+
17
+ ```bash
18
+ git clone https://github.com/RepoAnalysis/RepoSnipy
19
+ cd RepoSnipy
20
+ pip install -r requirements.txt
21
+ ```
22
+
23
+ Then run the app on your local machine using:
24
+
25
+ ```bash
26
+ streamlit run app.py
27
+ ```
28
+
29
+ ## License
30
+
31
+ Distributed under the MIT License. See [LICENSE](LICENSE) for more information.
32
+
33
+ ## Acknowledgments
34
+
35
+ The model and the fine-tuning dataset used:
36
+
37
+ * [UniXCoder](https://arxiv.org/abs/2203.03850)
38
+ * [AdvTest](https://arxiv.org/abs/1909.09436)
app.py CHANGED
@@ -109,7 +109,7 @@ st.text_input(
109
  "Enter a GitHub repo URL or owner/repo (case-sensitive):",
110
  value="",
111
  max_chars=200,
112
- placeholder="huggingface/transformers",
113
  key="repo_input",
114
  )
115
 
 
109
  "Enter a GitHub repo URL or owner/repo (case-sensitive):",
110
  value="",
111
  max_chars=200,
112
+ placeholder="numpy/numpy",
113
  key="repo_input",
114
  )
115