Cachoups commited on
Commit
8aa4d6d
·
verified ·
1 Parent(s): 065778e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -1
README.md CHANGED
@@ -10,4 +10,13 @@ pinned: false
10
  short_description: A chatbot using RAG with Langchain and Hugging face model
11
  ---
12
 
13
- An example chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and the [Hugging Face Inference API](https://huggingface.co/docs/api-inference/index).
 
 
 
 
 
 
 
 
 
 
10
  short_description: A chatbot using RAG with Langchain and Hugging face model
11
  ---
12
 
13
+ An example chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and the [Hugging Face Inference API](https://huggingface.co/docs/api-inference/index).
14
+
15
+ Steps:
16
+ - Retrieve text data from website using Beautifulsoup
17
+ - Clean the data
18
+ - Chunk the text base on token character split using Langchain
19
+ - Use bm25 frequency based as a retriever as characters name are unique and have better results finding similar text to the query. Additionally it takes less time than embedding
20
+ - Promp engineering on the system (Adding context on system and update it each query or adding context on the user message). I chose to put it in the user message as results are better.
21
+ - Use open source to correct user queries syntax and grammar and for the chatbot
22
+ - Build gradio app