Spaces:

jchen8000
/

RAG_Demo

Sleeping

App Files Files Community

jchen8000 commited on May 28

Commit

f88614f

verified ·

1 Parent(s): ed2063e

Update app.py

Browse files

Files changed (1) hide show

app.py +17 -30

app.py CHANGED Viewed

@@ -19,50 +19,38 @@ print(f"Pyton version {sys.version}.")
 vector_store = None
 # Sample PDF file
-sample_filenames = ["Attention Is All You Need.pdf",
-                   "Generative Adversarial Nets.pdf",
-                   "Parameter-Efficient Transfer Learning for NLP.pdf",
                   ]
-sample_desc = """
-### 1. Attention Is All You Need (Vaswani et al., 2017)
-This groundbreaking paper introduced the **Transformer** architecture. It revolutionized natural language processing by enabling parallelization and significantly improving performance on tasks like translation, leading to models like *BERT* and *GPT*.
-### 2. Generative Adversarial Nets (Goodfellow et al., 2014)
-This paper proposed **GANs**, a novel framework for generative modeling using two neural networks—a generator and a discriminator—that compete in a zero-sum game.
-### 3. Parameter-Efficient Transfer Learning for NLP (Houlsby et al., 2019)
-This paper introduces **adapter modules**, a method for fine-tuning large pre-trained language models with significantly fewer parameters.
-It could take several minutes to load and index the files.
-"""
-rag_desc = """
 ### This is a Demo of Retrieval-Augmented Generation (RAG)
 **RAG** is an approach that combines retrieval-based and generative LLM models to improve the accuracy and relevance of generated text.
 It works by first retrieving relevant documents from an external knowledge source (like PDF files) and then using a LLM model to produce responses based on both the input query and the retrieved content.
 This method enhances factual correctness and allows the model to access up-to-date or domain-specific information without retraining.
 """
-examples_questions = [["What is Transformer?"],
-            ["What is Attention?"],
-            ["What is Scaled Dot-Product Attention?"],
-            ["What are Encoder and Decoder?"],
-            ["Describe more about the Transformer."],
-            ["Why use self-attention?"],
-            ["Describe Parameter-Efficient fine-tuning?"],
-            ["Describe Generative Adversarial Networks?"],
-            ["How does GAN work?"]
         ]
 template = \
 """Use the following pieces of context to answer the question at the end.
-If you don't know the answer, just say that you don't know, don't try to make up an answer.
-Always say "Thanks for asking!" at the end of the answer.
 {context}
@@ -155,13 +143,12 @@ additional_inputs = [
 # Create the Gradio interface
 with gr.Blocks(theme="Nymbo/Alyx_Theme") as demo:
     with gr.Tab("Indexing"):
-        gr.Markdown(rag_desc)
         # pdf_input = gr.File(label="Upload PDF", file_types=[".pdf"])
         # pdf_input = gr.Textbox(label="PDF File")
         # index_button = gr.Button("Index PDF")
         # load_sample = gr.Button("Alternatively, Load and Index [Attention Is All You Need.pdf] as a Sample")
         load_sample = gr.Button("Load and Index the following three papers as a RAG Demo")
-        sample_description = gr.Markdown(sample_desc)
         index_output = gr.Textbox(label="Indexing Status")
         # index_button.click(index_pdf, inputs=pdf_input, outputs=index_output)
         load_sample.click(load_sample_pdf, inputs=None, outputs=index_output)

 vector_store = None
 # Sample PDF file
+sample_filenames = ["Installation.pdf",
+                   "User Guide.pdf",
                   ]
+desc = """
 ### This is a Demo of Retrieval-Augmented Generation (RAG)
 **RAG** is an approach that combines retrieval-based and generative LLM models to improve the accuracy and relevance of generated text.
 It works by first retrieving relevant documents from an external knowledge source (like PDF files) and then using a LLM model to produce responses based on both the input query and the retrieved content.
 This method enhances factual correctness and allows the model to access up-to-date or domain-specific information without retraining.
+Click the button below to load a **User Guide** and an **Installation Guide** for the smoke alarm device into the vector database.
+Once you see the message *"PDF indexed successfully!"*, go to the **Chatbot** tab to ask any relevant questions about the device.
 """
+examples_questions = [["How long is the lifespan of this smoke alarm?"],
+            ["How often should I change the battery?"],
+            ["Where should I install the smoke alarm in my home?"],
+            ["How do I test if the smoke alarm is working?"],
+            ["What should I do if the smoke alarm keeps beeping?"],
+            ["Can this smoke alarm detect carbon monoxide too?"],
+            ["How do I clean the smoke alarm properly?"],
+            ["What type of battery does this smoke alarm use?"],
+            ["How loud is the smoke alarm when it goes off?"],
+            ["Can I install this smoke alarm on a wall instead of a ceiling?"],
         ]
 template = \
 """Use the following pieces of context to answer the question at the end.
+If you don't know the answer, just say you don't know because no relevant information in the provided documents, don't try to make up an answer.
 {context}
 # Create the Gradio interface
 with gr.Blocks(theme="Nymbo/Alyx_Theme") as demo:
     with gr.Tab("Indexing"):
+        gr.Markdown(desc)
         # pdf_input = gr.File(label="Upload PDF", file_types=[".pdf"])
         # pdf_input = gr.Textbox(label="PDF File")
         # index_button = gr.Button("Index PDF")
         # load_sample = gr.Button("Alternatively, Load and Index [Attention Is All You Need.pdf] as a Sample")
         load_sample = gr.Button("Load and Index the following three papers as a RAG Demo")
         index_output = gr.Textbox(label="Indexing Status")
         # index_button.click(index_pdf, inputs=pdf_input, outputs=index_output)
         load_sample.click(load_sample_pdf, inputs=None, outputs=index_output)