Spaces:

Prajith04
/

MultiDoc-RAG-Agent

Sleeping

App Files Files Community

Prajith04 commited on May 8

Commit

db70da0

verified ·

1 Parent(s): 0f3e71e

Upload 10 files

Browse files

Files changed (10) hide show

Dockerfile +33 -0
Readme.md +73 -0
agents.py +11 -0
gradio_demo.py +34 -0
main.py +11 -0
prompts.py +14 -0
query_vectordb.py +29 -0
requirements.txt +124 -0
store2db.py +37 -0
tools.py +13 -0

Dockerfile ADDED Viewed

	@@ -0,0 +1,33 @@

+FROM python:3.10-slim
+# Install system dependencies
+RUN apt-get update && apt-get install -y \
+    git curl && \
+    rm -rf /var/lib/apt/lists/*
+# Set working directory
+WORKDIR /app
+# Copy project files
+COPY . /app
+# Create cache directory
+RUN mkdir -p /app/cache && chmod -R 777 /app/cache
+# Set environment variables
+ENV TRANSFORMERS_CACHE=/app/cache \
+    HF_HOME=/app/cache \
+    SENTENCE_TRANSFORMERS_HOME=/app/cache \
+    PORT=7860 \
+    PYTHONUNBUFFERED=1
+# Install Python dependencies
+RUN pip install --no-cache-dir --upgrade pip && \
+    pip install --no-cache-dir -r requirements.txt && \
+    pip install gradio
+# Expose Gradio port
+EXPOSE 7860
+# Run the app
+CMD ["python", "gradio_demo.py"]

Readme.md ADDED Viewed

	@@ -0,0 +1,73 @@

+---
+title: MultiDoc-RAG-Agent
+emoji: 💻
+colorFrom: blue
+colorTo: yellow
+sdk: gradio
+sdk_version: 5.28.0
+app_file: gradio_demo.py
+pinned: false
+---
+# MultiDoc-RAG-Agent
+## Overview
+The MultiDoc-RAG-Agent is a Retrieval-Augmented Generation (RAG) system designed to interact with users, retrieve relevant documents, and provide intelligent responses. It leverages advanced language models, vector databases, and tools to process queries effectively. This system is particularly useful for scenarios requiring document retrieval and contextual understanding, such as customer support, research assistance, and knowledge management.
+## Components
+### 1. Agents
+- **File**: `agents.py`
+- **Description**: Defines the `rag_agent` function, which creates a tool-calling agent using a language model, tools, and a prompt. The agent is responsible for orchestrating interactions between the user, tools, and the language model to generate accurate and contextually relevant responses.
+### 2. Main Application
+- **File**: `main.py`
+- **Description**: Implements a command-line interface for interacting with the RAG agent. It processes user queries, maintains a chat history, and ensures seamless communication between the user and the agent. This serves as the entry point for users to interact with the system.
+### 3. Prompts
+- **File**: `prompts.py`
+- **Description**: Contains functions to generate prompts for the agent and retriever using templates and a hub-pulled prompt. These prompts guide the language model in understanding the context and generating appropriate responses.
+### 4. Query Vector Database
+- **File**: `query_vectordb.py`
+- **Description**: Handles vector database interactions, initializes chat models, and provides a function to retrieve documents based on similarity. This component ensures efficient and accurate retrieval of relevant documents from the vector database.
+### 5. Document Storage
+- **File**: `store2db.py`
+- **Description**: Loads PDF documents, splits them into smaller chunks, and stores them in a Qdrant vector database. This enables the system to handle large documents and retrieve specific sections relevant to user queries.
+### 6. Tools
+- **File**: `tools.py`
+- **Description**: Defines tools for the agent, including a retriever tool for Samsung mobile-related queries and a calculator tool. These tools extend the agent's capabilities, allowing it to perform specialized tasks.
+## How to Use
+### 1. Setup
+- Ensure all dependencies are installed.
+- Configure environment variables in a `.env` file. For example:
+  - `GROQ_API_KEY`: API key for the language model.
+  - `QDRANT_URL`: URL for the Qdrant vector database.
+  - `QDRANT_API_KEY`: API key for the Qdrant vector database.
+### 2. Run the Application
+- Execute `main.py` to start the command-line interface.
+- Enter queries to interact with the agent and retrieve intelligent responses.
+### 3. Document Storage
+- Use `store2db.py` to load and store documents in the vector database. This step is essential for preparing the system to handle user queries effectively.
+## Dependencies
+- **Python**: The primary programming language used for the project.
+- **LangChain**: A framework for building applications with language models.
+- **Qdrant**: A vector database for storing and retrieving document embeddings.
+- **HuggingFace**: A library for natural language processing and machine learning models.
+- **dotenv**: A library for managing environment variables.
+## Example Use Case
+1. A user queries the system about a specific topic related to Samsung mobile devices.
+2. The agent retrieves relevant documents from the vector database using `query_vectordb.py`.
+3. The language model processes the retrieved documents and generates a coherent response.
+4. The user receives an intelligent and contextually accurate answer.
+## License
+This project is licensed under the MIT License.

agents.py ADDED Viewed

	@@ -0,0 +1,11 @@

+from langchain.agents import create_tool_calling_agent
+from query_vectordb import chat_model
+from tools import retrieve_tool, calculator_tool
+from prompts import agent_prompt
+def rag_agent():
+    llm=chat_model()
+    tools = [retrieve_tool(), calculator_tool()]
+    prompt=agent_prompt()
+    agent = create_tool_calling_agent(llm, tools, prompt)
+    return agent

gradio_demo.py ADDED Viewed

	@@ -0,0 +1,34 @@

+import gradio as gr
+from langchain_community.chat_message_histories import ChatMessageHistory
+from langchain.agents import AgentExecutor
+from agents import rag_agent
+from tools import retrieve_tool, calculator_tool
+chat_history_obj = ChatMessageHistory()
+agent_executor = AgentExecutor(
+    agent=rag_agent(),
+    tools=[retrieve_tool(), calculator_tool()],
+    verbose=True,
+    return_intermediate_steps=True,
+)
+def chat_interface(user_input,history_list):
+    response = agent_executor.invoke({"input": user_input, "chat_history": chat_history_obj.messages})
+    chat_history_obj.add_user_message(user_input)
+    chat_history_obj.add_ai_message(response['output'])
+    print(response)
+    if len(response['intermediate_steps']) > 0:
+        final_response ="Final Output:\n\n"+response['output']+'\n\nTool Used:'+response['intermediate_steps'][0][0].tool+'\n\nTool output:\n'+response['intermediate_steps'][0][1]
+        return final_response
+    response = "Final Output:\n\n"+response['output']
+    return response
+iface = gr.ChatInterface(
+    fn=chat_interface,
+    examples=["how to turn on dark mode in Samsung S25","what is 23*56-67+99*78"],
+    cache_examples=False,
+)
+if __name__ == "__main__":
+    iface.launch()

main.py ADDED Viewed

	@@ -0,0 +1,11 @@

+from langchain_community.chat_message_histories import ChatMessageHistory
+from langchain.agents import AgentExecutor
+from agents import rag_agent
+from tools import retrieve_tool, calculator_tool
+chat_history = ChatMessageHistory()
+agent_executor = AgentExecutor(agent=rag_agent(),tools=[retrieve_tool(),calculator_tool()], verbose=True)
+while True:
+    response=agent_executor.invoke({"input": input("Enter the query:"),"chat_history":chat_history.messages})
+    chat_history.add_ai_message(response['input'])
+    chat_history.add_ai_message(response['output'])
+    print(response)

prompts.py ADDED Viewed

	@@ -0,0 +1,14 @@

+from langchain.prompts import ChatPromptTemplate, HumanMessagePromptTemplate, SystemMessagePromptTemplate,MessagesPlaceholder
+from langchain import hub
+def retriever_prompt():
+    return ChatPromptTemplate.from_messages([
+    SystemMessagePromptTemplate.from_template(
+        "Use the context to answer the question:\nContext: {context}"
+        "these are the titles of manuals you have:\nManuals: {docs}"
+    ),
+    HumanMessagePromptTemplate.from_template("{query}"),
+])
+def agent_prompt():
+    prompt = hub.pull("hwchase17/openai-functions-agent")
+    return prompt

query_vectordb.py ADDED Viewed

	@@ -0,0 +1,29 @@

+from langchain.chat_models import init_chat_model
+from dotenv import load_dotenv
+import os
+from langchain_huggingface import HuggingFaceEmbeddings
+from langchain_qdrant import QdrantVectorStore
+load_dotenv()
+def chat_model():
+    groq_api_key = os.getenv('GROQ_API_KEY')
+    llm = init_chat_model("mistral-saba-24b", model_provider="groq",api_key=groq_api_key)
+    return llm
+def small_chat_model():
+    groq_api_key = os.getenv('GROQ_API_KEY')
+    llm = init_chat_model("llama-3.3-70b-versatile", model_provider="groq",api_key=groq_api_key)
+    return llm
+def init_vector_store():
+    embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")
+    doc_store = QdrantVectorStore.from_existing_collection(
+    embedding=embeddings,
+    collection_name="multidoc-rag-agent",
+    url=os.getenv('QDRANT_URL'),
+    api_key=os.getenv('QDRANT_API_KEY'))
+    return doc_store
+def retrieve_docs(query, doc_store):
+    retriever = doc_store.as_retriever(search_type="similarity", search_kwargs={"k": 3,})
+    response=retriever.invoke(query)
+    return response

requirements.txt ADDED Viewed

	@@ -0,0 +1,124 @@

+aiofiles==24.1.0
+aiohappyeyeballs==2.6.1
+aiohttp==3.11.18
+aiosignal==1.3.2
+annotated-types==0.7.0
+anyio==4.9.0
+attrs==25.3.0
+certifi==2025.4.26
+charset-normalizer==3.4.2
+click==8.1.8
+dataclasses-json==0.6.7
+distro==1.9.0
+dotenv==0.9.9
+fastapi==0.115.12
+ffmpy==0.5.0
+filelock==3.18.0
+frozenlist==1.6.0
+fsspec==2025.3.2
+gradio==5.29.0
+gradio-client==1.10.0
+greenlet==3.2.1
+groovy==0.1.2
+groq==0.24.0
+grpcio==1.71.0
+h11==0.16.0
+h2==4.2.0
+hpack==4.1.0
+httpcore==1.0.9
+httpx==0.28.1
+httpx-sse==0.4.0
+huggingface-hub==0.30.2
+hyperframe==6.1.0
+idna==3.10
+jinja2==3.1.6
+joblib==1.5.0
+jsonpatch==1.33
+jsonpointer==3.0.0
+langchain==0.3.25
+langchain-community==0.3.23
+langchain-core==0.3.58
+langchain-groq==0.3.2
+langchain-huggingface==0.1.2
+langchain-qdrant==0.2.0
+langchain-text-splitters==0.3.8
+langsmith==0.3.42
+markdown-it-py==3.0.0
+markupsafe==3.0.2
+marshmallow==3.26.1
+mdurl==0.1.2
+mpmath==1.3.0
+multidict==6.4.3
+mypy-extensions==1.1.0
+networkx==3.4.2
+numexpr==2.10.2
+numpy==2.2.5
+nvidia-cublas-cu12==12.6.4.1
+nvidia-cuda-cupti-cu12==12.6.80
+nvidia-cuda-nvrtc-cu12==12.6.77
+nvidia-cuda-runtime-cu12==12.6.77
+nvidia-cudnn-cu12==9.5.1.17
+nvidia-cufft-cu12==11.3.0.4
+nvidia-cufile-cu12==1.11.1.6
+nvidia-curand-cu12==10.3.7.77
+nvidia-cusolver-cu12==11.7.1.2
+nvidia-cusparse-cu12==12.5.4.2
+nvidia-cusparselt-cu12==0.6.3
+nvidia-nccl-cu12==2.26.2
+nvidia-nvjitlink-cu12==12.6.85
+nvidia-nvtx-cu12==12.6.77
+orjson==3.10.18
+packaging==24.2
+pandas==2.2.3
+pillow==11.2.1
+portalocker==2.10.1
+propcache==0.3.1
+protobuf==6.30.2
+pydantic==2.11.4
+pydantic-core==2.33.2
+pydantic-settings==2.9.1
+pydub==0.25.1
+pygments==2.19.1
+pypdf==5.4.0
+python-dateutil==2.9.0.post0
+python-dotenv==1.1.0
+python-multipart==0.0.20
+pytz==2025.2
+pyyaml==6.0.2
+qdrant-client==1.14.2
+regex==2024.11.6
+requests==2.32.3
+requests-toolbelt==1.0.0
+rich==14.0.0
+ruff==0.11.8
+safehttpx==0.1.6
+safetensors==0.5.3
+scikit-learn==1.6.1
+scipy==1.15.2
+semantic-version==2.10.0
+sentence-transformers==4.1.0
+setuptools==80.3.1
+shellingham==1.5.4
+six==1.17.0
+sniffio==1.3.1
+sqlalchemy==2.0.40
+starlette==0.46.2
+sympy==1.14.0
+tenacity==9.1.2
+threadpoolctl==3.6.0
+tokenizers==0.21.1
+tomlkit==0.13.2
+torch==2.7.0
+tqdm==4.67.1
+transformers==4.51.3
+triton==3.3.0
+typer==0.15.3
+typing-extensions==4.13.2
+typing-inspect==0.9.0
+typing-inspection==0.4.0
+tzdata==2025.2
+urllib3==2.4.0
+uvicorn==0.34.2
+websockets==15.0.1
+yarl==1.20.0
+zstandard==0.23.0

store2db.py ADDED Viewed

	@@ -0,0 +1,37 @@

+from langchain_qdrant import QdrantVectorStore
+from langchain_community.document_loaders import PyPDFLoader
+from langchain_text_splitters import RecursiveCharacterTextSplitter
+from langchain_huggingface import HuggingFaceEmbeddings
+from qdrant_client import QdrantClient
+from qdrant_client.http.models import Distance, VectorParams
+import os
+from dotenv import load_dotenv
+load_dotenv()
+url=os.getenv('QDRANT_URL')
+api_key=os.getenv('QDRANT_API_KEY')
+client=QdrantClient(
+    url=url,
+    api_key=api_key,
+)
+embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")
+loader1 = PyPDFLoader("sam-a16.pdf")
+loader2 = PyPDFLoader("sam-s25.pdf")
+loader3 = PyPDFLoader("sam-fold.pdf")
+docs1 = loader1.load()
+docs2 = loader2.load()
+docs3 = loader3.load()
+docs = docs1 + docs2 + docs3
+text_splitter = RecursiveCharacterTextSplitter(
+    chunk_size=1000,  # chunk size (characters)
+    chunk_overlap=200,  # chunk overlap (characters)
+    add_start_index=True,  # track index in original document
+)
+all_splits = text_splitter.split_documents(docs)
+client.create_collection(
+    collection_name="multidoc-rag-agent",
+    vectors_config=VectorParams(size=768, distance=Distance.COSINE),
+)
+print(f"Split blog post into {len(all_splits)} sub-documents.")
+vector_store = QdrantVectorStore(client=client, embedding=embeddings, collection_name="multidoc-rag-agent")
+vector_store.add_documents(all_splits)
+print("Documents stored in Qdrant.")

tools.py ADDED Viewed

	@@ -0,0 +1,13 @@

+from langchain.tools.retriever import create_retriever_tool
+from query_vectordb import chat_model,init_vector_store,small_chat_model
+from langchain_community.agent_toolkits.load_tools import load_tools
+def retrieve_tool():
+    doc_store=init_vector_store()
+    retriever = doc_store.as_retriever(search_type="similarity", search_kwargs={"k": 3,})
+    retriever_tool = create_retriever_tool(
+    retriever,
+    "VectorDB_search",
+    "Use this tool when you need to answer questions about Samsung mobile phones, including their features, settings, or troubleshooting. For example: how to enable dark mode, battery saving tips, or camera settings.",)
+    return retriever_tool
+def calculator_tool():
+    return load_tools(["llm-math"],llm=small_chat_model())[0]