File size: 1,625 Bytes
6d06df0
 
 
 
36e9004
6d06df0
 
 
 
 
1799ab9
 
 
 
342b7bf
 
1799ab9
46c3b14
1799ab9
 
46c3b14
1799ab9
36174cd
1799ab9
46c3b14
1799ab9
 
 
 
 
 
46c3b14
1799ab9
 
 
46c3b14
 
342b7bf
1799ab9
342b7bf
1799ab9
342b7bf
 
1799ab9
 
 
 
 
 
 
 
 
342b7bf
1799ab9
 
 
 
 
342b7bf
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
title: RepoSage Chatbot
emoji: 🤖
colorFrom: indigo
colorTo: blue
sdk: streamlit
sdk_version: "1.46.0"
app_file: app.py
pinned: true
---
# RepoSage™ Chatbot

An MVP AI chatbot built in AML-3304 using Bayesian embeddings, a simple transformer block, and DeepSeek-R1 integration — all wired up with a GitHub-driven CI/CD pipeline to Hugging Face Spaces.

---

## 🚀 Live Demo

Try it out live:  
👉 https://huggingface.co/spaces/rohitrajpoot/reposage-chatbot

---

## 📖 Overview

**What it is:**  
- A command-line & web demo (via Streamlit) that shows:
  1. **Embedding Q&A**: nearest‐neighbor lookup in a trained token embedding (`assist/chat.py`)  
  2. **Bayesian Q&A**: frequency‐based “co-occurrence” embedding lookup (`assist/bayes_chat.py`)  
  3. **Transformer Demo**: single‐block transformer next‐token prediction (`assist/transformer_demo.py`)  
  4. **DeepSeek-R1**: calls to a 1.3B-parameter model for generative Q&A (wrapped to skip gracefully in Colab)  

**Why it matters:**  
- Demonstrates core GPT “atoms” (token → embedding → attention → generation)  
- Shows an end-to-end MLOps flow: local dev → GitHub Actions → Docker → Hugging Face Spaces

---

## ⚙️ Installation

### Local (macOS/Linux)

```bash
git clone https://github.com/rohitrajpoot/reposage.git
cd reposage

# 1) Create & activate venv
python3 -m venv .venv
source .venv/bin/activate

# 2) Install dependencies
pip install --upgrade pip setuptools wheel
pip install -r requirements.txt

# 3) Smoke-test CLI
python -m assist.main chat "hello world"

# 4) Run Streamlit demo
streamlit run app.py