VarsaGupta
commited on
Commit
·
27b71a0
1
Parent(s):
89b21de
initial-commit
Browse files
README.md
CHANGED
@@ -1,3 +1,75 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
--------------
|
2 |
+
language: en
|
3 |
+
tags:
|
4 |
+
- chatbot
|
5 |
+
- natural language processing
|
6 |
+
license: Apache 2.0
|
7 |
+
datasets:
|
8 |
+
- Custom Dataset (Dronealexa)
|
9 |
+
---------------
|
10 |
+
|
11 |
+
Model Card: NLP-Based Chatbot
|
12 |
+
|
13 |
+
----------Overview---------
|
14 |
+
|
15 |
+
The NLP-Based Chatbot is designed to explore Science & Technology topics. It utilizes a combination of semantic search and summarization techniques to provide relevant and concise responses to user queries.
|
16 |
+
|
17 |
+
Model Details
|
18 |
+
|
19 |
+
- Model Name: NLP-Based Chatbot
|
20 |
+
- Model Type: Natural Language Processing (NLP) Chatbot
|
21 |
+
- Framework: Gradio Blocks Interface, spaCy, Transformers
|
22 |
+
|
23 |
+
--------Components------------
|
24 |
+
|
25 |
+
1. Semantic Search
|
26 |
+
|
27 |
+
The chatbot employs semantic search to retrieve relevant information from a preprocessed dataset (Dronealexa.csv). The search is based on a TF-IDF vectorizer and cosine similarity calculations.
|
28 |
+
|
29 |
+
2. Summarization
|
30 |
+
|
31 |
+
A summarization pipeline is used to generate concise summaries of the retrieved information. The Hugging Face Transformers library is utilized for summarization tasks.
|
32 |
+
|
33 |
+
3. Custom Embeddings
|
34 |
+
|
35 |
+
The model incorporates custom text embeddings using spaCy and pre-trained word embeddings. These embeddings enhance the understanding of user queries and contribute to the semantic search.
|
36 |
+
|
37 |
+
4. Gradio Blocks Interface
|
38 |
+
|
39 |
+
The chatbot's frontend is built using Gradio Blocks Interface, providing an interactive and user-friendly platform for users to input queries and receive responses.
|
40 |
+
|
41 |
+
5. Model Card Generation
|
42 |
+
|
43 |
+
The model card generation involves constructing prompts based on search results and utilizing a summarization pipeline to produce model card content.
|
44 |
+
|
45 |
+
-----------Intended Use--------------
|
46 |
+
|
47 |
+
The NLP-Based Chatbot is intended for users interested in exploring Science & Technology topics. It can be used to obtain information from the provided dataset, and users are encouraged to provide feedback for continuous improvement.
|
48 |
+
|
49 |
+
---------------Training Data----------
|
50 |
+
|
51 |
+
The model is trained on a custom dataset (Dronealexa.csv) containing Science & Technology-related information. The dataset has been preprocessed to handle missing values and ensure efficient semantic search.
|
52 |
+
|
53 |
+
------------Evaluation Metrics---------------
|
54 |
+
|
55 |
+
- Semantic Search: TF-IDF Vectorizer, Cosine Similarity
|
56 |
+
- Summarization: Hugging Face Transformers Pipeline
|
57 |
+
|
58 |
+
----------------Ethical Considerations---------------------
|
59 |
+
|
60 |
+
The chatbot aims to provide accurate and relevant information. However, users are advised to critically evaluate the responses and understand that the model's knowledge is based on the training data.
|
61 |
+
|
62 |
+
-----------------Usage Instructions----------------------
|
63 |
+
|
64 |
+
1. Input your query in the provided textbox.
|
65 |
+
2. Click the "Send" button to receive a response.
|
66 |
+
3. Optionally, submit feedback using the "Submit Feedback" button.
|
67 |
+
|
68 |
+
--------------License--------------------
|
69 |
+
|
70 |
+
This model is released under the Apache 2.0 License.
|
71 |
+
|
72 |
+
--------------Contact Information---------------------
|
73 |
+
|
74 |
+
For inquiries or issues, please contact [email protected].
|
75 |
+
|