VarsaGupta commited on
Commit
27b71a0
·
1 Parent(s): 89b21de

initial-commit

Browse files
Files changed (1) hide show
  1. README.md +75 -3
README.md CHANGED
@@ -1,3 +1,75 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ --------------
2
+ language: en
3
+ tags:
4
+ - chatbot
5
+ - natural language processing
6
+ license: Apache 2.0
7
+ datasets:
8
+ - Custom Dataset (Dronealexa)
9
+ ---------------
10
+
11
+ Model Card: NLP-Based Chatbot
12
+
13
+ ----------Overview---------
14
+
15
+ The NLP-Based Chatbot is designed to explore Science & Technology topics. It utilizes a combination of semantic search and summarization techniques to provide relevant and concise responses to user queries.
16
+
17
+ Model Details
18
+
19
+ - Model Name: NLP-Based Chatbot
20
+ - Model Type: Natural Language Processing (NLP) Chatbot
21
+ - Framework: Gradio Blocks Interface, spaCy, Transformers
22
+
23
+ --------Components------------
24
+
25
+ 1. Semantic Search
26
+
27
+ The chatbot employs semantic search to retrieve relevant information from a preprocessed dataset (Dronealexa.csv). The search is based on a TF-IDF vectorizer and cosine similarity calculations.
28
+
29
+ 2. Summarization
30
+
31
+ A summarization pipeline is used to generate concise summaries of the retrieved information. The Hugging Face Transformers library is utilized for summarization tasks.
32
+
33
+ 3. Custom Embeddings
34
+
35
+ The model incorporates custom text embeddings using spaCy and pre-trained word embeddings. These embeddings enhance the understanding of user queries and contribute to the semantic search.
36
+
37
+ 4. Gradio Blocks Interface
38
+
39
+ The chatbot's frontend is built using Gradio Blocks Interface, providing an interactive and user-friendly platform for users to input queries and receive responses.
40
+
41
+ 5. Model Card Generation
42
+
43
+ The model card generation involves constructing prompts based on search results and utilizing a summarization pipeline to produce model card content.
44
+
45
+ -----------Intended Use--------------
46
+
47
+ The NLP-Based Chatbot is intended for users interested in exploring Science & Technology topics. It can be used to obtain information from the provided dataset, and users are encouraged to provide feedback for continuous improvement.
48
+
49
+ ---------------Training Data----------
50
+
51
+ The model is trained on a custom dataset (Dronealexa.csv) containing Science & Technology-related information. The dataset has been preprocessed to handle missing values and ensure efficient semantic search.
52
+
53
+ ------------Evaluation Metrics---------------
54
+
55
+ - Semantic Search: TF-IDF Vectorizer, Cosine Similarity
56
+ - Summarization: Hugging Face Transformers Pipeline
57
+
58
+ ----------------Ethical Considerations---------------------
59
+
60
+ The chatbot aims to provide accurate and relevant information. However, users are advised to critically evaluate the responses and understand that the model's knowledge is based on the training data.
61
+
62
+ -----------------Usage Instructions----------------------
63
+
64
+ 1. Input your query in the provided textbox.
65
+ 2. Click the "Send" button to receive a response.
66
+ 3. Optionally, submit feedback using the "Submit Feedback" button.
67
+
68
+ --------------License--------------------
69
+
70
+ This model is released under the Apache 2.0 License.
71
+
72
+ --------------Contact Information---------------------
73
+
74
+ For inquiries or issues, please contact [email protected].
75
+