Commit
·
27b71a0
1
Parent(s):
89b21de
initial-commit
Browse files
README.md
CHANGED
|
@@ -1,3 +1,75 @@
|
|
| 1 |
-
|
| 2 |
-
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
--------------
|
| 2 |
+
language: en
|
| 3 |
+
tags:
|
| 4 |
+
- chatbot
|
| 5 |
+
- natural language processing
|
| 6 |
+
license: Apache 2.0
|
| 7 |
+
datasets:
|
| 8 |
+
- Custom Dataset (Dronealexa)
|
| 9 |
+
---------------
|
| 10 |
+
|
| 11 |
+
Model Card: NLP-Based Chatbot
|
| 12 |
+
|
| 13 |
+
----------Overview---------
|
| 14 |
+
|
| 15 |
+
The NLP-Based Chatbot is designed to explore Science & Technology topics. It utilizes a combination of semantic search and summarization techniques to provide relevant and concise responses to user queries.
|
| 16 |
+
|
| 17 |
+
Model Details
|
| 18 |
+
|
| 19 |
+
- Model Name: NLP-Based Chatbot
|
| 20 |
+
- Model Type: Natural Language Processing (NLP) Chatbot
|
| 21 |
+
- Framework: Gradio Blocks Interface, spaCy, Transformers
|
| 22 |
+
|
| 23 |
+
--------Components------------
|
| 24 |
+
|
| 25 |
+
1. Semantic Search
|
| 26 |
+
|
| 27 |
+
The chatbot employs semantic search to retrieve relevant information from a preprocessed dataset (Dronealexa.csv). The search is based on a TF-IDF vectorizer and cosine similarity calculations.
|
| 28 |
+
|
| 29 |
+
2. Summarization
|
| 30 |
+
|
| 31 |
+
A summarization pipeline is used to generate concise summaries of the retrieved information. The Hugging Face Transformers library is utilized for summarization tasks.
|
| 32 |
+
|
| 33 |
+
3. Custom Embeddings
|
| 34 |
+
|
| 35 |
+
The model incorporates custom text embeddings using spaCy and pre-trained word embeddings. These embeddings enhance the understanding of user queries and contribute to the semantic search.
|
| 36 |
+
|
| 37 |
+
4. Gradio Blocks Interface
|
| 38 |
+
|
| 39 |
+
The chatbot's frontend is built using Gradio Blocks Interface, providing an interactive and user-friendly platform for users to input queries and receive responses.
|
| 40 |
+
|
| 41 |
+
5. Model Card Generation
|
| 42 |
+
|
| 43 |
+
The model card generation involves constructing prompts based on search results and utilizing a summarization pipeline to produce model card content.
|
| 44 |
+
|
| 45 |
+
-----------Intended Use--------------
|
| 46 |
+
|
| 47 |
+
The NLP-Based Chatbot is intended for users interested in exploring Science & Technology topics. It can be used to obtain information from the provided dataset, and users are encouraged to provide feedback for continuous improvement.
|
| 48 |
+
|
| 49 |
+
---------------Training Data----------
|
| 50 |
+
|
| 51 |
+
The model is trained on a custom dataset (Dronealexa.csv) containing Science & Technology-related information. The dataset has been preprocessed to handle missing values and ensure efficient semantic search.
|
| 52 |
+
|
| 53 |
+
------------Evaluation Metrics---------------
|
| 54 |
+
|
| 55 |
+
- Semantic Search: TF-IDF Vectorizer, Cosine Similarity
|
| 56 |
+
- Summarization: Hugging Face Transformers Pipeline
|
| 57 |
+
|
| 58 |
+
----------------Ethical Considerations---------------------
|
| 59 |
+
|
| 60 |
+
The chatbot aims to provide accurate and relevant information. However, users are advised to critically evaluate the responses and understand that the model's knowledge is based on the training data.
|
| 61 |
+
|
| 62 |
+
-----------------Usage Instructions----------------------
|
| 63 |
+
|
| 64 |
+
1. Input your query in the provided textbox.
|
| 65 |
+
2. Click the "Send" button to receive a response.
|
| 66 |
+
3. Optionally, submit feedback using the "Submit Feedback" button.
|
| 67 |
+
|
| 68 |
+
--------------License--------------------
|
| 69 |
+
|
| 70 |
+
This model is released under the Apache 2.0 License.
|
| 71 |
+
|
| 72 |
+
--------------Contact Information---------------------
|
| 73 |
+
|
| 74 |
+
For inquiries or issues, please contact varsagupta07@gmail.com.
|
| 75 |
+
|