Daniel kiani commited on
Commit
8fb7841
·
verified ·
1 Parent(s): ff8ae15

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -14,7 +14,7 @@ app_file: scripts/app.py
14
 
15
  An intelligent, multi-modal customer service agent built with a Retrieval-Augmented Generation (RAG) pipeline. This agent can understand user sentiment, retrieve relevant information from a knowledge base, and provide empathetic, context-aware responses in both text and voice.
16
 
17
- the gradio demo can be found [Here](https://huggingface.co/datasets/MakTek/Customer_support_faqs_dataset)
18
 
19
  ![Gradio](assets/gradio.png)
20
 
@@ -89,6 +89,8 @@ A key decision in this project was selecting the right LLM for response generati
89
 
90
  While this project is a fully functional proof-of-concept, there are several ways it could be enhanced for a production environment:
91
 
 
 
92
  - **📈 Scale the LLM**: For even higher quality responses and more nuanced conversations, we could upgrade to a much larger model (e.g., Llama 3, Mistral Large). This would require a more powerful GPU for inference to maintain an acceptable response time.
93
 
94
  - **🎯 Customize the Knowledge Base**: Instead of a generic FAQ dataset [(MakTek/Customer_support_faqs_dataset)](https://huggingface.co/datasets/MakTek/Customer_support_faqs_dataset), the agent could be provided with a company's internal documentation, product manuals, or past support tickets. This would make it a highly specialized and valuable internal tool.
 
14
 
15
  An intelligent, multi-modal customer service agent built with a Retrieval-Augmented Generation (RAG) pipeline. This agent can understand user sentiment, retrieve relevant information from a knowledge base, and provide empathetic, context-aware responses in both text and voice.
16
 
17
+ the gradio demo can be found [Here](https://huggingface.co/spaces/Deathshot78/CustomerServiceAgent)
18
 
19
  ![Gradio](assets/gradio.png)
20
 
 
89
 
90
  While this project is a fully functional proof-of-concept, there are several ways it could be enhanced for a production environment:
91
 
92
+ - **🤖 RLHF-lite for Continuous Improvement**: Extend the agent with reinforcement learning from human feedback (RLHF) using Hugging Face’s TRL library and PPO. This would allow the model to learn from thumbs-up/down feedback or simulated reward signals, improving response quality, politeness, and relevance over time.
93
+
94
  - **📈 Scale the LLM**: For even higher quality responses and more nuanced conversations, we could upgrade to a much larger model (e.g., Llama 3, Mistral Large). This would require a more powerful GPU for inference to maintain an acceptable response time.
95
 
96
  - **🎯 Customize the Knowledge Base**: Instead of a generic FAQ dataset [(MakTek/Customer_support_faqs_dataset)](https://huggingface.co/datasets/MakTek/Customer_support_faqs_dataset), the agent could be provided with a company's internal documentation, product manuals, or past support tickets. This would make it a highly specialized and valuable internal tool.