Spaces:

OnurKerimoglu
/

rag_chat

Sleeping

App Files Files Community

OnurKerimoglu commited on May 26, 2025

Commit

8b78e2a

1 Parent(s): 46dfa3e

updated notebooks: rag_chatbot, chatbot_agentic_prebuilt

Browse files

Files changed (2) hide show

notebooks/chatbot_agentic_prebuilt.ipynb +6 -28
notebooks/rag_chatbot.ipynb +18 -15

notebooks/chatbot_agentic_prebuilt.ipynb CHANGED Viewed

@@ -17,23 +17,6 @@
     "from langchain_core.tools import tool"
    ]
   },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "@tool\n",
-    "def tool_mult(a: int, b: int) -> int:\n",
-    "    'Multiplies two given numbers'\n",
-    "    return a * b\n",
-    "\n",
-    "@tool\n",
-    "def tool_add(a:int, b:int) -> int:\n",
-    "    'Adds two given numbers'\n",
-    "    return a + b"
-   ]
-  },
   {
    "cell_type": "code",
    "execution_count": null,
@@ -49,8 +32,11 @@
     "    api_wrapper=WikipediaAPIWrapper(\n",
     "        top_k_results=2,\n",
     "        doc_content_chars_max=500))\n",
-    "\n",
-    "tools = [tool_search, tool_finance, tool_wiki, tool_mult, tool_add]\n",
     "agent_executor = create_react_agent(\n",
     "    llm,\n",
     "    tools,\n",
@@ -66,19 +52,11 @@
    "source": [
     "# use the agent\n",
     "config = {'configurable': {'thread_id': '1'}}\n",
-    "\n",
     "user_input = input(\"User: \")\n",
-    "\n",
     "for chunk in agent_executor.stream(\n",
     "    {\"messages\": [HumanMessage(content=user_input)]},\n",
     "     config):\n",
-    "    print(chunk)\n",
-    "\n",
-    "# response = agent_executor.invoke(\n",
-    "#     input={\"messages\": [HumanMessage(content=user_input)]},\n",
-    "#     config=config)\n",
-    "# for chunk in response['messages']:\n",
-    "#     print(chunk)"
    ]
   }
  ],

     "from langchain_core.tools import tool"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
     "    api_wrapper=WikipediaAPIWrapper(\n",
     "        top_k_results=2,\n",
     "        doc_content_chars_max=500))\n",
+    "@tool\n",
+    "def tool_mult(a: int, b: int) -> int:\n",
+    "    'Multiplies two given numbers'\n",
+    "    return a * b\n",
+    "tools = [tool_search, tool_finance, tool_wiki, tool_mult]\n",
     "agent_executor = create_react_agent(\n",
     "    llm,\n",
     "    tools,\n",
    "source": [
     "# use the agent\n",
     "config = {'configurable': {'thread_id': '1'}}\n",
     "user_input = input(\"User: \")\n",
     "for chunk in agent_executor.stream(\n",
     "    {\"messages\": [HumanMessage(content=user_input)]},\n",
     "     config):\n",
+    "    print(chunk)"
    ]
   }
  ],

notebooks/rag_chatbot.ipynb CHANGED Viewed

@@ -1,19 +1,25 @@
 {
  "cells": [
-  {
-   "cell_type": "markdown",
-   "metadata": {
-    "id": "d0jr_Q-wl6-0"
-   },
-   "source": [
-    "# RAG - Chatbot"
-   ]
-  },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "In this notebook I will demonstrate how to build the backend of a RAG-application, that will allow users to interact with uploaded pdf documents and provided URL's. What's special here is that the resulting solution is entirely free (subject to HuggingFace inference API rate limits).\n",
     "\n",
     "Note: the code snippets below have been copied and simplified from my original code [here](https://github.com/OnurKerimoglu/chat_with_docs/blob/main/src/rag.py), which is in turn deployed to HuggingFace space [here](https://huggingface.co/spaces/OnurKerimoglu/rag_chat), which may well be sleeping due to inactivity (don't hesitate to wake it up!)"
    ]
@@ -22,11 +28,8 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Let's first import all the packages that will be needed. Here, we will use:\n",
-    "- LLM: [zephyr-7b-alpha](https://huggingface.co/HuggingFaceH4/zephyr-7b-alpha) through [Hugging Face serverless Inference API](https://huggingface.co/docs/api-inference/en/index)\n",
-    "- Embeddings: [HF Sentence Transformers all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)\n",
-    "- Vectorstore: [FAISS](https://faiss.ai/index.html)\n",
-    "- For glueing them all: [LangChain (v0.3)](https://python.langchain.com/docs/versions/v0_3/)"
    ]
   },
   {

 {
  "cells": [
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
+    "# RAG - Chatbot\n",
+    "\n",
+    "## Background\n",
+    "\n",
+    "In this notebook I will demonstrate how to build the backend of a RAG-Chatbot, that will allow users to interact with uploaded pdf documents and provided URL's. \n",
+    "\n",
+    "The critical advantage of a RAG-chatbot, in comparison to a standard chatbot is the retrieval of best matching chunks of information provided by the user, and amendment of these information as context to the original question of the user, as illustrated here:\n",
+    "![RAG_Chatbot_Flowchart](RAG_Chatbot_transpBG.drawio.png)\n",
+    "\n",
+    "This technique therefore provides a feasible alternative to the expensive process of  fine-tuning an LLM with information that were not available during the pre-training phase.\n",
+    "\n",
+    "In this implementation, we will use:\n",
+    "- LLM: [zephyr-7b-alpha](https://huggingface.co/HuggingFaceH4/zephyr-7b-alpha) through [Hugging Face serverless Inference API](https://huggingface.co/docs/api-inference/en/index)\n",
+    "- Embeddings: [HF Sentence Transformers all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)\n",
+    "- Vectorstore: [FAISS](https://faiss.ai/index.html)\n",
+    "- For glueing them all: [LangChain (v0.3)](https://python.langchain.com/docs/versions/v0_3/)\n",
     "\n",
     "Note: the code snippets below have been copied and simplified from my original code [here](https://github.com/OnurKerimoglu/chat_with_docs/blob/main/src/rag.py), which is in turn deployed to HuggingFace space [here](https://huggingface.co/spaces/OnurKerimoglu/rag_chat), which may well be sleeping due to inactivity (don't hesitate to wake it up!)"
    ]
    "cell_type": "markdown",
    "metadata": {},
    "source": [
+    "## Building the Bot\n",
+    "Let's first import all the packages that will be needed. Here, we will use:"
    ]
   },
   {