Spaces:

akshil-jain
/

Video-Transcript-Chatbot

Paused

Video-Transcript-Chatbot / README.md

Update README.md

df453a5 verified 6 months ago

2.3 kB

A newer version of the Gradio SDK is available: 6.2.0

Upgrade

metadata

title: Video Transcript Chatbot
emoji: 🎥
colorFrom: yellow
colorTo: indigo
sdk: gradio
python_version: '3.10'

Video Transcript Chatbot

A beginner-friendly Gradio app that turns any YouTube video into a conversational chatbot using LangChain and Hugging Face Inference API.

Dynamic Video Input: Paste a full YouTube URL or raw video ID.
Embedding Model Selection: Pick any HF embedding model (default: sentence-transformers/all-MiniLM-L6-v2).
LLM Model Selection: Choose any HF text-generation model (default: meta-llama/Llama-3.1-8B-Instruct).
Secure Token Entry: You must enter your own HF API token at runtime—no hard-coded defaults.
Conversational Memory: Multi-turn chat history is preserved.
Retrieval-Augmented Generation: Uses FAISS + transcript context to ground answers.

Python 3.8+
Hugging Face API Token with Inference access:
https://huggingface.co/settings/tokens
Git (for cloning the repo)

Clone the repo

git clone https://github.com/<your-username>/yt-rag-chatbot.git
cd yt-rag-chatbot

(Optional) Create a virtual environment

python -m venv venv
source venv/bin/activate    # macOS/Linux
venv\Scripts\activate       # Windows

Install dependencies

python -m venv venv
source venv/bin/activate    # macOS/Linux
venv\Scripts\activate       # Windows

Usage

Customization

Default Models: Edit the default values for embedding_model_input and llm_model_input in app.py.
Retrieval Size: Change the k value in the retriever configuration:
```
retriever = vector_store.as_retriever(search_kwargs={'k': 4})
```